WO2020051852A1 - Method for recording and displaying information in communication process, and terminals - Google Patents

Method for recording and displaying information in communication process, and terminals Download PDF

Info

Publication number
WO2020051852A1
WO2020051852A1 PCT/CN2018/105571 CN2018105571W WO2020051852A1 WO 2020051852 A1 WO2020051852 A1 WO 2020051852A1 CN 2018105571 W CN2018105571 W CN 2018105571W WO 2020051852 A1 WO2020051852 A1 WO 2020051852A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice data
terminal
interface
text
information
Prior art date
Application number
PCT/CN2018/105571
Other languages
French (fr)
Chinese (zh)
Inventor
王骅
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Priority to PCT/CN2018/105571 priority Critical patent/WO2020051852A1/en
Priority to CN201880090760.2A priority patent/CN111819830B/en
Publication of WO2020051852A1 publication Critical patent/WO2020051852A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/725Cordless telephones

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Telephone Function (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

Disclosed are a method for recording and displaying information in a communication process, and terminals, which relate to the technical field of communications and can improve the relevance between speech data recorded during the process of speech communication and a call record of the speech communication. The specific solution involves: during the process of a first terminal performing speech communication with a second terminal, the first terminal identifying speech data captured by a microphone of the first terminal; if text corresponding to first speech data captured by the microphone matches a preset startup word, the first terminal starting to record speech data of the second terminal; upon completion of speech data recording, the first terminal displaying a first interface comprising the text corresponding to the speech data; and the first terminal receiving a first operation of a user on the text corresponding to the speech data, and playing the speech data in response to the first operation.

Description

一种通信过程中信息的记录及显示方法及终端Method and terminal for recording and displaying information in communication process 技术领域Technical field
本申请实施例涉及通信技术领域,尤其涉及一种通信过程中信息的记录及显示方法及终端。Embodiments of the present application relate to the field of communication technologies, and in particular, to a method and terminal for recording and displaying information during communication.
背景技术Background technique
随着电子技术的发展,电子终端(如手机)的功能越来越多,用户对手机的依赖性也越来越强。例如,手机不仅可以作为一种通信或者娱乐工具,还可以包括备忘录功能和录音机,即用户可以在手机的备忘录中记录一些文字或者图片信息,在手机的录音机中记录一些语音信息。With the development of electronic technology, electronic terminals (such as mobile phones) have more and more functions, and users are increasingly dependent on mobile phones. For example, the mobile phone can not only be used as a communication or entertainment tool, but also include a memo function and a voice recorder, that is, the user can record some text or picture information in the mobile phone's memo and record some voice information in the mobile phone's recorder.
为了方便用户在使用手机进行语音通信的过程中,可以记录与本次语音通信相关的信息,通常情况下可以在手机的语音通信界面集成备忘录和录音机的入口(如图1所示,手机100的语音通信界面101中包括备忘录的开启按钮103和录音机的开启按钮102),使得用户可以在进行语音通信的过程中,直接通过该入口启动备忘录或者录音机,通过备忘录或者录音机记录与本次语音通信相关的信息。In order to facilitate the use of mobile phones for voice communication, users can record information related to this voice communication. Generally, the memo and the entrance of the recorder can be integrated in the voice communication interface of the mobile phone (as shown in Figure 1, the mobile phone 100 The voice communication interface 101 includes the start button 103 of the memo and the start button 102 of the recorder, so that the user can directly start the memo or recorder through the entrance during the voice communication process, and record the relevant voice communication through the memo or recorder. Information.
但是,在上述语音通话结束后,如果用户想要查看通过备忘录或者录音机记录记录的信息,该用户还需要启动手机中的备忘录或者收音机才可以查看对应的信息。并且,用户在查看备忘录或者收音机记录的信息时,可能需要翻看或者收听多项记录,才可以找到与本次语音通话相关的信息,用户体验不佳。However, after the above voice call ends, if the user wants to view the information recorded through the memo or recorder record, the user also needs to activate the memo or radio in the mobile phone to view the corresponding information. In addition, when viewing the information recorded by the memo or the radio, the user may need to browse or listen to multiple records to find information related to the voice call, and the user experience is poor.
发明内容Summary of the Invention
本申请实施例提供一种通信过程中信息的记录及显示方法及终端,可以提高在进行语音通信的过程中录制的语音数据与该语音通信的通话记录的关联性。The embodiments of the present application provide a method and a terminal for recording and displaying information during a communication process, which can improve the correlation between the voice data recorded during the voice communication and the call record of the voice communication.
第一方面,本申请实施例提供一种通信过程中信息的记录及显示方法,可以应用于第一终端与第二终端进行语音通信的过程中。第一终端可以在第一汇总段与第二终端进行语音通信的过程中,识别第一终端的麦克风捕获的语音数据;若麦克风捕获的第一语音数据对应的文本与预置启动词匹配,则第一终端开始录制第二终端的语音数据;第二终端的语音数据是由接收自第二终端的音频电信号转换而来的;在语音数据录制结束后,第一终端显示包括录制的语音数据对应的文本的第一界面;第一终端接收用户对语音数据对应的文本的第一操作,响应于第一操作播放录制的语音数据。In a first aspect, an embodiment of the present application provides a method for recording and displaying information during a communication process, which can be applied to a process in which a first terminal performs voice communication with a second terminal. The first terminal may identify the voice data captured by the microphone of the first terminal during the voice communication between the first summary segment and the second terminal; if the text corresponding to the first voice data captured by the microphone matches the preset start word, then The first terminal starts recording the voice data of the second terminal; the voice data of the second terminal is converted from the audio and electrical signals received from the second terminal; after the voice data recording ends, the first terminal displays the recorded voice data The first interface of the corresponding text; the first terminal receives the user's first operation on the text corresponding to the voice data, and plays the recorded voice data in response to the first operation.
本申请实施例中,第一终端与第二终端进行语音通信的过程中,当第一终端的麦克风捕获的第一语音数据对应的文本与预置启动词文本匹配时,第一终端可以自动开始录制语音通信对应的语音数据。换言之,第一终端在接收用户发出的语音命令(即文本与预置启动词文本匹配的第一语音数据)后,便可以自动开始录制语音通信对应的语音数据。第一终端可以在语音数据录制结束后,显示包括语音数据对应的文本的第一界面。即第一终端可以直观的向用户展示第一终端录制的语音数据对应的文本信息。并且,第一终端可以响应于用户对语音数据对应的文本的第一操作播放语音数据, 提高了文本信息与语音数据的关联性。In the embodiment of the present application, during the voice communication between the first terminal and the second terminal, when the text corresponding to the first voice data captured by the microphone of the first terminal matches the text of the preset start word, the first terminal may automatically start Record the voice data corresponding to the voice communication. In other words, after receiving the voice command (that is, the first voice data whose text matches the text of the preset start word) issued by the user, the first terminal can automatically start recording the voice data corresponding to the voice communication. The first terminal may display a first interface including text corresponding to the voice data after the voice data recording ends. That is, the first terminal can intuitively display the text information corresponding to the voice data recorded by the first terminal to the user. In addition, the first terminal may play the voice data in response to the user's first operation on the text corresponding to the voice data, thereby improving the relevance of the text information and the voice data.
综上所述,第一终端响应于用户在语音通话过程中发出的语音命令,可以自动录制对应的语音数据。通过本方案,使得终端更加智能化,并且提高了终端与用户的交互性能,提升了用户体验。In summary, the first terminal may automatically record corresponding voice data in response to a voice command issued by a user during a voice call. This solution makes the terminal more intelligent, improves the interaction performance between the terminal and the user, and improves the user experience.
结合第一方面,在一种可能的设计方式中,第一终端可以在语音数据录制结束后,自动显示所述第一界面。或者,第一终端可以响应于语音通信结束,显示第一界面;其中,语音通信结束时语音数据录制已经结束,或者语音通信结束时第一终端结束录制语音数据。或者,第一终端可以在语音数据录制结束后,响应于用户输入第二操作,显示所述第一界面。该第二操作用于指示第一终端显示第一终端的通话记录界面。该第一界面为通话记录界面。该通话记录界面包括上述语音通信的通话记录项。该通话记录项用于记录上述语音通信的通话记录信息。该通话记录项中包括录制的语音数据对应的文本。或者,第一终端可以在语音数据录制结束后,响应于用户对通话记录界面中语音通话的通话记录项的第三操作,显示第一界面。该第三操作用于指示第一终端显示语音通信的记录详情界面。该第一界面是记录详情界面。该记录详情界面用于显示语音通信的通话记录信息。该记录详情界面中包括录制的语音数据对应的文本。With reference to the first aspect, in a possible design manner, the first terminal may automatically display the first interface after voice data recording ends. Alternatively, the first terminal may display the first interface in response to the end of the voice communication, where the voice data recording has ended when the voice communication ends, or the first terminal ends the voice data recording when the voice communication ends. Alternatively, after the voice data recording ends, the first terminal may display the first interface in response to the user inputting a second operation. This second operation is used to instruct the first terminal to display the call history interface of the first terminal. The first interface is a call history interface. The call record interface includes the call record items of the voice communication. The call record item is used to record the call record information of the voice communication. The call log entry includes text corresponding to the recorded voice data. Alternatively, after the voice data recording ends, the first terminal may display the first interface in response to the user's third operation on the call record item of the voice call in the call record interface. This third operation is used to instruct the first terminal to display a record detail interface of voice communication. The first interface is a record detail interface. The record details interface is used to display call record information for voice communication. The record details interface includes text corresponding to the recorded voice data.
结合第一方面,在另一种可能的设计方式中,第一终端不仅可以录制第二终端的语音数据,还可以录制第一终端的麦克风捕获的语音数据。本申请实施例的方法还可以包括:若麦克风捕获的第一语音数据对应的文本与所述预置启动词匹配,则第一终端开始录制麦克风捕获的语音数据。本申请实施例中,第一终端不仅可以录制第二终端的语音数据,还可以录制第一终端的麦克风捕获的语音数据。即第一终端可以录制主叫方与被叫方的对话内容。With reference to the first aspect, in another possible design manner, the first terminal can record not only the voice data of the second terminal, but also the voice data captured by the microphone of the first terminal. The method in the embodiment of the present application may further include: if the text corresponding to the first voice data captured by the microphone matches the preset start word, the first terminal starts recording the voice data captured by the microphone. In the embodiment of the present application, the first terminal can record not only the voice data of the second terminal but also the voice data captured by the microphone of the first terminal. That is, the first terminal can record the conversation between the calling party and the called party.
结合第一方面,在另一种可能的设计方式中,上述录制的语音数据对应的文本包括至少两段文本信息。上述录制的语音数据包括至少两段语音数据。该至少两段文本信息与至少两段语音数据一一对应。第一终端可以接收用户对至少两段文本信息中的第一文本信息的第一操作;响应于第一操作,第一终端播放第一语音数据段。第一文本信息为至少两段文本信息中的一段文本信息。第一文本信息与第一语音数据段对应。With reference to the first aspect, in another possible design manner, the text corresponding to the recorded voice data includes at least two pieces of text information. The recorded voice data includes at least two pieces of voice data. The at least two pieces of text information correspond to at least two pieces of voice data. The first terminal may receive a user's first operation on the first text information in at least two pieces of text information; in response to the first operation, the first terminal plays the first voice data segment. The first text information is a piece of text information among at least two pieces of text information. The first text information corresponds to a first voice data segment.
其中,录制的语音数据对应的文本可以包括至少两段文本信息。第一终端可以接收用户对至少两段文本信息中任一段文本信息的第一操作,播放对应的语音数据段。即用户可以选择性的控制第一终端播放第一终端录制的至少两段语音数据中的任一语音数据段。The text corresponding to the recorded voice data may include at least two pieces of text information. The first terminal may receive a user's first operation on at least two pieces of text information and play a corresponding voice data segment. That is, the user can selectively control the first terminal to play any one of the at least two pieces of voice data recorded by the first terminal.
结合第一方面,在另一种可能的设计方式中,第一终端将录制的语音数据转换为文本信息时,转换得到的文本信息与第一终端录制的语音数据的文本可能并不完全一致。即第一终端转换得到的文本信息可能会出现一些错误。基于这种情况,第一终端可以接收用户对第二文本信息的第四操作(即修改操作),该第四操作用于将第二文本信息修改为第三文本信息。第二文本信息为至少两段文本信息中的一段文本信息,第二文本信息与第二语音数据段对应。响应于第四操作,第一终端将第二文本信息修改为第三文本信息。其中,用户可以控制第一终端播放第二文本信息对应的第二语音数据段,对比第一终端播放的第二语音数据段和第一终端显示的第二文本信息。当第二文本信息与第一终端播放的第二语音数据段不一致时,操作第一终端修改第二文本 信息。在修改了第二文本信息后,第一终端可以在第一界面上显示第三文本信息。第一终端接收用户对第三文本信息的第一操作,播放第二语音数据段(即第二文本信息对应的语音数据段)。With reference to the first aspect, in another possible design manner, when the first terminal converts the recorded voice data into text information, the converted text information may not be completely consistent with the text of the voice data recorded by the first terminal. That is, some errors may occur in the text information converted by the first terminal. Based on this situation, the first terminal may receive a fourth operation (ie, a modification operation) performed by the user on the second text information, and the fourth operation is used to modify the second text information into the third text information. The second text information is a piece of text information among at least two pieces of text information, and the second text information corresponds to the second voice data segment. In response to the fourth operation, the first terminal modifies the second text information into the third text information. The user can control the first terminal to play the second voice data segment corresponding to the second text information, and compare the second voice data segment played by the first terminal with the second text information displayed by the first terminal. When the second text information is inconsistent with the second voice data segment played by the first terminal, the first terminal is operated to modify the second text information. After the second text information is modified, the first terminal may display the third text information on the first interface. The first terminal receives a user's first operation on the third text information, and plays a second voice data segment (that is, a voice data segment corresponding to the second text information).
本申请实施例中,第一终端可以响应于用户对文本信息的修改操作,采用修改后的文本信息替换修改前的文本信息。如此,用户则可以根据第一终端中保存的语音数据修改第一终端转换语音数据得到的文本信息,可以修正第一终端转换语音数据得到文本信息时出现的错误。In the embodiment of the present application, the first terminal may respond to a user's operation of modifying text information, and replace the text information before modification with the modified text information. In this way, the user may modify the text information obtained by the first terminal converting the voice data according to the voice data stored in the first terminal, and may correct an error that occurs when the first terminal converts the voice data to obtain the text information.
结合第一方面,在另一种可能的设计方式中,录制的语音数据对应的文本可以包括至少两段文本信息。第一终端在第一界面显示录制的语音数据对应的文本时,可以按照录制文本信息对应的语音数据段的时间先后顺序,以及文本信息对应的语音数据段的来源信息,在第一界面显示至少两段文本信息。其中,来源信息用于指示语音数据段是麦克风捕获的语音数据,或者第二终端的语音数据。With reference to the first aspect, in another possible design manner, the text corresponding to the recorded voice data may include at least two pieces of text information. When the first terminal displays the text corresponding to the recorded voice data on the first interface, the first terminal may display at least the first interface in accordance with the chronological order of the voice data segments corresponding to the recorded text information and the source information of the voice data segments corresponding to the text information. Two text messages. The source information is used to indicate that the voice data segment is voice data captured by a microphone or voice data of a second terminal.
其中,第一终端按照录制文本信息对应的语音数据段的时间先后顺序,以及文本信息对应的语音数据段的来源信息,在第一界面显示至少两段文本信息。这样,可以清楚的向用户展示主叫方与被叫方的对话内容。The first terminal displays at least two pieces of text information on the first interface in accordance with the chronological order of the voice data segments corresponding to the recorded text information and the source information of the voice data segments corresponding to the text information. In this way, the content of the conversation between the calling party and the called party can be clearly displayed to the user.
结合第一方面,在另一种可能的设计方式中,第一终端可以在开始录制语音数据时,向用户提示第一终端开始录制语音数据。具体的,如果麦克风捕获的第一语音数据对应的文本与预置启动词匹配,第一终端则可以发出第一提示信息。该第一提示信息用于提示用户第一终端开始录制语音数据。该第一提示信息为提示音或者振动提示。With reference to the first aspect, in another possible design manner, the first terminal may prompt the user to start recording the voice data when the first terminal starts to record the voice data. Specifically, if the text corresponding to the first voice data captured by the microphone matches a preset start word, the first terminal may send a first prompt message. The first prompt information is used to prompt the user to start recording voice data on the first terminal. The first prompt information is a prompt sound or a vibration prompt.
本申请实施例中,第一终端可以在第一语音数据对应的文本与预置启动词匹配时,即第一终端开始录制语音数据时,发出第一提示信息,以提示用户第一终端开始录制语音数据。如此,用户便可以通过该第一提示信息获知第一终端已经开始录制语音数据,增加了第一终端与用户直接的互动,提升了第一终端与用户的交互性能,提升用户体验。In the embodiment of the present application, the first terminal may send a first prompt message when the text corresponding to the first voice data matches a preset start word, that is, when the first terminal starts recording the voice data, to prompt the user to start recording on the first terminal. Voice data. In this way, the user can know through the first prompt information that the first terminal has started recording voice data, which increases the direct interaction between the first terminal and the user, improves the interaction performance between the first terminal and the user, and improves the user experience.
结合第一方面,在另一种可能的设计方式中,在第一终端显示第一界面之前,本申请实施例的方法还可以包括:在录制语音数据过程中,若麦克风捕获的第二语音数据对应的文本与预置结束醒词匹配,则第一终端停止录制语音数据。With reference to the first aspect, in another possible design manner, before the first terminal displays the first interface, the method in the embodiment of the present application may further include: during the recording of the voice data, if the second voice data captured by the microphone is If the corresponding text matches the preset end word, the first terminal stops recording voice data.
其中,在第一终端录制语音数据的过程中,当第一终端的麦克风捕获的第二语音数据对应的文本与预置结束词文本匹配时,第一终端可以自动停止录制语音数据。换言之,第一终端在接收用户发出的语音命令(即文本与预置结束词文本匹配的第二语音数据)后,便可以自动停止录制语音数据。In the process of recording the voice data by the first terminal, when the text corresponding to the second voice data captured by the microphone of the first terminal matches the text of the preset ending word, the first terminal may automatically stop recording the voice data. In other words, the first terminal can automatically stop recording voice data after receiving a voice command (that is, second voice data whose text matches a preset end word text) issued by the user.
综上所述,第一终端响应于用户在语音通话过程中发出的语音命令,可以自动录制和停止录制语音数据。通过本方案,使得终端更加智能化,并且提高了终端与用户的交互性能,提升了用户体验。In summary, the first terminal can automatically record and stop recording voice data in response to a voice command issued by a user during a voice call. This solution makes the terminal more intelligent, improves the interaction performance between the terminal and the user, and improves the user experience.
结合第一方面,在另一种可能的设计方式中,第一终端还可以在停止录制语音数据时,向用户提示第一终端停止录制语音数据。具体的,在录制语音数据过程中,若麦克风捕获的第二语音数据对应的文本与预置结束醒词匹配,第一终端可以发出第二提示信息。该第二提示信息用于提示用户第一终端停止录制语音数据。该第二提示信息为提示音或者振动提示。With reference to the first aspect, in another possible design manner, when the first terminal stops recording voice data, the user may be prompted by the first terminal to stop recording voice data. Specifically, during the recording of the voice data, if the text corresponding to the second voice data captured by the microphone matches a preset end word, the first terminal may send a second prompt message. The second prompt information is used to prompt the user to stop recording the voice data by the first terminal. The second prompt information is a prompt sound or a vibration prompt.
本申请实施例中,第一终端可以在第二语音数据对应的文本与预置启动词匹配时,即第一终端停止录制语音数据时,发出第二提示信息,以提示用户第一终端停止录制语音数据。如此,用户便可以通过该第而提示信息获知第一终端已经停止录制语音数据,增加了第一终端与用户直接的互动,提升了第一终端与用户的交互性能,提升用户体验。In the embodiment of the present application, the first terminal may send a second prompt message when the text corresponding to the second voice data matches a preset start word, that is, when the first terminal stops recording voice data, to prompt the user to stop recording by the first terminal. Voice data. In this way, the user can learn through the first prompt message that the first terminal has stopped recording voice data, which increases the direct interaction between the first terminal and the user, improves the interaction performance between the first terminal and the user, and improves the user experience.
结合第一方面,在另一种可能的设计方式中,在第一终端使用扬声器播放语音数据的情况下,当上述第一提示信息或者第一提示信息为提示音时,第一终端发出的提示音可能会被第一终端的麦克风捕获到。在这种情况下,第一终端在第一语音数据对应的文本与预置启动词匹配时,如果确定第一终端使用扬声器播放语音数据,第一终端则可以根据扬声器播放的语音数据对麦克风采集的语音数据进行回声抑制。With reference to the first aspect, in another possible design manner, when the first terminal uses a speaker to play voice data, when the first prompt information or the first prompt information is a prompt tone, the prompt issued by the first terminal The tone may be captured by the microphone of the first terminal. In this case, when the first terminal matches the text corresponding to the first voice data with the preset start word, if it is determined that the first terminal uses the speaker to play the voice data, the first terminal may collect the microphone according to the voice data played by the speaker. Voice data for echo suppression.
这样,回声抑制后的语音数据中则不会包括上述提示音。第一终端向第二终端发送的语音数据为回声抑制后的语音数据。第一终端向第二终端发送的语音数据则不会包括上述提示音。第二终端的用户则不会听到上述提示音。In this way, the voice data after the echo suppression will not include the above-mentioned prompt tone. The voice data sent by the first terminal to the second terminal is voice data after echo suppression. The voice data sent by the first terminal to the second terminal does not include the above-mentioned prompt tone. The user of the second terminal will not hear the above-mentioned prompt tone.
进一步的,第一终端在播放第二提示信息后,可以停止对麦克风采集的语音数据进行回声抑制。这样,可以避免第一终端持续执行回声抑制所产生的功耗,延长第一终端的续航时间。Further, after playing the second prompt information, the first terminal may stop performing echo suppression on the voice data collected by the microphone. In this way, the power consumption caused by the first terminal continuously performing echo suppression can be avoided, and the battery life of the first terminal can be extended.
结合第一方面,在另一种可能的设计方式中,第一终端可以按照录制每段语音数据的时间先后顺序,以及每段语音数据的来源信息,保存至少两段语音数据。来源信息用于指示语音数据段是所述麦克风捕获,或者所述来源信息用于指示语音数据段是所述第二终端的语音数据。With reference to the first aspect, in another possible design manner, the first terminal may save at least two pieces of voice data in accordance with the chronological order of recording each piece of voice data and the source information of each piece of voice data. The source information is used to indicate that the voice data segment is captured by the microphone, or the source information is used to indicate that the voice data segment is the voice data of the second terminal.
结合第一方面,在另一种可能的设计方式中,上述第一界面中不仅可以包括至少两个文本信息,还可以包括至少两个播放器插件。至少两个播放插件用于播放至少两段语音数据,至少两个播放插件与至少两段文本信息一一对应。With reference to the first aspect, in another possible design manner, the above-mentioned first interface may include not only at least two text information but also at least two player plug-ins. At least two playback plug-ins are used to play at least two pieces of voice data, and at least two playback plug-ins correspond to at least two pieces of text information one by one.
第二方面,本申请实施例提供一种终端,该终端是第一终端。该终端包括:一个或多个处理器、存储器、触摸屏、麦克风、通信接口、受话器和扬声器;存储器、显示器、通信接口与处理器耦合;触摸屏用于显示处理器生成的图像;麦克风用于捕获语音数据;存储器用于存储计算机程序代码;计算机程序代码包括计算机指令,当处理器执行上述计算机指令时,处理器,用于通过通信接口与第二终端进行语音通信;识别麦克风捕获的语音数据;若识别到麦克风捕获的第一语音数据对应的文本与预置启动词匹配,则开始录制第二终端的语音数据,第二终端的语音数据是由接收自第二终端的音频电信号转换而来的;处理器,还用于在存储器中保存录制的语音数据;处理器,还用于在语音数据录制结束后,控制触摸屏显示第一界面;第一界面包括录制的语音数据对应的文本;处理器,还用于接收用户对触摸屏显示的文本的第一操作;响应于第一操作控制受话器或者扬声器播放语音数据。In a second aspect, an embodiment of the present application provides a terminal, and the terminal is a first terminal. The terminal includes: one or more processors, memory, touch screen, microphone, communication interface, receiver and speaker; the memory, display, communication interface is coupled to the processor; the touch screen is used to display images generated by the processor; the microphone is used to capture speech Data; the memory is used to store computer program code; the computer program code includes computer instructions, and when the processor executes the computer instructions, the processor is configured to perform voice communication with the second terminal through the communication interface; identify the voice data captured by the microphone; It is recognized that the text corresponding to the first voice data captured by the microphone matches the preset start word, and then starts recording the voice data of the second terminal. The voice data of the second terminal is converted from the audio and electrical signals received from the second terminal. The processor is further configured to store the recorded voice data in the memory; the processor is further configured to control the touch screen to display a first interface after the voice data recording is completed; the first interface includes text corresponding to the recorded voice data; the processor , Also used to receive the user's display of the touch screen This first operation; in response to the first operation control receiver or a speaker to play the voice data.
结合第二方面,在一种可能的设计方式中,上述处理器,用于在语音数据录制结束后,控制触摸屏显示第一界面,包括:处理器,用于在语音数据录制结束后,自动控制触摸屏显示第一界面;或者,处理器,用于响应于语音通信结束,控制触摸屏显示第一界面,语音通信结束时语音数据录制结束;或者,处理器,用于在语音数据录制结束后,响应于用户输入第二操作,控制触摸屏显示第一界面,第二操作用于指示 终端显示终端的通话记录界面,第一界面为通话记录界面,通话记录界面包括语音通信的通话记录项,通话记录项用于记录语音通信的通话记录信息,通话记录项中包括录制的语音数据对应的文本;或者,处理器,用于在语音数据录制结束后,响应于用户对通话记录界面中语音通话的通话记录项的第三操作,控制触摸屏显示第一界面,第三操作用于指示终端显示语音通信的记录详情界面,第一界面是记录详情界面,记录详情界面用于显示语音通信的通话记录信息,记录详情界面中包括录制的语音数据对应的文本。With reference to the second aspect, in a possible design manner, the processor is configured to control the touch screen to display the first interface after the voice data recording ends, and includes a processor to automatically control the voice data after the voice data recording ends. The touch screen displays the first interface; or, the processor is configured to respond to the end of the voice communication, and controls the touch screen to display the first interface, and the voice data recording ends when the voice communication ends; or the processor is configured to respond after the voice data recording ends. After the user inputs a second operation, the touch screen is controlled to display the first interface, and the second operation is used to instruct the terminal to display the call history interface of the terminal. The first interface is the call history interface, and the call history interface includes a call history entry and a call history entry for voice communication. It is used to record the call log information of the voice communication, and the call log item includes the text corresponding to the recorded voice data; or, the processor is configured to respond to the user's call record of the voice call in the call record interface after the voice data recording ends. The third operation of the item is to control the touch screen to display the first interface. The operation is used to instruct the terminal to display a record detail interface for voice communication. The first interface is a record detail interface. The record detail interface is used to display call record information for voice communication. The record detail interface includes text corresponding to the recorded voice data.
结合第二方面,在另一种可能的设计方式中,上述处理器,还用于若识别到麦克风捕获的第一语音数据对应的文本与预置启动词匹配,则第一终端开始录制麦克风捕获的语音数据。With reference to the second aspect, in another possible design manner, the processor is further configured to: if it is recognized that the text corresponding to the first voice data captured by the microphone matches the preset start word, the first terminal starts recording the microphone capture Voice data.
结合第二方面,在另一种可能的设计方式中,上述录制的语音数据对应的文本包括至少两段文本信息,语音数据包括至少两段语音数据,至少两段文本信息与至少两段语音数据一一对应。处理器,用于接收用户对触摸屏显示的文本的第一操作,响应于第一操作控制受话器或者扬声器播放语音数据,包括:处理器,用于接收用户对触摸屏显示的第一文本信息的第一操作;响应于第一操作,控制受话器或者扬声器播放第一语音数据段。其中,第一文本信息为至少两段文本信息中的一段文本信息;第一文本信息与第一语音数据段对应。With reference to the second aspect, in another possible design manner, the text corresponding to the recorded voice data includes at least two pieces of text information, the voice data includes at least two pieces of voice data, at least two pieces of text information, and at least two pieces of voice data One-to-one correspondence. The processor is configured to receive a first operation performed by the user on the text displayed on the touch screen, and control the receiver or the speaker to play voice data in response to the first operation. Operation; in response to the first operation, controlling the receiver or speaker to play the first voice data segment. The first text information is a piece of text information in at least two pieces of text information; the first text information corresponds to the first voice data segment.
结合第二方面,在另一种可能的设计方式中,上述处理器,还用于接收用户对触摸屏显示的第二文本信息的第四操作,第四操作用于将第二文本信息修改为第三文本信息;第二文本信息为至少两段文本信息中的一段文本信息,第二文本信息与第二语音数据段对应;响应于第四操作,将第二文本信息修改为第三文本信息;在存储器中保存第三文本信息,以及第三文本信息与第二语音数据段的对应关系。处理器,还用于控制触摸屏在第一界面上显示第三文本信息;接收用户对触摸屏显示的第三文本信息的第一操作,控制受话器或者扬声器播放第二语音数据段。With reference to the second aspect, in another possible design manner, the foregoing processor is further configured to receive a fourth operation performed by the user on the second text information displayed on the touch screen, and the fourth operation is configured to modify the second text information into the first operation. Three text information; the second text information is a piece of text information in at least two pieces of text information, and the second text information corresponds to the second voice data segment; and the second text information is modified into the third text information in response to the fourth operation; The third text information and the correspondence between the third text information and the second voice data segment are stored in the memory. The processor is further configured to control the touch screen to display the third text information on the first interface; receive a first operation of the user on the third text information displayed on the touch screen, and control the receiver or speaker to play the second voice data segment.
结合第二方面,在另一种可能的设计方式中,上述处理器,用于控制触摸屏显示第一界面,包括:处理器,用于控制触摸屏按照录制文本信息对应的语音数据段的时间先后顺序,以及文本信息对应的语音数据段的来源信息,在第一界面显示至少两段文本信息。其中,来源信息用于指示语音数据段是麦克风捕获的语音数据,或者第二终端的语音数据。With reference to the second aspect, in another possible design manner, the processor for controlling the touch screen to display the first interface includes: a processor for controlling the touch screen according to a chronological order of voice data segments corresponding to recorded text information And the source information of the voice data segment corresponding to the text information, at least two pieces of text information are displayed on the first interface. The source information is used to indicate that the voice data segment is voice data captured by a microphone or voice data of a second terminal.
结合第二方面,在另一种可能的设计方式中,上述处理器,还用于若识别到麦克风捕获的第一语音数据对应的文本与预置启动词匹配,则发出第一提示信息,第一提示信息用于提示用户终端开始录制语音数据,第一提示信息为提示音或者振动提示。With reference to the second aspect, in another possible design manner, the processor is further configured to issue a first prompt message if it is recognized that the text corresponding to the first voice data captured by the microphone matches a preset start word, the first A prompt message is used to prompt the user terminal to start recording voice data, and the first prompt message is a prompt sound or a vibration prompt.
结合第二方面,在另一种可能的设计方式中,上述处理器,还用于控制触摸屏显示第一界面之前,在录制语音数据过程中,若识别到麦克风捕获的第二语音数据对应的文本与预置结束醒词匹配,则停止录制语音数据。With reference to the second aspect, in another possible design manner, the processor is further configured to control the touch screen to display the first interface, and during the recording of the voice data, if the text corresponding to the second voice data captured by the microphone is recognized If it matches the preset end word, it will stop recording voice data.
结合第二方面,在另一种可能的设计方式中,上述处理器,还用于在录制语音数据过程中,若识别到麦克风捕获的第二语音数据对应的文本与预置结束醒词匹配,则发出第二提示信息,第二提示信息用于提示用户第一终端停止录制语音数据,第二提示信息为提示音或者振动提示。With reference to the second aspect, in another possible design manner, the foregoing processor is further configured to, during recording voice data, if it is recognized that the text corresponding to the second voice data captured by the microphone matches the preset ending word, Then, a second prompt message is issued. The second prompt message is used to prompt the user to stop recording the voice data at the first terminal, and the second prompt message is a prompt sound or a vibration prompt.
结合第二方面,在另一种可能的设计方式中,上述第一提示信息和第二提示信息为提示音,处理器,还用于在第一语音数据对应的文本与预置启动词匹配时,确定由扬声器播放语音数据;根据扬声器播放的语音数据对麦克风采集的语音数据进行回声抑制;在受话器或者扬声器播放第二提示信息后,停止对麦克风采集的语音数据进行回声抑制。With reference to the second aspect, in another possible design manner, the first prompt information and the second prompt information are prompt sounds, and the processor is further configured to: when the text corresponding to the first voice data matches a preset start word , Determine that the speaker plays the voice data; perform echo suppression on the voice data collected by the microphone according to the voice data played by the speaker; and stop the echo suppression of the voice data collected by the microphone after the receiver or the speaker plays the second prompt message.
结合第二方面,在另一种可能的设计方式中,上述存储器按照录制每段语音数据的时间先后顺序,以及每段语音数据的来源信息,保存至少两段语音数据。来源信息用于指示语音数据段是麦克风捕获,或者来源信息用于指示语音数据段是第二终端的语音数据。With reference to the second aspect, in another possible design manner, the foregoing memory stores at least two pieces of voice data in accordance with the chronological order of recording each piece of voice data and the source information of each piece of voice data. The source information is used to indicate that the voice data segment is captured by a microphone, or the source information is used to indicate that the voice data segment is voice data of the second terminal.
结合第二方面,在另一种可能的设计方式中,上述触摸屏显示的第一界面中还包括至少两个播放器插件,至少两个播放插件与至少两段文本信息一一对应,至少两个播放插件与至少两段语音数据一一对应。处理器,还用于接收用户对至少两个播放器插件中第一播放器插件的点击操作,控制受话器或者扬声器播放第一播放器插件对应的语音数据段。With reference to the second aspect, in another possible design manner, the first interface displayed on the touch screen further includes at least two player plug-ins, and at least two playback plug-ins correspond to at least two pieces of text information, at least two The playback plug-in corresponds to at least two pieces of voice data. The processor is further configured to receive a user's click operation on at least two player plug-ins of the first player plug-in, and control the receiver or speaker to play the voice data segment corresponding to the first player plug-in.
第三方面,本申请实施例提供一种计算机存储介质,所述计算机存储介质包括计算机指令,当所述计算机指令在终端上运行时,使得所述终端执行第一方面及其任一种可能的设计方式所述的通信过程中信息的记录及显示方法。In a third aspect, an embodiment of the present application provides a computer storage medium, where the computer storage medium includes computer instructions, and when the computer instructions are executed on a terminal, the terminal is caused to execute the first aspect and any one of the possible The method of recording and displaying information in the communication process according to the design mode.
第四方面,本申请实施例提供一种计算机程序产品,当所述计算机程序产品在计算机上运行时,使得所述计算机执行第一方面及其任一种可能的设计方式所述的通信过程中信息的记录及显示方法。According to a fourth aspect, an embodiment of the present application provides a computer program product. When the computer program product runs on a computer, the computer is caused to execute the communication process described in the first aspect and any possible design manner. Information recording and display methods.
另外,第二方面及其任一种设计方式所述的终端,以及第三方面所述的计算机存储介质、第四方面所述的计算机程序产品所带来的技术效果可参见上述第一方面及其不同设计方式所带来的技术效果,此处不再赘述。In addition, for the technical effects brought by the terminal described in the second aspect and any one of the design methods, the computer storage medium described in the third aspect, and the computer program product described in the fourth aspect, refer to the foregoing first aspect and The technical effects brought by its different design methods are not repeated here.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1为本申请实施例提供的一种显示界面实例示意图一;FIG. 1 is a first schematic view of a display interface example provided by an embodiment of the present application; FIG.
图2为本申请实施例提供的一种终端的硬件结构示意图;2 is a schematic diagram of a hardware structure of a terminal according to an embodiment of the present application;
图3为本申请实施例提供的一种通信场景实例示意图;FIG. 3 is a schematic diagram of an example communication scenario according to an embodiment of the present application; FIG.
图4为本申请实施例提供的一种通信过程中信息的记录及显示方法流程图一;4 is a first flowchart of a method for recording and displaying information in a communication process according to an embodiment of the present application;
图5为本申请实施例提供的一种显示界面实例示意图二;FIG. 5 is a second schematic diagram of a display interface example provided by an embodiment of the present application; FIG.
图6为本申请实施例提供的一种显示界面实例示意图三;FIG. 6 is a third schematic diagram of a display interface example provided by an embodiment of the present application; FIG.
图7为本申请实施例提供的一种显示界面实例示意图四;FIG. 7 is a fourth schematic view of a display interface example provided by an embodiment of the present application; FIG.
图8为本申请实施例提供的一种显示界面实例示意图五;FIG. 8 is a schematic diagram 5 of an example of a display interface provided by an embodiment of the present application;
图9为本申请实施例提供的一种显示界面实例示意图六;FIG. 9 is a schematic diagram 6 of an example of a display interface according to an embodiment of the present application;
图10A为本申请实施例提供的一种通信过程中信息的记录及显示方法流程图二;10A is a second flowchart of a method for recording and displaying information in a communication process according to an embodiment of the present application;
图10B为本申请实施例提供的一种通信过程中信息的记录及显示方法流程图三;10B is a third flowchart of a method for recording and displaying information in a communication process according to an embodiment of the present application;
图10C为本申请实施例提供的一种显示界面实例示意图七;FIG. 10C is a schematic diagram VII of a display interface example provided by an embodiment of the present application; FIG.
图11为本申请实施例提供的一种显示界面实例示意图八;FIG. 11 is a schematic diagram 8 of an example of a display interface provided by an embodiment of the present application;
图12为本申请实施例提供的一种显示界面实例示意图九;FIG. 12 is a schematic diagram IX of a display interface example provided by an embodiment of the present application;
图13为本申请实施例提供的一种显示界面实例示意图十;FIG. 13 is a schematic diagram 10 of an example of a display interface provided by an embodiment of the present application;
图14为本申请实施例提供的一种终端的结构组成示意图一;14 is a first schematic structural composition diagram of a terminal according to an embodiment of the present application;
图15为本申请实施例提供的一种终端的结构组成示意图二。FIG. 15 is a second schematic structural diagram of a terminal according to an embodiment of the present application.
具体实施方式detailed description
本申请实施例提供一种通信过程中信息的记录及显示方法及终端,可以应用于第一终端与第二终端进行语音通信的过程中。具体的,在第一终端与第二终端进行语音通信的过程中,当第一终端的麦克风捕获的第一语音数据对应的文本与预置启动词文本匹配时,第一终端可以自动开始录制语音通信对应的语音数据。其中,第一终端的麦克风捕获的语音数据为第一终端的用户发出的语音数据。换言之,第一终端在接收第一终端的用户发出的语音命令(即文本与预置启动词文本匹配的第一语音数据)后,可以自动开始录制语音通信对应的语音数据。The embodiments of the present application provide a method and a terminal for recording and displaying information in a communication process, which can be applied to a process in which a first terminal performs voice communication with a second terminal. Specifically, during the voice communication between the first terminal and the second terminal, when the text corresponding to the first voice data captured by the microphone of the first terminal matches the text of the preset start word, the first terminal may automatically start recording the voice Communication corresponding voice data. The voice data captured by the microphone of the first terminal is voice data sent by a user of the first terminal. In other words, after receiving the voice command (ie, the first voice data whose text matches the text of the preset start word) issued by the user of the first terminal, the first terminal can automatically start recording the voice data corresponding to the voice communication.
当然,当第一终端的麦克风捕获的第二语音数据对应的文本与预置结束词文本匹配时,第一终端可以自动停止录制语音数据。Of course, when the text corresponding to the second voice data captured by the microphone of the first terminal matches the text of the preset ending word, the first terminal may automatically stop recording the voice data.
综上所述,在第一终端与第二终端进行语音通信的过程中,第一终端响应于用户的语音命令,可以自动录制和停止录制语音数据。通过本方案,使得终端更加智能化,并且提高了终端与用户的交互性能,提升了用户体验。In summary, during the voice communication between the first terminal and the second terminal, the first terminal can automatically record and stop recording voice data in response to a user's voice command. This solution makes the terminal more intelligent, improves the interaction performance between the terminal and the user, and improves the user experience.
本申请实施例中的终端可以为便携式计算机(如手机)、笔记本电脑、个人计算机(personal computer,PC)、可穿戴电子设备(如智能手表)、平板电脑、增强现实(augmented reality,AR)\虚拟现实(virtual reality,VR)设备、车载电脑等,以下实施例对该终端的具体形式不做特殊限制。The terminal in the embodiment of the present application may be a portable computer (such as a mobile phone), a notebook computer, a personal computer (PC), a wearable electronic device (such as a smart watch), a tablet computer, or augmented reality (AR) \ Virtual reality (VR) equipment, on-board computers, and the like, the following embodiments do not specifically limit the specific form of the terminal.
请参考图2,其示出本申请实施例提供一种终端200的结构框图。其中,终端200可以包括处理器210,外部存储器接口220,内部存储器221,通用串行总线(universal serial bus,USB)接口230,充电管理模块240,电源管理模块241,电池242,天线1,天线2,射频模块250,通信模块260,音频模块270,扬声器270A,受话器270B,麦克风270C,耳机接口270D,传感器模块280,按键290,马达291,指示器292,摄像头293,显示屏294,以及用户标识模块(subscriber identification module,SIM)卡接口295等。其中传感器模块280可以包括压力传感器280A,陀螺仪传感器280B,气压传感器280C,磁传感器280D,加速度传感器280E,距离传感器280F,接近光传感器280G,指纹传感器280H,温度传感器280J,触摸传感器280K,环境光传感器280L,骨传导传感器280M等。Please refer to FIG. 2, which illustrates a structural block diagram of a terminal 200 provided by an embodiment of the present application. The terminal 200 may include a processor 210, an external memory interface 220, an internal memory 221, a universal serial bus (USB) interface 230, a charge management module 240, a power management module 241, a battery 242, an antenna 1, and an antenna. 2. RF module 250, communication module 260, audio module 270, speaker 270A, receiver 270B, microphone 270C, headphone interface 270D, sensor module 280, button 290, motor 291, indicator 292, camera 293, display 294, and user Identification module (subscriber identification module, SIM) card interface 295, etc. The sensor module 280 may include a pressure sensor 280A, a gyro sensor 280B, an air pressure sensor 280C, a magnetic sensor 280D, an acceleration sensor 280E, a distance sensor 280F, a proximity light sensor 280G, a fingerprint sensor 280H, a temperature sensor 280J, a touch sensor 280K, and ambient light. Sensor 280L, bone conduction sensor 280M, etc.
本申请实施例示意的结构并不构成对终端200的限定。可以包括比图示更多或更少的部件,或者组合某些部件,或者拆分某些部件,或者不同的部件布置。图示的部件可以以硬件,软件或软件和硬件的组合实现。The structure illustrated in the embodiment of the present application does not limit the terminal 200. It may include more or fewer parts than shown, or some parts may be combined, or some parts may be split, or different parts may be arranged. The illustrated components can be implemented in hardware, software, or a combination of software and hardware.
处理器210可以包括一个或多个处理单元。例如,处理器210可以包括应用处理器(application processor,AP),调制解调处理器,图形处理器(graphics processing unit,GPU),图像信号处理器(image signal processor,ISP),控制器,存储器,视频编解码器,数字信号处理器(digital signal processor,DSP),基带处理器,和/或神经网络处理器(neural-network processing unit,NPU)等。其中,不同的处理单元可以是独立的器件,也可以集成在一个或多个处理器中。The processor 210 may include one or more processing units. For example, the processor 210 may include an application processor (AP), a modem processor, a graphics processing unit (GPU), an image signal processor (ISP), a controller, and a memory. , Video codec, digital signal processor (DSP), baseband processor, and / or neural-network processing unit (NPU). Among them, different processing units may be independent devices or integrated in one or more processors.
上述控制器可以是指挥终端200的各个部件按照指令协调工作的决策者。是终端 200的神经中枢和指挥中心。控制器根据指令操作码和时序信号,产生操作控制信号,完成取指令和执行指令的控制。The above-mentioned controller may be a decision maker that instructs each component of the terminal 200 to coordinate work according to instructions. It is the nerve center and command center of the terminal 200. The controller generates operation control signals according to the instruction operation code and timing signals, and completes the control of fetching and executing the instructions.
处理器210中还可以设置存储器,用于存储指令和数据。在一些实施例中,处理器210中的存储器为高速缓冲存储器,可以保存处理器210刚用过或循环使用的指令或数据。如果处理器210需要再次使用该指令或数据,可从所述存储器中直接调用。避免了重复存取,减少了处理器210的等待时间,因而提高了系统的效率。The processor 210 may further include a memory for storing instructions and data. In some embodiments, the memory in the processor 210 is a cache memory, which can store instructions or data that the processor 210 has just used or recycled. If the processor 210 needs to use the instruction or data again, it can be directly called from the memory. Repeated accesses are avoided and the waiting time of the processor 210 is reduced, thereby improving the efficiency of the system.
在一些实施例中,处理器210可以包括接口。接口可以包括集成电路(Inter-Integrated Circuit,I2C)接口,集成电路内置音频(inter-integrated circuit Sound,I2S)接口,脉冲编码调制(pulse code modulation,PCM)接口,通用异步收发传输器(universal asynchronous receiver/transmitter,UART)接口,移动产业处理器接口(mobile industry processor interface,MIPI),通用输入输出(general-purpose input/output,GPIO)接口,SIM接口,和/或USB接口等。In some embodiments, the processor 210 may include an interface. The interface may include an integrated circuit (Inter-Integrated Circuit, I2C) interface, an integrated circuit (Inter-integrated circuit, Sound, I2S) interface, a pulse code modulation (pulse code modulation, PCM) interface, universal asynchronous transceiver (universal asynchronous) receiver / transmitter (UART) interface, mobile industry processor interface (MIPI), general-purpose input / output (GPIO) interface, SIM interface, and / or USB interface.
I2C接口是一种双向同步串行总线,包括一根串行数据线(serial data line,SDL)和一根串行时钟线(derail clock line,SCL)。在一些实施例中,处理器210可以包含多组I2C总线。处理器210可以通过不同的I2C总线接口分别耦合触摸传感器280K,充电器,闪光灯,摄像头293等。例如:处理器210可以通过I2C接口耦合触摸传感器280K,使处理器210与触摸传感器280K通过I2C总线接口通信,实现终端200的触摸功能。The I2C interface is a two-way synchronous serial bus that includes a serial data line (SDL) and a serial clock line (SCL). In some embodiments, the processor 210 may include multiple sets of I2C buses. The processor 210 may be coupled to a touch sensor 280K, a charger, a flash, a camera 293 and the like through different I2C bus interfaces. For example, the processor 210 may be coupled to the touch sensor 280K through the I2C interface, so that the processor 210 and the touch sensor 280K communicate through the I2C bus interface to implement the touch function of the terminal 200.
I2S接口可以用于音频通信。在一些实施例中,处理器210可以包含多组I2S总线。处理器210可以通过I2S总线与音频模块270耦合,实现处理器210与音频模块270之间的通信。在一些实施例中,音频模块270可以通过I2S接口向通信模块260传递音频信号,实现通过蓝牙耳机接听电话的功能。The I2S interface can be used for audio communication. In some embodiments, the processor 210 may include multiple sets of I2S buses. The processor 210 may be coupled to the audio module 270 through an I2S bus to implement communication between the processor 210 and the audio module 270. In some embodiments, the audio module 270 can transmit audio signals to the communication module 260 through the I2S interface, so as to implement the function of receiving calls through a Bluetooth headset.
PCM接口也可以用于音频通信,将模拟信号抽样,量化和编码。在一些实施例中,音频模块270与通信模块260可以通过PCM总线接口耦合。在一些实施例中,音频模块270也可以通过PCM接口向通信模块260传递音频信号,实现通过蓝牙耳机接听电话的功能。所述I2S接口和所述PCM接口都可以用于音频通信,两种接口的采样速率不同。The PCM interface can also be used for audio communications, sampling, quantizing, and encoding analog signals. In some embodiments, the audio module 270 and the communication module 260 may be coupled through a PCM bus interface. In some embodiments, the audio module 270 may also transmit audio signals to the communication module 260 through the PCM interface, so as to implement the function of receiving a call through a Bluetooth headset. Both the I2S interface and the PCM interface can be used for audio communication, and the sampling rates of the two interfaces are different.
UART接口是一种通用串行数据总线,用于异步通信。该总线为双向通信总线。它将要传输的数据在串行通信与并行通信之间转换。在一些实施例中,UART接口通常被用于连接处理器210与通信模块260。例如:处理器210通过UART接口与蓝牙模块通信,实现蓝牙功能。在一些实施例中,音频模块270可以通过UART接口向通信模块260传递音频信号,实现通过蓝牙耳机播放音乐的功能。The UART interface is a universal serial data bus for asynchronous communication. This bus is a two-way communication bus. It converts the data to be transferred between serial and parallel communications. In some embodiments, a UART interface is typically used to connect the processor 210 and the communication module 260. For example, the processor 210 communicates with a Bluetooth module through a UART interface to implement a Bluetooth function. In some embodiments, the audio module 270 may transmit audio signals to the communication module 260 through a UART interface, so as to implement a function of playing music through a Bluetooth headset.
MIPI接口可以被用于连接处理器210与显示屏294,摄像头293等外围器件。MIPI接口包括摄像头串行接口(camera serial interface,CSI),显示屏串行接口(display serial interface,DSI)等。在一些实施例中,处理器210和摄像头293通过CSI接口通信,实现终端200的拍摄功能。处理器210和显示屏294通过DSI接口通信,实现终端200的显示功能。The MIPI interface can be used to connect the processor 210 with peripheral devices such as the display 294 and the camera 293. The MIPI interface includes a camera serial interface (CSI), a display serial interface (DSI), and the like. In some embodiments, the processor 210 and the camera 293 communicate through a CSI interface to implement a shooting function of the terminal 200. The processor 210 and the display screen 294 communicate through a DSI interface to implement a display function of the terminal 200.
GPIO接口可以通过软件配置。GPIO接口可以配置为控制信号,也可配置为数据信号。在一些实施例中,GPIO接口可以用于连接处理器210与摄像头293,显示屏294, 通信模块260,音频模块270,传感器模块280等。GPIO接口还可以被配置为I2C接口,I2S接口,UART接口,MIPI接口等。The GPIO interface can be configured by software. The GPIO interface can be configured as a control signal or as a data signal. In some embodiments, the GPIO interface may be used to connect the processor 210 with the camera 293, the display screen 294, the communication module 260, the audio module 270, the sensor module 280, and the like. GPIO interface can also be configured as I2C interface, I2S interface, UART interface, MIPI interface, etc.
USB接口230可以是Mini USB接口,Micro USB接口,USB Type C接口等。USB接口230可以用于连接充电器为终端200充电,也可以用于终端200与外围设备之间传输数据。也可以用于连接耳机,通过耳机播放音频。还可以用于连接其他电子设备,例如AR设备等。The USB interface 230 may be a Mini USB interface, a Micro USB interface, a USB Type C interface, and the like. The USB interface 230 may be used to connect a charger to charge the terminal 200, and may also be used to transfer data between the terminal 200 and a peripheral device. It can also be used to connect headphones and play audio through headphones. It can also be used to connect other electronic devices, such as AR devices.
本申请实施例示意的各模块间的接口连接关系,只是示意性说明,并不构成对终端200的结构限定。终端200可以采用本申请实施例中不同的接口连接方式,或多种接口连接方式的组合。The interface connection relationship between the modules illustrated in the embodiments of the present application is only a schematic description, and does not constitute a limitation on the structure of the terminal 200. The terminal 200 may adopt different interface connection modes or a combination of multiple interface connection modes in the embodiments of the present application.
充电管理模块240用于从充电器接收充电输入。其中,充电器可以是无线充电器,也可以是有线充电器。在一些有线充电的实施例中,充电管理模块240可以通过USB接口230接收有线充电器的充电输入。在一些无线充电的实施例中,充电管理模块240可以通过终端200的无线充电线圈接收无线充电输入。充电管理模块240为电池242充电的同时,还可以通过电源管理模块241为终端200供电。The charging management module 240 is configured to receive a charging input from a charger. The charger may be a wireless charger or a wired charger. In some wired charging embodiments, the charging management module 240 may receive the charging input of the wired charger through the USB interface 230. In some embodiments of wireless charging, the charging management module 240 may receive a wireless charging input through a wireless charging coil of the terminal 200. While the charging management module 240 is charging the battery 242, the terminal 200 can also be powered by the power management module 241.
电源管理模块241用于连接电池242,充电管理模块240与处理器210。电源管理模块241接收所述电池242和/或充电管理模块240的输入,为处理器210,内部存储器221,外部存储器接口220,显示屏294,摄像头293,和通信模块260等供电。电源管理模块241还可以用于监测电池容量,电池循环次数,电池健康状态(漏电,阻抗)等参数。在一些实施例中,电源管理模块241也可以设置于处理器210中。在一些实施例中,电源管理模块241和充电管理模块240也可以设置于同一个器件中。The power management module 241 is used to connect the battery 242, the charge management module 240 and the processor 210. The power management module 241 receives the input from the battery 242 and / or the charge management module 240, and supplies power to the processor 210, the internal memory 221, the external memory interface 220, the display screen 294, the camera 293, and the communication module 260. The power management module 241 can also be used to monitor battery capacity, battery cycle times, battery health status (leakage, impedance) and other parameters. In some embodiments, the power management module 241 may also be disposed in the processor 210. In some embodiments, the power management module 241 and the charge management module 240 may also be provided in the same device.
终端200的无线通信功能可以通过天线1,天线2,射频模块250,通信模块260,调制解调器以及基带处理器等实现。The wireless communication function of the terminal 200 may be implemented through the antenna 1, the antenna 2, the radio frequency module 250, the communication module 260, a modem, and a baseband processor.
天线1和天线2用于发射和接收电磁波信号。终端200中的每个天线可用于覆盖单个或多个通信频带。不同的天线还可以复用,以提高天线的利用率。例如:可以将蜂窝网天线复用为无线局域网分集天线。在一些实施例中,天线可以和调谐开关结合使用。The antenna 1 and the antenna 2 are used for transmitting and receiving electromagnetic wave signals. Each antenna in the terminal 200 may be used to cover a single or multiple communication frequency bands. Different antennas can also be multiplexed to improve antenna utilization. For example, a cellular network antenna can be multiplexed into a wireless LAN diversity antenna. In some embodiments, the antenna may be used in conjunction with a tuning switch.
射频模块250可以提供应用在终端200上的包括2G/3G/4G/5G等无线通信的解决方案的通信处理模块。射频模块250可以包括至少一个滤波器,开关,功率放大器,低噪声放大器(low noise amplifier,LNA)等。射频模块250由天线1接收电磁波,并对接收的电磁波进行滤波,放大等处理,传送至调制解调器进行解调。射频模块250还可以对经调制解调器调制后的信号放大,经天线1转为电磁波辐射出去。在一些实施例中,射频模块250的至少部分功能模块可以被设置于处理器210中。在一些实施例中,射频模块250的至少部分功能模块可以与处理器210的至少部分模块被设置在同一个器件中。The radio frequency module 250 may provide a communication processing module for a wireless communication solution including 2G / 3G / 4G / 5G and the like applied on the terminal 200. The radio frequency module 250 may include at least one filter, a switch, a power amplifier, a low noise amplifier (LNA), and the like. The radio frequency module 250 receives electromagnetic waves from the antenna 1, and performs filtering, amplification, and other processing on the received electromagnetic waves, and transmits them to the modem for demodulation. The radio frequency module 250 can also amplify the signal modulated by the modem and turn it into electromagnetic wave radiation through the antenna 1. In some embodiments, at least part of the functional modules of the radio frequency module 250 may be disposed in the processor 210. In some embodiments, at least part of the functional modules of the radio frequency module 250 may be provided in the same device as at least part of the modules of the processor 210.
调制解调器可以包括调制器和解调器。调制器用于将待发送的低频基带信号调制成中高频信号。解调器用于将接收的电磁波信号解调为低频基带信号。随后解调器将解调得到的低频基带信号传送至基带处理器处理。低频基带信号经基带处理器处理后,被传递给应用处理器。应用处理器通过音频设备(不限于扬声器270A,受话器270B等)输出声音信号,或通过显示屏294显示图像或视频。在一些实施例中,调制解调 器可以是独立的器件。在一些实施例中,调制解调器可以独立于处理器210,与射频模块250或其他功能模块设置在同一个器件中。The modem may include a modulator and a demodulator. The modulator is used for modulating the low-frequency baseband signal to be transmitted into a medium-high frequency signal. The demodulator is used to demodulate the received electromagnetic wave signal into a low-frequency baseband signal. The demodulator then transmits the demodulated low-frequency baseband signal to a baseband processor for processing. The low-frequency baseband signal is processed by the baseband processor and then passed to the application processor. The application processor outputs a sound signal through an audio device (not limited to the speaker 270A, the receiver 270B, etc.), or displays an image or video through the display screen 294. In some embodiments, the modem may be a separate device. In some embodiments, the modem may be independent of the processor 210 and disposed in the same device as the radio frequency module 250 or other functional modules.
通信模块260可以提供应用在终端200上的包括无线局域网(wireless local area networks,WLAN)(如无线保真(wireless fidelity,Wi-Fi)网络),蓝牙(blue yooth,BT),全球导航卫星系统(global navigation satellite system,GNSS),调频(frequency modulation,FM),近距离无线通信技术(near field communication,NFC),红外技术(infrared,IR)等无线通信的解决方案的通信处理模块。通信模块260可以是集成至少一个通信处理模块的一个或多个器件。通信模块260经由天线2接收电磁波,将电磁波信号调频以及滤波处理,将处理后的信号发送到处理器210。通信模块260还可以从处理器210接收待发送的信号,对其进行调频,放大,经天线2转为电磁波辐射出去。The communication module 260 may provide wireless local area networks (WLAN) (such as wireless fidelity (Wi-Fi) networks), bluetooth (BT), and global navigation satellite systems applied to the terminal 200. (Global navigation, satellite system, GNSS), a communication processing module for wireless communication solutions such as frequency modulation (FM), near field communication (NFC), and infrared (IR). The communication module 260 may be one or more devices that integrate at least one communication processing module. The communication module 260 receives electromagnetic waves via the antenna 2, frequency-modulates and filters the electromagnetic wave signals, and sends the processed signals to the processor 210. The communication module 260 may also receive a signal to be transmitted from the processor 210, frequency-modulate it, amplify it, and convert it into electromagnetic wave radiation through the antenna 2.
在一些实施例中,终端200的天线1和射频模块250耦合,天线2和通信模块260耦合,使得终端200可以通过无线通信技术与网络以及其他设备通信。所述无线通信技术可以包括全球移动通讯系统(global system for mobile communications,GSM),通用分组无线服务(general packet radio service,GPRS),码分多址接入(code division multiple access,CDMA),宽带码分多址(wideband code division multiple access,WCDMA),时分码分多址(time-division code division multiple access,TD-SCDMA),长期演进(long term evolution,LTE),BT,GNSS,WLAN,NFC,FM,和/或IR技术等。所述GNSS可以包括全球卫星定位系统(satellite based augmentation systems,SBAS),全球导航卫星系统(global navigation satellite system,GLONASS),北斗卫星导航系统(BeiDou navigation satellite system,BDS),准天顶卫星系统(Quasi-Zenith satellite system,QZSS)和/或星基增强系统(satellite based augmentation systems,SBAS)。In some embodiments, the antenna 1 of the terminal 200 is coupled to the radio frequency module 250, and the antenna 2 is coupled to the communication module 260, so that the terminal 200 can communicate with the network and other devices through wireless communication technology. The wireless communication technology may include a global mobile communication system (GSM), a general packet radio service (GPRS), a code division multiple access (CDMA), and broadband. Code division multiple access (wideband code division multiple access, WCDMA), time-division code division multiple access (TD-SCDMA), long term evolution (LTE), BT, GNSS, WLAN, NFC , FM, and / or IR technology. The GNSS may include global satellite positioning systems (SBAS), global navigation satellite systems (GLONASS), BeiDou navigation navigation systems (BDS), and quasi-zenith satellite systems (BDS). Quasi-Zenith satellite system (QZSS) and / or satellite-based augmentation systems (SBAS).
终端200通过GPU,显示屏294,以及应用处理器等实现显示功能。GPU为图像处理的微处理器,连接显示屏294和应用处理器。GPU用于执行数学和几何计算,用于图形渲染。处理器210可包括一个或多个GPU,其执行程序指令以生成或改变显示信息。The terminal 200 implements a display function through a GPU, a display screen 294, and an application processor. The GPU is a microprocessor for image processing and is connected to the display 294 and an application processor. The GPU is used to perform mathematical and geometric calculations for graphics rendering. The processor 210 may include one or more GPUs that execute program instructions to generate or change display information.
显示屏294用于显示图像,视频等。显示屏294包括显示面板。显示面板可以采用液晶显示屏(liquid crystal display,LCD),有机发光二极管(organic light-emitting diode,OLED),有源矩阵有机发光二极体或主动矩阵有机发光二极体(active-matrix organic light emitting diode,AMOLED),柔性发光二极管(flex light-emitting diode,FLED),Miniled,MicroLed,Micro-oLed,量子点发光二极管(quantum dot light emitting diodes,QLED)等。在一些实施例中,终端200可以包括1个或N个显示屏294,N为大于1的正整数。The display screen 294 is used to display images, videos, and the like. The display screen 294 includes a display panel. The display panel can use a liquid crystal display (LCD), an organic light-emitting diode (OLED), an active matrix organic light emitting diode or an active matrix organic light emitting diode (active-matrix organic light-emitting diode). emitting diodes (AMOLED), flexible light-emitting diodes (FLEDs), Miniled, MicroLed, Micro-oLed, quantum dot light emitting diodes (QLEDs), etc. In some embodiments, the terminal 200 may include one or N display screens 294, where N is a positive integer greater than 1.
终端200可以通过ISP,摄像头293,视频编解码器,GPU,显示屏以及应用处理器等实现拍摄功能。The terminal 200 can implement a shooting function through an ISP, a camera 293, a video codec, a GPU, a display screen, and an application processor.
ISP用于处理摄像头293反馈的数据。例如,拍照时,打开快门,光线通过镜头被传递到摄像头感光元件上,光信号转换为电信号,摄像头感光元件将所述电信号传递给ISP处理,转化为肉眼可见的图像。ISP还可以对图像的噪点,亮度,肤色进行算法优化。ISP还可以对拍摄场景的曝光,色温等参数优化。在一些实施例中,ISP可 以设置在摄像头293中。The ISP is used to process the data fed back by the camera 293. For example, when taking a picture, the shutter is opened, and the light is transmitted to the light receiving element of the camera through the lens. The light signal is converted into an electrical signal, and the light receiving element of the camera passes the electrical signal to the ISP for processing and converts the image to the naked eye. ISP can also optimize the image's noise, brightness, and skin tone. ISP can also optimize the exposure, color temperature and other parameters of the shooting scene. In some embodiments, an ISP may be provided in the camera 293.
摄像头293用于捕获静态图像或视频。物体通过镜头生成光学图像投射到感光元件。感光元件可以是电荷耦合器件(charge coupled device,CCD)或互补金属氧化物半导体(complementary metal-oxide-semiconductor,CMOS)光电晶体管。感光元件把光信号转换成电信号,之后将电信号传递给ISP转换成数字图像信号。ISP将数字图像信号输出到DSP加工处理。DSP将数字图像信号转换成标准的RGB,YUV等格式的图像信号。在一些实施例中,终端200可以包括1个或N个摄像头293,N为大于1的正整数。The camera 293 is used to capture still images or videos. An object generates an optical image through a lens and projects it onto a photosensitive element. The photosensitive element may be a charge coupled device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor. The photosensitive element converts the optical signal into an electrical signal, and then passes the electrical signal to the ISP to convert it into a digital image signal. The ISP outputs digital image signals to the DSP for processing. DSP converts digital image signals into image signals in standard RGB, YUV and other formats. In some embodiments, the terminal 200 may include one or N cameras 293, where N is a positive integer greater than 1.
数字信号处理器用于处理数字信号,除了可以处理数字图像信号,还可以处理其他数字信号。例如,当终端200在频点选择时,数字信号处理器用于对频点能量进行傅里叶变换等。A digital signal processor is used to process digital signals. In addition to digital image signals, it can also process other digital signals. For example, when the terminal 200 selects at a frequency point, the digital signal processor is used to perform a Fourier transform on the frequency point energy and the like.
视频编解码器用于对数字视频压缩或解压缩。终端200可以支持一种或多种视频编解码器。这样,终端200可以播放或录制多种编码格式的视频,例如:动态图像专家组(moving picture experts Group,MPEG)1,MPEG2,MPEG3,MPEG4等。Video codecs are used to compress or decompress digital video. The terminal 200 may support one or more video codecs. In this way, the terminal 200 can play or record videos in multiple encoding formats, such as: Moving Picture Experts Group (MPEG) 1, MPEG2, MPEG3, MPEG4, and so on.
NPU为神经网络(neural-network,NN)计算处理器,通过借鉴生物神经网络结构,例如借鉴人脑神经元之间传递模式,对输入信息快速处理,还可以不断的自学习。通过NPU可以实现终端200的智能认知等应用,例如:图像识别,人脸识别,语音识别,文本理解等。The NPU is a neural-network (NN) computing processor. By drawing on the structure of a biological neural network, such as the transfer mode between neurons in the human brain, the NPU can quickly process input information and continuously learn. Through the NPU, applications such as intelligent cognition of the terminal 200 can be implemented, such as: image recognition, face recognition, speech recognition, text understanding, and the like.
外部存储器接口220可以用于连接外部存储卡,例如Micro SD卡,实现扩展终端200的存储能力。外部存储卡通过外部存储器接口220与处理器210通信,实现数据存储功能。例如将音乐,视频等文件保存在外部存储卡中。The external memory interface 220 can be used to connect an external memory card, such as a Micro SD card, to realize the expansion of the storage capacity of the terminal 200. The external memory card communicates with the processor 210 through the external memory interface 220 to implement a data storage function. For example, save music, videos and other files on an external memory card.
内部存储器221可以用于存储计算机可执行程序代码,所述可执行程序代码包括指令。处理器210通过运行存储在内部存储器221的指令,从而执行终端200的各种功能应用以及数据处理。存储器221可以包括存储程序区和存储数据区。其中,存储程序区可存储操作系统,至少一个功能所需的应用程序(比如声音播放功能,图像播放功能等)等。存储数据区可存储终端200使用过程中所创建的数据(比如音频数据,电话本等)等。此外,存储器221可以包括高速随机存取存储器,还可以包括非易失性存储器,例如至少一个磁盘存储器件,闪存器件,其他易失性固态存储器件,通用闪存存储器(universal flash storage,UFS)等。The internal memory 221 may be used to store computer executable program code, where the executable program code includes instructions. The processor 210 executes various functional applications and data processing of the terminal 200 by executing instructions stored in the internal memory 221. The memory 221 may include a storage program area and a storage data area. The storage program area may store an operating system, at least one application required by a function (such as a sound playback function, an image playback function, etc.) and the like. The storage data area may store data (such as audio data, phone book, etc.) created during the use of the terminal 200. In addition, the memory 221 may include a high-speed random access memory, and may also include a non-volatile memory, such as at least one disk storage device, a flash memory device, other volatile solid-state storage devices, universal flash memory (universal flash storage, UFS), etc. .
终端200可以通过音频模块270,扬声器270A,受话器270B,麦克风270C,耳机接口270D,以及应用处理器等实现音频功能。例如音乐播放,录音等。The terminal 200 can implement audio functions through an audio module 270, a speaker 270A, a receiver 270B, a microphone 270C, a headphone interface 270D, and an application processor. Such as music playback, recording, etc.
音频模块270用于将数字音频信息转换成模拟音频信号输出,也用于将模拟音频输入转换为数字音频信号。音频模块270还可以用于对音频信号编码和解码。在一些实施例中,音频模块270可以设置于处理器210中,或将音频模块270的部分功能模块设置于处理器210中。The audio module 270 is used for converting digital audio information into an analog audio signal and outputting, and also for converting an analog audio input into a digital audio signal. The audio module 270 may also be used to encode and decode audio signals. In some embodiments, the audio module 270 may be disposed in the processor 210, or some functional modules of the audio module 270 may be disposed in the processor 210.
扬声器270A,也称“喇叭”,用于将音频电信号转换为声音信号。终端200可以通过扬声器270A收听音乐,或收听免提通话。The speaker 270A, also called a "horn", is used to convert audio electrical signals into sound signals. The terminal 200 can listen to music through the speaker 270A, or listen to a hands-free call.
受话器270B,也称“听筒”,用于将音频电信号转换成声音信号。当终端200接听电话或语音信息时,可以通过将受话器270B靠近人耳接听语音。The receiver 270B, also known as the "handset", is used to convert audio electrical signals into sound signals. When the terminal 200 answers a call or a voice message, it can answer the voice by holding the receiver 270B close to the human ear.
麦克风270C,也称“话筒”,“传声器”,用于将声音信号转换为电信号。当拨打电话或发送语音信息时,用户可以通过人嘴靠近麦克风270C发声,将声音信号输入到麦克风270C。终端200可以设置至少一个麦克风270C。在一些实施例中,终端200可以设置两个麦克风270C,除了采集声音信号,还可以实现降噪功能。在一些实施例中,终端200还可以设置三个,四个或更多麦克风270C,实现采集声音信号,降噪,还可以识别声音来源,实现定向录音功能等。Microphone 270C, also called "microphone", "microphone", is used to convert sound signals into electrical signals. When making a call or sending a voice message, the user can make a sound through the mouth of the user near the microphone 270C, and input a sound signal into the microphone 270C. The terminal 200 may be provided with at least one microphone 270C. In some embodiments, the terminal 200 may be provided with two microphones 270C. In addition to collecting sound signals, a noise reduction function may also be implemented. In some embodiments, the terminal 200 may further be provided with three, four, or more microphones 270C to realize the collection of sound signals, reduce noise, and may also identify the source of sound, and implement a directional recording function.
耳机接口270D用于连接有线耳机。耳机接口270D可以是USB接口230,也可以是3.5mm的开放移动终端平台(open mobile terminal platform,OMTP)标准接口,美国蜂窝电信工业协会(cellular telecommunications industry association of the USA,CTIA)标准接口。The headset interface 270D is used to connect a wired headset. The earphone interface 270D may be a USB interface 230 or a 3.5mm open mobile terminal platform (OMTP) standard interface, and a cellular telecommunications industry association (United States of America, CTIA) standard interface.
压力传感器280A用于感受压力信号,可以将压力信号转换成电信号。在一些实施例中,压力传感器280A可以设置于显示屏294。压力传感器280A的种类很多,如电阻式压力传感器,电感式压力传感器,电容式压力传感器等。电容式压力传感器可以是包括至少两个具有导电材料的平行板。当有力作用于压力传感器,电极之间的电容改变。终端200根据电容的变化确定压力的强度。当有触摸操作作用于显示屏294,终端200根据压力传感器280A检测所述触摸操作强度。终端200也可以根据压力传感器280A的检测信号计算触摸的位置。在一些实施例中,作用于相同触摸位置,但不同触摸操作强度的触摸操作,可以对应不同的操作指令。例如:当有触摸操作强度小于第一压力阈值的触摸操作作用于短消息应用图标时,执行查看短消息的指令。当有触摸操作强度大于或等于第一压力阈值的触摸操作作用于短消息应用图标时,执行新建短消息的指令。The pressure sensor 280A is used to sense a pressure signal, and can convert the pressure signal into an electrical signal. In some embodiments, the pressure sensor 280A may be disposed on the display screen 294. There are many types of pressure sensors 280A, such as resistive pressure sensors, inductive pressure sensors, and capacitive pressure sensors. The capacitive pressure sensor may be at least two parallel plates having a conductive material. When a force is applied to the pressure sensor, the capacitance between the electrodes changes. The terminal 200 determines the intensity of the pressure according to the change in capacitance. When a touch operation is performed on the display screen 294, the terminal 200 detects the intensity of the touch operation according to the pressure sensor 280A. The terminal 200 may also calculate the touched position based on the detection signal of the pressure sensor 280A. In some embodiments, touch operations acting on the same touch position but different touch operation intensities may correspond to different operation instructions. For example, when a touch operation with a touch operation intensity lower than the first pressure threshold is applied to the short message application icon, an instruction for viewing the short message is executed. When a touch operation with a touch operation intensity greater than or equal to the first pressure threshold is applied to the short message application icon, an instruction for creating a short message is executed.
陀螺仪传感器280B可以用于确定终端200的运动姿态。在一些实施例中,可以通过陀螺仪传感器280B确定终端200围绕三个轴(即,x,y和z轴)的角速度。陀螺仪传感器280B可以用于拍摄防抖。示例性的,当按下快门,陀螺仪传感器280B检测终端200抖动的角度,根据角度计算出镜头模组需要补偿的距离,让镜头通过反向运动抵消终端200的抖动,实现防抖。陀螺仪传感器280B还可以用于导航,体感游戏场景。The gyro sensor 280B may be used to determine a motion posture of the terminal 200. In some embodiments, the angular velocity of the terminal 200 around three axes (ie, the x, y, and z axes) may be determined by the gyro sensor 280B. The gyro sensor 280B can be used for image stabilization. Exemplarily, when the shutter is pressed, the gyro sensor 280B detects the angle of the shake of the terminal 200, and calculates the distance to be compensated by the lens module according to the angle, so that the lens can cancel the shake of the terminal 200 through the reverse movement to achieve image stabilization. The gyro sensor 280B can also be used for navigation and somatosensory gaming scenes.
气压传感器280C用于测量气压。在一些实施例中,终端200通过气压传感器280C测得的气压值计算海拔高度,辅助定位和导航。The air pressure sensor 280C is used to measure air pressure. In some embodiments, the terminal 200 calculates the altitude through the air pressure value measured by the air pressure sensor 280C, and assists in positioning and navigation.
磁传感器280D包括霍尔传感器。终端200可以利用磁传感器280D检测翻盖皮套的开合。在一些实施例中,当终端200是翻盖机时,终端200可以根据磁传感器280D检测翻盖的开合。进而根据检测到的皮套的开合状态或翻盖的开合状态,设置翻盖自动解锁等特性。The magnetic sensor 280D includes a Hall sensor. The terminal 200 may detect the opening and closing of the flip leather case by using the magnetic sensor 280D. In some embodiments, when the terminal 200 is a flip machine, the terminal 200 may detect the opening and closing of the flip according to the magnetic sensor 280D. Further, according to the opened and closed state of the holster or the opened and closed state of the flip cover, characteristics such as automatic unlocking of the flip cover are set.
加速度传感器280E可检测终端200在各个方向上(一般为三轴)加速度的大小。当终端200静止时可检测出重力的大小及方向。还可以用于识别终端姿态,应用于横竖屏切换,计步器等应用。The acceleration sensor 280E can detect the magnitude of the acceleration of the terminal 200 in various directions (generally three axes). The magnitude and direction of gravity can be detected when the terminal 200 is stationary. It can also be used to identify the posture of the terminal, and is used in applications such as switching between horizontal and vertical screens, and pedometers.
距离传感器280F,用于测量距离。终端200可以通过红外或激光测量距离。在一些实施例中,拍摄场景,终端200可以利用距离传感器280F测距以实现快速对焦。Distance sensor 280F for measuring distance. The terminal 200 can measure the distance by infrared or laser. In some embodiments, when shooting a scene, the terminal 200 may use a distance sensor 280F to measure a distance to achieve fast focusing.
接近光传感器280G可以包括例如发光二极管(LED)和光检测器,例如光电二极管。 发光二极管可以是红外发光二极管。通过发光二极管向外发射红外光。使用光电二极管检测来自附近物体的红外反射光。当检测到充分的反射光时,可以确定终端200附近有物体。当检测到不充分的反射光时,可以确定终端200附近没有物体。终端200可以利用接近光传感器280G检测用户手持终端200贴近耳朵通话,以便自动熄灭屏幕达到省电的目的。接近光传感器280G也可用于皮套模式,口袋模式自动解锁与锁屏。The proximity light sensor 280G may include, for example, a light emitting diode (LED) and a light detector, such as a photodiode. The light emitting diode may be an infrared light emitting diode. Infrared light is emitted outward through a light emitting diode. Use photodiodes to detect infrared reflected light from nearby objects. When sufficient reflected light is detected, it can be determined that there is an object near the terminal 200. When insufficient reflected light is detected, it may be determined that there is no object near the terminal 200. The terminal 200 can use the proximity light sensor 280G to detect that the user is holding the terminal 200 close to the ear to talk, so as to automatically turn off the screen to save power. The proximity light sensor 280G can also be used in holster mode, and the pocket mode automatically unlocks and locks the screen.
环境光传感器280L用于感知环境光亮度。终端200可以根据感知的环境光亮度自适应调节显示屏亮度。环境光传感器280L也可用于拍照时自动调节白平衡。环境光传感器280L还可以与接近光传感器280G配合,检测终端200是否在口袋里,以防误触。Ambient light sensor 280L is used to sense ambient light brightness. The terminal 200 can adaptively adjust the brightness of the display screen according to the perceived ambient light brightness. Ambient light sensor 280L can also be used to automatically adjust white balance when taking pictures. The ambient light sensor 280L can also cooperate with the proximity light sensor 280G to detect whether the terminal 200 is in a pocket to prevent accidental touch.
指纹传感器280H用于采集指纹。终端200可以利用采集的指纹特性实现指纹解锁,访问应用锁,指纹拍照,指纹接听来电等。The fingerprint sensor 280H is used to collect fingerprints. The terminal 200 may use the collected fingerprint characteristics to realize fingerprint unlocking, access application lock, fingerprint photographing, fingerprint answering an incoming call, and the like.
温度传感器280J用于检测温度。在一些实施例中,终端200利用温度传感器280J检测的温度,执行温度处理策略。例如,当温度传感器280J上报的温度超过阈值,终端200执行降低位于温度传感器280J附近的处理器的性能,以便降低功耗实施热保护。The temperature sensor 280J is used to detect the temperature. In some embodiments, the terminal 200 executes a temperature processing strategy using the temperature detected by the temperature sensor 280J. For example, when the temperature reported by the temperature sensor 280J exceeds the threshold, the terminal 200 executes reducing the performance of a processor located near the temperature sensor 280J, so as to reduce power consumption and implement thermal protection.
触摸传感器280K,也称“触控面板”。可设置于显示屏294。用于检测作用于其上或附近的触摸操作。可以将检测到的触摸操作传递给应用处理器,以确定触摸事件类型,并通过显示屏294提供相应的视觉输出。The touch sensor 280K is also called "touch panel". Can be set on display 294. Used to detect touch operations on or near it. The detected touch operation may be passed to the application processor to determine the type of touch event, and a corresponding visual output is provided through the display screen 294.
骨传导传感器280M可以获取振动信号。在一些实施例中,骨传导传感器280M可以获取人体声部振动骨块的振动信号。骨传导传感器280M也可以接触人体脉搏,接收血压跳动信号。在一些实施例中,骨传导传感器280M也可以设置于耳机中。音频模块270可以基于所述骨传导传感器280M获取的声部振动骨块的振动信号,解析出语音信号,实现语音功能。应用处理器可以基于所述骨传导传感器280M获取的血压跳动信号解析心率信息,实现心率检测功能。The bone conduction sensor 280M can acquire vibration signals. In some embodiments, the bone conduction sensor 280M can acquire a vibration signal of a human voice oscillating bone mass. The bone conduction sensor 280M can also contact the human pulse and receive blood pressure beating signals. In some embodiments, the bone conduction sensor 280M may also be provided in the headset. The audio module 270 may analyze a voice signal based on the vibration signal of the oscillating bone mass obtained by the bone conduction sensor 280M to implement a voice function. The application processor may analyze the heart rate information based on the blood pressure beating signal acquired by the bone conduction sensor 280M to implement a heart rate detection function.
按键290包括开机键,音量键等。按键290可以是机械按键。也可以是触摸式按键。终端200接收按键290输入,产生与终端200的用户设置以及功能控制有关的键信号输入。The keys 290 include a start key, a volume key, and the like. The key 290 may be a mechanical key. It can also be a touch button. The terminal 200 receives the input of the key 290 and generates a key signal input related to user settings and function control of the terminal 200.
马达291可以产生振动提示。马达291可以用于来电振动提示,也可以用于触摸振动反馈。例如,作用于不同应用(例如拍照,音频播放等)的触摸操作,可以对应不同的振动反馈效果。作用于显示屏294不同区域的触摸操作,也可对应不同的振动反馈效果。不同的应用场景(例如:时间提醒,接收信息,闹钟,游戏等)也可以对应不同的振动反馈效果。触摸振动反馈效果还可以支持自定义。The motor 291 can generate a vibration alert. The motor 291 can be used for vibration alert for incoming calls, and can also be used for touch vibration feedback. For example, the touch operation applied to different applications (such as taking pictures, playing audio, etc.) can correspond to different vibration feedback effects. Touch operations on different areas of the display screen 294 can also correspond to different vibration feedback effects. Different application scenarios (such as time reminders, receiving information, alarm clocks, games, etc.) can also correspond to different vibration feedback effects. Touch vibration feedback effect can also support customization.
指示器292可以是指示灯,可以用于指示充电状态,电量变化,也可以用于指示消息,未接来电,通知等。The indicator 292 can be an indicator light, which can be used to indicate the charging status, power change, and can also be used to indicate messages, missed calls, notifications, and so on.
SIM卡接口295用于连接SIM。SIM卡可以通过插入SIM卡接口295,或从SIM卡接口295拔出,实现和终端200的接触和分离。终端200可以支持1个或N个SIM卡接口,N为大于1的正整数。SIM卡接口295可以支持Nano SIM卡,Micro SIM卡,SIM卡等。同一个SIM卡接口295可以同时插入多张卡。所述多张卡的类型可以相同,也可以不同。SIM卡接口295也可以兼容不同类型的SIM卡。SIM卡接口295也可以兼容外部存储卡。终端200通过SIM卡和网络交互,实现通话以及数据通信等功能。 在一些实施例中,终端200采用eSIM。即:嵌入式SIM卡。eSIM卡可以嵌在终端200中,不能和终端200分离。The SIM card interface 295 is used to connect a SIM. The SIM card can be inserted and removed from the SIM card interface 295 to achieve contact and separation with the terminal 200. The terminal 200 may support one or N SIM card interfaces, and N is a positive integer greater than 1. The SIM card interface 295 can support Nano SIM cards, Micro SIM cards, SIM cards, etc. Multiple SIM cards can be inserted into the same SIM card interface 295 at the same time. The types of the multiple cards may be the same or different. The SIM card interface 295 can also be compatible with different types of SIM cards. The SIM card interface 295 is also compatible with external memory cards. The terminal 200 interacts with the network through a SIM card to implement functions such as calling and data communication. In some embodiments, the terminal 200 employs eSIM. That is: embedded SIM card. The eSIM card can be embedded in the terminal 200 and cannot be separated from the terminal 200.
请参考图3,其示出本申请实施例提供的一种通信过程中信息的记录及显示方法所应用的通信场景实例示意图。本申请实施例以图3所示的终端200为本申请实施例中的第一终端,终端300为第二终端,终端200与终端300进行语音通信为例,对本申请实施例提供的一种通信过程中信息的记录及显示方法进行举例说明。其中,如图3所示,在终端200与终端300进行语音通信的过程中,用户210使用终端200与使用终端300的用户310进行语音通话。Please refer to FIG. 3, which is a schematic diagram illustrating an example of a communication scenario applied to a method for recording and displaying information in a communication process according to an embodiment of the present application. In the embodiment of the present application, the terminal 200 shown in FIG. 3 is the first terminal in the embodiment of the present application, the terminal 300 is the second terminal, and the terminal 200 performs voice communication with the terminal 300 as an example. The method of recording and displaying information during the process will be illustrated by examples. Among them, as shown in FIG. 3, during the voice communication between the terminal 200 and the terminal 300, the user 210 uses the terminal 200 to make a voice call with the user 310 using the terminal 300.
本申请实施例提供一种通信过程中信息的记录方法,如图4所示,该通信过程中信息的记录方法可以包括S401-S404:An embodiment of the present application provides a method for recording information during communication. As shown in FIG. 4, the method for recording information during communication may include S401-S404:
S401、终端200与终端300进行语音通信。S401. The terminal 200 performs voice communication with the terminal 300.
S402、终端200识别终端200的麦克风270C捕获的语音数据。S402. The terminal 200 identifies voice data captured by the microphone 270C of the terminal 200.
其中,在终端200与终端300进行语音通信的过程中,终端200的麦克风270C(也称“话筒”)可以捕获终端200周围的语音数据。终端200周围的语音数据可以包括用户210发出的语音数据和终端200的周围的环境噪音。终端200可以将麦克风270C捕获的语音数据转换成文本,然后判断转换得到的文本是否与预置启动词匹配。若匹配,终端200则可以开始录制上述语音通信对应的语音数据。Among them, during the voice communication between the terminal 200 and the terminal 300, a microphone 270C (also referred to as a "microphone") of the terminal 200 can capture voice data around the terminal 200. The voice data around the terminal 200 may include voice data sent by the user 210 and ambient noise around the terminal 200. The terminal 200 may convert the voice data captured by the microphone 270C into text, and then determine whether the converted text matches a preset start word. If they match, the terminal 200 can start recording the voice data corresponding to the voice communication.
示例性的,上述预置启动词可以为“我记录一下”、“我记一下”、“开始记录”和“现在开始记录”等。Exemplarily, the above preset start words may be "Let me record", "Let me record", "Start recording", "Start recording now", and the like.
本申请实施例中的预置启动词可以是终端200出厂时,配置在终端200中的默认启动词。或者,该预置启动词可以是用户在终端200中设置的自定义启动词。The preset start word in the embodiment of the present application may be a default start word configured in the terminal 200 when the terminal 200 is shipped from the factory. Alternatively, the preset start word may be a custom start word set by the user in the terminal 200.
本申请实施例这里以图5所示的终端200(即手机200)为例,对终端接收用户设置的启动词的过程进行说明:In this embodiment of the present application, the terminal 200 (ie, the mobile phone 200) shown in FIG. 5 is used as an example to describe the process of the terminal receiving the startup word set by the user:
手机200可以接收用户对手机200的桌面上的“设置”应用图标的点击操作(如单击操作)。响应于用户对“设置”应用图标的点击操作,手机200可以显示手机设置界面。该手机设置界面中可以包括“飞行模式”选项、“WLAN”选项、“蓝牙”选项、“移动网络”选项和“系统应用”选项等。其中,“飞行模式”选项、“WLAN”选项、“蓝牙”选项和“移动网络”选项的具体功能可以参考常规技术中的具体描述,本申请实施例这里不予赘述。手机200响应于用户对“系统应用”选项的点击操作,可以显示系统应用设置界面。该系统应用设置界面中包括“电话”选项、“联系人”选项和“短信”选项等。手机200响应于用户对系统应用设置界面中“电话”选项的点击操作,可以显示图5中的(a)所示的电话设置界面501。可选的,上述设置界面中可以包括“电话”选项。手机200响应于用户对手机设置界面中“电话”选项的点击操作,可以显示图5中的(a)所示的电话设置界面501。The mobile phone 200 may receive a user's click operation (such as a click operation) on the “Settings” application icon on the desktop of the mobile phone 200. In response to the user's click operation on the “Settings” application icon, the mobile phone 200 may display a mobile phone setting interface. The mobile phone settings interface can include "airplane mode" options, "WLAN" options, "Bluetooth" options, "Mobile network" options, and "System application" options, etc. For specific functions of the "airplane mode" option, the "WLAN" option, the "Bluetooth" option, and the "mobile network" option, reference may be made to specific descriptions in conventional technologies, which are not described in the embodiment of the present application. The mobile phone 200 may display a system application setting interface in response to a user's click operation on the "System Application" option. The system application setting interface includes a "phone" option, a "contact" option, and a "text message" option. The mobile phone 200 may display the phone setting interface 501 shown in (a) of FIG. 5 in response to a user's click operation on the “phone” option in the system application setting interface. Optionally, the above-mentioned setting interface may include a "phone" option. The mobile phone 200 may display the phone setting interface 501 shown in (a) of FIG. 5 in response to a user's click operation on the “Phone” option in the mobile phone setting interface.
如图5中的(a)所示,电话设置界面501中包括“呼叫转移”选项、“呼叫等待”选项、“来电拦截”选项和“通话录音”选项502。其中,“呼叫转移”选项、“呼叫等待”选项和“来电拦截”选项的具体功能可以参考常规技术中的具体描述,本申请实施例这里不予赘述。手机200响应于“通话录音”选项502的点击操作,可以显示图5中的(b)所示的通话录音界面503。通话录音界面503中包括“录音通知”选项、“接通自动录音” 选项和“语音控制录音”选项504。“录音通知”选项开启后,手机200录音后可以在通知栏提示。“接通自动录音”选项开启后,手机200接通来电时可以自动录音。“语音控制录音”选项504开启后,在通话过程中,手机200接收用户发出的启动词(如“开始录音”或“我记一下”),便可以自动开始录音;手机200接收用户发出的结束词(如“结束录音”)或通话结束或手机200录音预设时间(如1分钟、2分钟或者5分钟等)后,便可以自动结束录音。手机200响应于用户对“语音控制录音”选项504的点击操作,可以显示图5中的(c)所示的语音控制录音界面505。语音控制录音界面505中包括语音控制录音开关506、“启动词”选项507和“结束词”选项510。“启动词”选项507中包括手机200中当前配置的启动词508和“自定义启动词”选项509。“结束词”选项510中包括手机200中当前配置的结束词511和“自定义结束词”选项512。其中,在用户还未在手机100中设置启动词和结束词之前,在“启动词”选项507中向用户指示默认启动词“开始录音”,在“结束词”选项510中向用户指示默认结束词“结束录音”。As shown in (a) of FIG. 5, the phone setting interface 501 includes a “call forwarding” option, a “call waiting” option, an “incoming call blocking” option, and a “call recording” option 502. The specific functions of the "Call Forwarding" option, "Call Waiting" option, and "Incoming Call Blocking" option can refer to the specific description in the conventional technology, which will not be repeated here in the embodiment of the present application. In response to the click operation of the “call recording” option 502, the mobile phone 200 may display the call recording interface 503 shown in FIG. 5 (b). The call recording interface 503 includes a "recording notification" option, a "connect to automatic recording" option, and a "voice control recording" option 504. After the “Recording Notification” option is turned on, the mobile phone 200 can prompt in the notification bar after recording. When the “Automatic recording on call” option is turned on, the mobile phone 200 can automatically record when a call is connected. After the "Voice Control Recording" option 504 is turned on, during the call, the mobile phone 200 receives the start word (such as "start recording" or "memory") sent by the user, and can automatically start recording; the mobile phone 200 receives the end of the user's Words (such as "End Recording") or the end of a call or the phone 200 recording preset time (such as 1 minute, 2 minutes, or 5 minutes, etc.) can automatically end the recording. In response to the user's click operation on the "Voice Control Recording" option 504, the mobile phone 200 may display the voice control recording interface 505 shown in (c) in FIG. 5. The voice-control recording interface 505 includes a voice-control recording switch 506, a "start word" option 507, and a "end word" option 510. The “starting word” option 507 includes a start word 508 and a “custom start word” option 509 currently configured in the mobile phone 200. The “end word” option 510 includes a currently configured end word 511 and a “custom end word” option 512 in the mobile phone 200. Wherein, before the user has set the start word and the end word in the mobile phone 100, the user is instructed in the "start word" option 507 to indicate the default start word "start recording", and the "end word" option 510 is indicated to the user in default The word "end recording".
手机200响应于用户对“自定义启动词”选项的点击操作,可以显示图6中所示的启动词自定义界面601。启动词自定义界面601中可以包括“取消”按钮604、“确定”按钮605、“启动词输入框”602和启动词建议603。其中,“取消”按钮604用于触发手机取消设置自定义启动词,并显示图5中的(c)所示的语音控制录音界面505。“启动词输入框”602用于接收用户输入的自定义启动词。“确定”按钮210用于保存用户在“启动词输入框”602输入的自定义启动词。启动词建议603用于提示用户手机对自定义启动词的要求。假设用户在图6所示的“启动词输入框”602输入自定义启动词“我记一下”。手机200响应于用户对图6所示的“确定”按钮605的点击操作(如单击操作),可以将手机200的启动词设置为“我记一下”。In response to the user's click operation on the "custom start word" option, the mobile phone 200 may display the start word custom interface 601 shown in FIG. 6. The start word customization interface 601 may include a "Cancel" button 604, an "OK" button 605, a "Start word input box" 602, and a start word suggestion 603. The “Cancel” button 604 is used to trigger the mobile phone to cancel the setting of the custom start word, and display the voice control recording interface 505 shown in (c) in FIG. 5. The "starting word input box" 602 is used to receive a custom starting word input by a user. The "OK" button 210 is used to save the user-defined start word input by the user in the "start word input box" 602. The start word suggestion 603 is used to prompt the user of the mobile phone's requirements for a custom start word. Assume that the user enters a custom start word "Let me remember" in the "start word input box" 602 shown in FIG. 6. In response to the user's click operation (such as a click operation) on the “OK” button 605 shown in FIG.
S403、终端200判断麦克风270C捕获的第一语音数据对应的文本是否与预置启动词匹配。S403. The terminal 200 determines whether the text corresponding to the first voice data captured by the microphone 270C matches a preset start word.
具体的,若麦克风270C捕获的第一语音数据对应的文本与预置启动词匹配,终端200则执行S404;若麦克风270C捕获的第一语音数据对应的文本与预置启动词不匹配,终端200则继续执行S402:Specifically, if the text corresponding to the first speech data captured by the microphone 270C matches the preset start word, the terminal 200 executes S404; if the text corresponding to the first speech data captured by the microphone 270C does not match the preset start word, the terminal 200 Then proceed to S402:
S404、终端200录制终端300的语音数据。S404. The terminal 200 records voice data of the terminal 300.
其中,本申请实施例中,第一语音数据对应的文本与预置启动词匹配,具体为:第一语音数据对应的文本包括预置启动词。Wherein, in the embodiment of the present application, the text corresponding to the first voice data matches the preset start word. Specifically, the text corresponding to the first voice data includes the preset start word.
示例性的,结合上述实例,终端200的预置启动词为“我记一下”。如图3所示,终端200与终端300进行语音通信的过程中,用户210发出语音数据1“可以提供一下您的地址和电话吗?”。终端200的麦克风270C则可以捕获到语音数据1“可以提供一下您的地址和电话吗?”。终端200可以识别该语音数据1,得到该语音数据1的文本“可以提供一下您的地址和电话吗?”,并确定语音数据1的文本与预置启动词“我记一下”不匹配。终端200与终端300进行语音通信的过程中,用户210发出语音数据2“我记一下”(即第一语音数据)。终端200的麦克风270C则可以捕获到语音数据2“我记一下”。终端200可以识别该语音数据2,得到该语音数据2的文本“我记一下”,并确定语音数据2的文本与预置启动词“我记一下”匹配。终端200确定语音数据2的文本与预置启动词“我记一下”匹配时,便可以开始录制语音通信对应的语音数据。如图3 所示,终端200可以在t1时刻开始录制语音通信对应的语音数据。Exemplarily, in combination with the above-mentioned example, the preset start word of the terminal 200 is "let me remember". As shown in FIG. 3, during the voice communication between the terminal 200 and the terminal 300, the user 210 sends out voice data 1 "Can you provide your address and phone number?" The microphone 270C of the terminal 200 can capture the voice data 1 "Can you provide your address and phone number?" The terminal 200 can identify the voice data 1 and obtain the text "Can you provide your address and phone number?" Of the voice data 1 and determine that the text of the voice data 1 does not match the preset start word "let me remember". During the voice communication between the terminal 200 and the terminal 300, the user 210 sends out the voice data 2 "Let me remember" (that is, the first voice data). The microphone 270C of the terminal 200 can capture the voice data 2 "Let me remember". The terminal 200 may recognize the voice data 2 and obtain a text "Let me remember" of the voice data 2 and determine that the text of the voice data 2 matches a preset start word "Let me remember". When the terminal 200 determines that the text of the voice data 2 matches the preset start word “let me remember”, it can start recording the voice data corresponding to the voice communication. As shown in FIG. 3, the terminal 200 may start recording voice data corresponding to voice communication at time t1.
本申请实施例中,终端300的语音数据是由接收自终端300的音频电信号转换而来的。In the embodiment of the present application, the voice data of the terminal 300 is converted from audio electrical signals received from the terminal 300.
可选的,终端200还可以录制终端200的麦克风270C捕获的语音数据。如果麦克风270C捕获的第一语音数据对应的文本与预置启动词匹配,终端200不仅可以执行S404,还可以执行S404':Optionally, the terminal 200 may also record voice data captured by the microphone 270C of the terminal 200. If the text corresponding to the first voice data captured by the microphone 270C matches the preset start word, the terminal 200 may execute not only S404, but also S404 ':
S404'、终端200录制终端200的麦克风270C捕获的语音数据。S404 ', the terminal 200 records voice data captured by the microphone 270C of the terminal 200.
也就是说,终端200录制的语音数据可以包括:终端200的麦克风270C捕获的语音数据,以及由接收自终端300的音频电信号转换而来的语音数据。That is, the voice data recorded by the terminal 200 may include: voice data captured by the microphone 270C of the terminal 200, and voice data converted from audio electrical signals received from the terminal 300.
其中,终端200的麦克风270C可以接收用户210发出的语音数据(即第一声音信号)。终端200可以将第一声音信号转换为第一音频电信号。然后,终端200向终端300发送该第一音频电信号。终端300可以将来自终端200的第一音频电信号转换为第一声音信号,并通过受话器(也称“听筒”)播放该第一声音信号,或者通过终端300的扬声器(也称“喇叭”)播放该第一声音信号。其中,上述“麦克风270C捕获的语音数据”具体为麦克风捕获的第一声音信号。The microphone 270C of the terminal 200 can receive voice data (ie, a first sound signal) sent by the user 210. The terminal 200 may convert a first sound signal into a first audio electrical signal. Then, the terminal 200 sends the first audio electrical signal to the terminal 300. The terminal 300 may convert the first audio electrical signal from the terminal 200 into a first sound signal, and play the first sound signal through a receiver (also referred to as a “handset”), or through a speaker of the terminal 300 (also referred to as a “horn”) Playing the first sound signal. The “voice data captured by the microphone 270C” is specifically the first sound signal captured by the microphone.
同样的,终端300的麦克风(也称“话筒”)可以接收用户310发出的语音数据(即第二声音信号)。终端300可以将第二声音信号转换为第二音频电信号。然后,终端300向终端200发送第二音频电信号。终端200可以将来自终端300的第二音频电信号转换为第二声音信号,并通过受话器270B(也称“听筒”)播放该第二声音信号,或者通过终端200的扬声器270A(也称“喇叭”)播放该第二声音信号。其中,上述“由接收自终端300的音频电信号转换而来的语音数据”具体为:终端200转换接收自终端300的第二音频电信号得到的第二声音信号。Similarly, a microphone (also referred to as a “microphone”) of the terminal 300 can receive voice data (ie, a second sound signal) sent by the user 310. The terminal 300 may convert the second sound signal into a second audio electrical signal. Then, the terminal 300 sends a second audio electric signal to the terminal 200. The terminal 200 may convert the second audio electrical signal from the terminal 300 into a second sound signal, and play the second sound signal through the receiver 270B (also referred to as a “handset”), or through the speaker 270A (also referred to as a “horn” ") Play the second sound signal. The “voice data converted from the audio electrical signals received from the terminal 300” is specifically: the second sound signal obtained by the terminal 200 converting the second audio electrical signals received from the terminal 300.
可选的,如图4所示,在S404之后,本申请实施例的方法还可以包括S405-S406:Optionally, as shown in FIG. 4, after S404, the method in the embodiment of the present application may further include S405-S406:
S405、在录制语音数据过程中,终端200判断麦克风270C捕获的第二语音数据对应的文本是否与预置结束词匹配。S405. During the recording of the voice data, the terminal 200 determines whether the text corresponding to the second voice data captured by the microphone 270C matches a preset ending word.
具体的,若麦克风270C捕获的第二语音数据对应的文本与预置结束醒词匹配,终端200则执行S406;若麦克风270C捕获的第二语音数据对应的文本与预置结束醒词不匹配,终端200则继续录制语音数据:Specifically, if the text corresponding to the second voice data captured by the microphone 270C matches the preset end wake word, the terminal 200 executes S406; if the text corresponding to the second voice data captured by the microphone 270C does not match the preset end wake word, The terminal 200 continues to record voice data:
S406、终端200停止录制语音数据。S406. The terminal 200 stops recording voice data.
示例性的,本申请实施例中的预置结束词可以是终端200出厂时,配置在终端200中的默认启动词。例如,手机200中的预置结束词可以为图5中的(c)所示的默认结束词“结束录音”。或者,该预置启动词可以是用户在终端200中设置的自定义启动词。例如,手机200响应于用户对图5中的(c)所示的“自定义结束词”选项512的点击操作,可以显示结束词自定义界面。结束词自定义界面用于设置结束词。结束词自定义界面的具体内容可以参考图6中所示的启动词自定义界面601。假设用户在结束词自定义界面为手机200(即终端200)设置自定义结束词“我记好了”。Exemplarily, the preset ending word in the embodiment of the present application may be a default starting word configured in the terminal 200 when the terminal 200 is shipped from the factory. For example, the preset ending word in the mobile phone 200 may be a default ending word “end recording” shown in (c) in FIG. 5. Alternatively, the preset start word may be a custom start word set by the user in the terminal 200. For example, the mobile phone 200 may display an end word customization interface in response to a user's click operation on the "custom end word" option 512 shown in (c) in FIG. 5. The end word custom interface is used to set the end word. For specific content of the end word customization interface, refer to the start word customization interface 601 shown in FIG. 6. Assume that the user sets a custom ending word "I remember" for the mobile phone 200 (ie, the terminal 200) in the ending word customization interface.
举例来说,本申请实施例中的预置结束词可以为“结束录音”、“我记好了”、“结束记录”、“记好了”和“记录完毕”等。本申请实施例中,以终端200的预置结束词为“我记好了”为例,对本申请实例的方法进行说明。For example, the preset ending words in the embodiment of the present application may be "end recording", "I remembered", "end recording", "remembered", "recording completed", and the like. In the embodiment of the present application, the method of the example of the present application is described by taking the preset ending word of the terminal 200 as "I remember well" as an example.
如图3所示,终端200与终端300进行语音通信的过程中,从终端200开始录制语音数据(即从t1时刻)开始,用户310发出语音数据3“深圳市南山区科发路1号”。终端300的麦克风可以捕获到该语音数据3,并将语音数据3(即声音信号)转换为音频电信号1。终端300向终端200发送音频电信号1。终端200将来自终端300的音频电信号1转换为声音信号(即语音数据3),终端300保存该语音数据3。终端200的麦克风270C则可以捕获到语音数据4“嗯”,并录制该语音数据4。终端200接收到终端300发送的音频电信号2。终端200将音频信号2转换为声音信息,即语音数据5“电话号码是88776655”。终端200保存该语音数据4。As shown in FIG. 3, during the voice communication between the terminal 200 and the terminal 300, since the terminal 200 starts recording voice data (that is, from time t1), the user 310 sends out voice data 3 "No. 1, Kefa Road, Nanshan District, Shenzhen" . The microphone of the terminal 300 can capture the voice data 3 and convert the voice data 3 (ie, a sound signal) into an audio electrical signal 1. The terminal 300 sends an audio electric signal 1 to the terminal 200. The terminal 200 converts the audio electrical signal 1 from the terminal 300 into a sound signal (ie, voice data 3), and the terminal 300 stores the voice data 3. The microphone 270C of the terminal 200 can capture the voice data 4 "um" and record the voice data 4. The terminal 200 receives the audio electric signal 2 sent by the terminal 300. The terminal 200 converts the audio signal 2 into sound information, that is, voice data 5 "the telephone number is 88767665". The terminal 200 stores the voice data 4.
其中,本申请实施例中,第二语音数据对应的文本与预置结束词匹配,具体为:第二语音数据对应的文本包括预置结束词。Wherein, in the embodiment of the present application, the text corresponding to the second voice data matches the preset ending word, and specifically, the text corresponding to the second voice data includes the preset ending word.
终端200的麦克风270C可以捕获到语音数据6“好的,我记好了”。终端200可以识别该语音数据6,得到该语音数据6的文本“好的,我记好了”,并确定语音数据6的文本中包括预置结束词“我记好了”。即终端200可以确定语音数据6的文本与预置结束词“我记好了”匹配。终端200确定语音数据6的文本与预置结束词“我记好了”匹配时,便可以结束录制语音通信对应的语音数据。如图3所示,终端200可以在t2时刻结束(即停止)录制语音通信对应的语音数据。The microphone 270C of the terminal 200 can capture the voice data 6 "OK, I remember well". The terminal 200 may recognize the voice data 6 and obtain a text "OK, I remember" of the voice data 6, and determine that the text of the voice data 6 includes a preset ending word "I remember". That is, the terminal 200 may determine that the text of the voice data 6 matches the preset ending word "I remembered". When the terminal 200 determines that the text of the voice data 6 matches the preset ending word “I remembered”, it can end recording the voice data corresponding to the voice communication. As shown in FIG. 3, the terminal 200 may end (that is, stop) recording the voice data corresponding to the voice communication at time t2.
本申请实施例中,终端200与终端300进行语音通信的过程中,当终端200的麦克风207C捕获的第一语音数据对应的文本与预置启动词文本匹配时,终端200可以自动开始录制语音通信对应的语音数据。换言之,终端200在接收用户210发出的语音命令(即文本与预置启动词文本匹配的第一语音数据)后,便可以自动开始录制语音通信对应的语音数据。In the embodiment of the present application, during the voice communication between the terminal 200 and the terminal 300, when the text corresponding to the first voice data captured by the microphone 207C of the terminal 200 matches the text of the preset start word, the terminal 200 may automatically start recording voice communication Corresponding voice data. In other words, after receiving the voice command (ie, the first voice data whose text matches the text of the preset start word) issued by the user 210, the terminal 200 can automatically start recording the voice data corresponding to the voice communication.
并且,在终端200录制语音数据的过程中,当终端200的麦克风207C捕获的第二语音数据对应的文本与预置结束词文本匹配时,终端200可以自动停止录制语音数据。换言之,终端200在接收用户210发出的语音命令(即文本与预置结束词文本匹配的第二语音数据)后,便可以自动停止录制语音数据。In addition, during the recording of the voice data by the terminal 200, when the text corresponding to the second voice data captured by the microphone 207C of the terminal 200 matches the text of the preset ending word, the terminal 200 may automatically stop recording the voice data. In other words, the terminal 200 can automatically stop recording the voice data after receiving the voice command (ie, the second voice data whose text matches the preset end word text) issued by the user 210.
综上所述,终端200响应于用户在语音通话过程中发出的语音命令,可以自动录制和停止录制语音数据。通过本方案,使得终端更加智能化,并且提高了终端与用户的交互性能,提升了用户体验。In summary, the terminal 200 can automatically record and stop recording voice data in response to a voice command issued by a user during a voice call. This solution makes the terminal more intelligent, improves the interaction performance between the terminal and the user, and improves the user experience.
其中,本申请实施例中的S405-S406是可选的。终端200可以在语音通信结束时,停止录制语音数据。或者,终端200可以在录制了预设时间(如1分钟、2分钟或者5分钟等)的语音数据后,自动停止录制语音数据。Among them, S405-S406 in the embodiments of the present application are optional. The terminal 200 may stop recording voice data when the voice communication ends. Alternatively, the terminal 200 may automatically stop recording the voice data after recording the voice data for a preset time (such as 1 minute, 2 minutes, or 5 minutes).
可选的,终端200还可以在开始录制语音通信对应的语音数据时,向用户提示终端200开始录制语音数据。具体的,在S403之后,如果麦克风270C捕获的第一语音数据对应的文本与预置启动词匹配,本申请实施例的方法还可以包括S701:Optionally, the terminal 200 may also prompt the user to start recording the voice data when the user starts recording the voice data corresponding to the voice communication. Specifically, after S403, if the text corresponding to the first voice data captured by the microphone 270C matches a preset start word, the method in this embodiment of the present application may further include S701:
S701、终端200发出第一提示信息。该第一提示信息用于提示用户终端200开始录制语音数据。S701. The terminal 200 sends a first prompt message. The first prompt information is used to prompt the user terminal 200 to start recording voice data.
示例性的,本申请实施例中的第一提示信息可以为提示音或者振动提示。例如,本申请实施例中的提示音可以为单音节提示音,如“叮”、“嘀”或者“嗖”等。或者,提示音可以为N秒长度铃音,如N等于2、3或5。或者,提示音为声音信号,如“开始 记录”。Exemplarily, the first prompt information in the embodiment of the present application may be a prompt sound or a vibration prompt. For example, the prompt tone in the embodiment of the present application may be a single-syllable prompt tone, such as "ding", "嘀", or "嗖". Alternatively, the prompt tone can be a ring tone of N seconds in length, for example, N is 2, 3, or 5. Alternatively, the tone is a sound signal, such as "Start recording".
本申请实施例中,终端200可以在第一语音数据对应的文本与预置启动词匹配时,即终端200开始录制语音数据时,发出第一提示信息,以提示用户终端200开始录制语音数据。如此,用户便可以通过该第一提示信息获知终端200已经开始录制语音数据,增加了终端200与用户直接的互动,提升了终端200与用户的交互性能,提升用户体验。In the embodiment of the present application, the terminal 200 may send a first prompt message when the text corresponding to the first voice data matches a preset start word, that is, when the terminal 200 starts recording voice data, to prompt the user terminal 200 to start recording voice data. In this way, the user can know through the first prompt information that the terminal 200 has started recording voice data, which increases the direct interaction between the terminal 200 and the user, improves the interaction performance between the terminal 200 and the user, and improves the user experience.
可选的,终端200还可以在停止录制语音数据时,向用户提示终端200停止录制语音数据。具体的,在S405之后,若麦克风270C捕获的第二语音数据对应的文本与预置结束醒词匹配,本申请实施例的方法还可以包括S702:Optionally, the terminal 200 may also prompt the user to stop recording the voice data when the terminal 200 stops recording the voice data. Specifically, after S405, if the text corresponding to the second voice data captured by the microphone 270C matches the preset end word, the method in this embodiment of the present application may further include S702:
S702、终端200发出第二提示信息。该第二提示信息用于提示用户终端200停止录制语音数据。S702: The terminal 200 sends a second prompt message. The second prompt information is used to prompt the user terminal 200 to stop recording voice data.
示例性的,本申请实施例中的第二提示信息可以为提示音或者振动提示。例如,本申请实施例中的提示音可以为单音节提示音,如“叮”、“嘀”或者“嗖”等。或者,提示音可以为N秒长度铃音,如N等于2、3或5。或者,提示音为声音信号,如“结束记录”。Exemplarily, the second prompt information in the embodiment of the present application may be a prompt sound or a vibration prompt. For example, the prompt tone in the embodiment of the present application may be a single-syllable prompt tone, such as "ding", "嘀", or "嗖". Alternatively, the prompt tone can be a ring tone of N seconds in length, for example, N is 2, 3, or 5. Alternatively, the prompt tone is a sound signal, such as "end recording".
其中,本申请实施例中的第二提示信息与第一提示信息可以相同,也可以不同。第二提示信息与第一提示信息不同,具体可以包括以下三种情况:The second prompt information and the first prompt information in the embodiment of the present application may be the same or different. The second prompt information is different from the first prompt information, and may specifically include the following three situations:
情况(1):第一提示信息为提示音,第二提示信息为振动提示。情况(2):第一提示信息和第二提示信息均为提示音,但第一提示信息对应的提示音和第二提示信息对应的提示音不同。例如,第一提示信息对应的提示音为“叮”,第二提示信息对应的提示音为“嘀”;或者,第一提示信息对应的提示音为“开始记录”,第二提示信息对应的提示音为“结束记录”。情况(3):第一提示信息和第二提示信息均为振动提示,但第一提示信息对应的振动提示和第二提示信息对应的振动提示的振动方式不同。例如。第一提示信息对应的振动提示的振动方式为单次振动,第二提示信息对应的振动提示的振动方式为连续两次振动。Case (1): The first prompt message is a prompt tone, and the second prompt message is a vibration prompt. Case (2): The first prompt information and the second prompt information are both prompt sounds, but the prompt sound corresponding to the first prompt information and the second prompt information are different. For example, the prompt sound corresponding to the first prompt information is "ding", the prompt sound corresponding to the second prompt information is "嘀"; or, the prompt sound corresponding to the first prompt information is "start recording", and the second prompt information corresponds to The beep is "End Record". Case (3): Both the first prompt information and the second prompt information are vibration prompts, but the vibration prompts corresponding to the first prompt information and the vibration prompts corresponding to the second prompt information have different vibration modes. E.g. The vibration mode of the vibration prompt corresponding to the first prompt information is a single vibration, and the vibration mode of the vibration prompt corresponding to the second prompt information is two consecutive vibrations.
本申请实施例中,终端200可以在第二语音数据对应的文本与预置启动词匹配时,即终端200停止录制语音数据时,发出第二提示信息,以提示用户终端200停止录制语音数据。如此,用户便可以通过该第而提示信息获知终端200已经停止录制语音数据,增加了终端200与用户直接的互动,提升了终端200与用户的交互性能,提升用户体验。In the embodiment of the present application, the terminal 200 may send a second prompt message when the text corresponding to the second voice data matches a preset start word, that is, when the terminal 200 stops recording voice data, to prompt the user terminal 200 to stop recording voice data. In this way, the user can know that the terminal 200 has stopped recording voice data through the prompt information, which increases the direct interaction between the terminal 200 and the user, improves the interaction performance between the terminal 200 and the user, and improves the user experience.
可以理解,在终端200使用扬声器270A播放语音数据的情况下,当上述第一提示信息或者第一提示信息为提示音时,终端200发出的提示音可能会被终端200的麦克风207C捕获到。在这种情况下,终端200会将麦克风207C捕获到的提示音发送到终端300。如此,用户310也可以听到该提示音,影响用户通话体验。It can be understood that when the terminal 200 uses the speaker 270A to play voice data, when the first prompt information or the first prompt information is a prompt tone, the prompt tone emitted by the terminal 200 may be captured by the microphone 207C of the terminal 200. In this case, the terminal 200 sends the prompt sound captured by the microphone 207C to the terminal 300. In this way, the user 310 can also hear the prompt tone, which affects the user's call experience.
本申请实施例中,为了避免终端200的语音通信对端用户(即用户310)听到终端200发出的提示音。终端200可以在麦克风270C捕获的第一语音数据对应的文本与预置启动词匹配时,判断终端200在使用扬声器270A播放语音数据还是在使用受话器270B播放语音数据。在终端200使用扬声器270A播放语音数据的情况下,终端200可以根据扬声器270A播放的语音数据对麦克风270C采集的语音数据进行回声抑 制。这样,回声抑制后的语音数据中则不会包括上述提示音。终端200向终端300发送的语音数据为回声抑制后的语音数据。终端200向终端300发送的语音数据则不会包括上述提示音。用户310则不会听到上述提示音。In the embodiment of the present application, in order to prevent the peer user (ie, the user 310) of the voice communication of the terminal 200 from hearing the prompt tone from the terminal 200. When the text corresponding to the first voice data captured by the microphone 270C matches the preset start word, the terminal 200 may determine whether the terminal 200 is playing the voice data using the speaker 270A or the voice receiver 270B. In the case where the terminal 200 uses the speaker 270A to play voice data, the terminal 200 may perform echo suppression on the voice data collected by the microphone 270C according to the voice data played by the speaker 270A. In this way, the voice data after the echo suppression will not include the above-mentioned prompt tone. The voice data sent by the terminal 200 to the terminal 300 is voice data after echo suppression. The voice data sent by the terminal 200 to the terminal 300 does not include the above-mentioned prompt tone. The user 310 will not hear the prompt tone.
进一步的,终端200可以在播放第二提示信息(即提示音)后,停止对麦克风270C采集的语音数据进行回声抑制。这样,可以避免终端200持续执行回声抑制所产生的功耗,延长终端200的续航时间。Further, the terminal 200 may stop performing echo suppression on the voice data collected by the microphone 270C after playing the second prompt information (that is, the prompt sound). In this way, the power consumption caused by the terminal 200 continuously performing echo suppression can be avoided, and the battery life of the terminal 200 can be extended.
其中,终端200录制的语音数据可以包括一段语音数据或者至少两段语音数据。例如,结合图3对应的实例。终端200可以录制三段语音数据:“深圳市南山区科发路1号”、“嗯”和“电话号码是88776655”。The voice data recorded by the terminal 200 may include one piece of voice data or at least two pieces of voice data. For example, in conjunction with the example corresponding to FIG. 3. The terminal 200 can record three pieces of voice data: "No. 1, Kefa Road, Nanshan District, Shenzhen", "Hmm", and "The phone number is 88767655".
本申请实施例中,终端200可以按照录制每段语音数据的时间先后顺序,以及每段语音数据的来源信息,保存上述录制的语音数据。其中,上述来源信息用于指示语音数据是麦克风270C捕获的语音数据,或者由接收自终端300的音频电信号转换而来的语音数据。In the embodiment of the present application, the terminal 200 may save the recorded voice data according to the chronological order of recording each piece of voice data and the source information of each piece of voice data. The source information is used to indicate that the voice data is voice data captured by the microphone 270C, or voice data converted from audio and electrical signals received from the terminal 300.
例如,终端200录制上述三段语音数据的时间先后顺序为:语音数据3“深圳市南山区科发路1号”、语音数据4“嗯”和语音数据5“电话号码是88776655”。语音数据3“深圳市南山区科发路1号”和语音数据5“电话号码是88776655”是由接收自终端300的音频电信号转换而来的语音数据。语音数据4“嗯”是终端200的麦克风270C捕获的语音数据。For example, the time sequence of recording the above three pieces of voice data by the terminal 200 is: voice data 3 “No. 1 Kefa Road, Nanshan District, Shenzhen”, voice data 4 “um”, and voice data 5 “phone number is 88767655”. The voice data 3 “No. 1 Kefa Road, Nanshan District, Shenzhen” and the voice data 5 “the telephone number is 88767655” are voice data converted from the audio and electric signals received from the terminal 300. The voice data 4 "um" is voice data captured by the microphone 270C of the terminal 200.
示例性的,终端200可以采用表格的方式保存上述语音通信过程中录制的语音数据。假设终端200与终端300进行图6所示的语音通信的通话时间为2018年8月8日08:06,通话时长为21分钟。如表1所示,为本申请实施例示出的一种语音数据表实例:Exemplarily, the terminal 200 may save the voice data recorded during the voice communication process in a table manner. Assume that the talk time between the terminal 200 and the terminal 300 for the voice communication shown in FIG. 6 is 08:06 on August 8, 2018, and the call duration is 21 minutes. As shown in Table 1, an example of a voice data table shown in this embodiment of the present application:
表1Table 1
Figure PCTCN2018105571-appb-000001
Figure PCTCN2018105571-appb-000001
如表1所示,语音数据表中保存了终端200与终端300(电话号码为138****5678)通话过程中录制的语音数据。表1中按照录制语音数据的时间先后顺序和语音数据的来源依次保存了电话号码为138****5678,通话时间为2018年8月8日08:06的语音通话过程中,终端200录制的语音数据3“深圳市南山区科发路1号”、语音数据4“嗯”和语音数据5“电话号码是88776655”。并且,表1中还标注了各个语音数据的来源。如语音数据3和语音数据5的来源为对方。即语音数据3和语音数据5是由接收自终端300的音频电信号转换而来的语音数据。语音数据4的来源为本机。即语音数据4是终端200的麦克风270C捕获的语音数据。As shown in Table 1, the voice data table stores voice data recorded during the call between the terminal 200 and the terminal 300 (phone number is 138 **** 5678). In Table 1, according to the chronological order of the recorded voice data and the source of the voice data, the phone number is 138 **** 5678 and the call time is 08:06, August 8, 2018. During the voice call, the terminal 200 records Voice data 3 "No. 1 Kefa Road, Nanshan District, Shenzhen", voice data 4 "um" and voice data 5 "phone number is 88767655". In addition, Table 1 also indicates the source of each voice data. For example, the source of voice data 3 and voice data 5 is the other party. That is, the voice data 3 and the voice data 5 are voice data converted from audio electric signals received from the terminal 300. The source of the voice data 4 is local. That is, the voice data 4 is voice data captured by the microphone 270C of the terminal 200.
在一种实现方式中,终端200可以按照语音数据的来源来对终端录制的语音数据 进行分段。具体的,结合图3所示的实例,终端200录制上述语音数据3(由接收自终端300的音频电信号转换而来的语音数据)后,终端200的麦克风270C捕获到上述语音数据4。由于语音数据3的来源与语音数据4不同;因此,终端200可以将语音数据3和语音数据4分段保存。终端200的麦克风270C捕获到上述语音数据4后,终端200接收到来自终端300的音频电信号(语音数据5对应的音频电信号)。由于语音数据4和语音数据5的来源不同;因此,终端200可以将语音数据4和语音数据5分段保存。In one implementation, the terminal 200 may segment the voice data recorded by the terminal according to the source of the voice data. Specifically, in conjunction with the example shown in FIG. 3, after the terminal 200 records the voice data 3 (speech data converted from the audio and electrical signals received from the terminal 300), the microphone 270C of the terminal 200 captures the voice data 4. Since the source of the voice data 3 is different from the voice data 4; therefore, the terminal 200 can save the voice data 3 and the voice data 4 in segments. After the above-mentioned voice data 4 is captured by the microphone 270C of the terminal 200, the terminal 200 receives an audio electric signal (an audio electric signal corresponding to the voice data 5) from the terminal 300. Since the sources of the voice data 4 and the voice data 5 are different; therefore, the terminal 200 can save the voice data 4 and the voice data 5 in segments.
在一些应用场景中,终端200与终端300进行语音通信的过程中,用户210发出一段语音数据(如语音数据a)后,用户210和用户310都会沉默一段时间,然后用户210再发出一段语音数据(如语音数据b)。或者,终端200与终端300进行语音通信的过程中,用户310发出一段语音数据后,用户210和用户310都会沉默一段时间,然后用户310再发出一段语音数据。在这种场景中,终端200对终端录制的语音数据进行分段时,不仅可以参考语音数据的来源,还可以参考语音数据的间隔时间。例如,终端200与终端300进行语音通信的过程中,用户210发出一段语音数据(如语音数据a),终端200的麦克风270C捕获到该语音数据a。麦克风270C捕获到该语音数据a后,如果在一定时间(如1分钟、2分钟或者5分钟等)内,麦克风270C没有捕获到新的语音数据,终端200也没有接收到来自终端300的音频电信号,在该一定时间之后,麦克风270C捕获到语音数据b,终端200可以将语音数据a和语音数据b分段保存。终端200的麦克风270C捕获到上述语音数据b后,终端200接收到来自终端300的音频电信号(语音数据c对应的音频电信号)。由于语音数据b和语音数据c的来源不同;因此,终端200可以将语音数据b和语音数据c分段保存。In some application scenarios, during the voice communication between the terminal 200 and the terminal 300, after the user 210 sends out a piece of voice data (such as the voice data a), the user 210 and the user 310 are silent for a period of time, and then the user 210 sends out another piece of voice data (Such as voice data b). Alternatively, during the voice communication between the terminal 200 and the terminal 300, after the user 310 sends out a piece of voice data, the user 210 and the user 310 are silent for a period of time, and then the user 310 sends out another piece of voice data. In this scenario, when the terminal 200 segments the voice data recorded by the terminal, not only the source of the voice data but also the interval time of the voice data can be referred to. For example, during the voice communication between the terminal 200 and the terminal 300, the user 210 sends out a piece of voice data (such as voice data a), and the microphone 270C of the terminal 200 captures the voice data a. After the microphone 270C captures the voice data a, if within a certain period of time (such as 1 minute, 2 minutes, or 5 minutes, etc.), the microphone 270C does not capture new voice data, and the terminal 200 does not receive audio data from the terminal 300. Signal, after the certain time, the voice data b is captured by the microphone 270C, and the terminal 200 may save the voice data a and the voice data b in segments. After the above-mentioned voice data b is captured by the microphone 270C of the terminal 200, the terminal 200 receives an audio electric signal (an audio electric signal corresponding to the voice data c) from the terminal 300. Since the sources of the voice data b and the voice data c are different; therefore, the terminal 200 may save the voice data b and the voice data c in segments.
示例性的,终端200可以采用表格的方式保存上述语音通信过程中录制的语音数据a、语音数据b和语音数据c。如表2所示,为本申请实施例示出的一种语音数据表实例:Exemplarily, the terminal 200 may save the voice data a, voice data b, and voice data c recorded in the foregoing voice communication process in a table manner. As shown in Table 2, an example of a voice data table shown in this embodiment of the present application:
表2Table 2
Figure PCTCN2018105571-appb-000002
Figure PCTCN2018105571-appb-000002
表2中按照录制语音数据的时间先后顺序和语音数据的来源依次保存了电话号码为138****5678,通话时间为2018年8月8日08:06的语音通话过程中,终端200录制的语音数据a、语音数据b和语音数据c。并且,表1中还标注了各个语音数据的来源。如语音数据a的来源为本机,语音数据b的来源为本机,语音数据c的来源为对方。即语音数据a和语音数据b是终端200的麦克风270C捕获的语音数据,语音数据c是由接收自终端300的音频电信号转换而来的语音数据。In Table 2, according to the chronological order of the recorded voice data and the source of the voice data, the phone number is 138 **** 5678 and the call time is 08:06, August 8, 2018. During the voice call, the terminal 200 records Voice data a, voice data b, and voice data c. In addition, Table 1 also indicates the source of each voice data. For example, the source of voice data a is the local machine, the source of voice data b is the local machine, and the source of voice data c is the other party. That is, the voice data a and the voice data b are voice data captured by the microphone 270C of the terminal 200, and the voice data c is voice data converted from audio electric signals received from the terminal 300.
上述语音数据可以与保存对应语音通信的通话记录信息保存在一起。即终端200可以将上述语音数据保存在用于保存通话记录信息的存储区域。或者,上述语音数据 可以保存在一个单独的存储区域。本申请实施例这里不予赘述。如果终端200将上述语音数据与通话记录信息保存在一起,那么可能会因为终端200在一些语音通信过程中录制了语音数据,在另一些语音通信过程中没有录制语音数据,使得部分通话记录对应的信息中包括终端200录制的语音数据,部分通讯记录对应的信息中不包括终端200录制的语音数据。例如,如表3所示,为本申请实施例示出的一种通话记录信息表实例:The above-mentioned voice data may be stored together with the call log information corresponding to the corresponding voice communication. That is, the terminal 200 may save the above-mentioned voice data in a storage area for storing call record information. Alternatively, the above voice data can be stored in a separate storage area. This embodiment of the present application will not repeat them here. If the terminal 200 stores the above-mentioned voice data and call log information together, it may be because the terminal 200 records voice data during some voice communication processes, and does not record voice data during other voice communication processes, so that some call records correspond to The information includes the voice data recorded by the terminal 200, and the information corresponding to some communication records does not include the voice data recorded by the terminal 200. For example, as shown in Table 3, an example of a call record information table shown in this embodiment of the present application:
表3table 3
Figure PCTCN2018105571-appb-000003
Figure PCTCN2018105571-appb-000003
在表3所示的通话记录信息表中,电话号码为138****5678,通话时间为2018年8月8日08:06的语音通话的通话记录信息中包括对应通话过程中录制的语音数据3-语音数据5。终端200在电话号码为180****1234,通话时间为2018年8月8日10:01的语音通话过程中,并未录制语音数据;因此,对应的通话记录信息中未保存语音数据。如表2所示,电话号码为180****1234,通话时间为2018年8月8日10:01的语音通话的通话记录信息的语音数据和来源这两项可以为空(如NULL)。In the call log information table shown in Table 3, the phone number is 138 **** 5678 and the call time is 08:06, August 8, 2018. The call log information of the voice call includes the voice recorded during the corresponding call. Data 3-voice data 5. The terminal 200 did not record voice data during a voice call with a phone number of 180 **** 1234 and a call time of 10:01 on August 8, 2018; therefore, no voice data was stored in the corresponding call log information. As shown in Table 2, the phone number is 180 **** 1234, and the call time is 10:01 on August 8, 2018. The voice data and source of the call log information of the voice call can be empty (such as NULL). .
相比于表1所示的语音数据表,表3所示的通话记录信息表中不仅可以包括语音通话对应的电话号码、通话时间、语音数据和语音数据的来源信息,还可以包括语音通话的通话时间等信息。Compared with the voice data table shown in Table 1, the call record information table shown in Table 3 can include not only the phone number corresponding to the voice call, the call time, the voice data and the source information of the voice data, but also the information of the voice call. Talk time and other information.
本申请实施例提供一种通信过程中信息的显示方法。终端200在语音数据录制结束后,显示包括用于播放语音数据的播放器插件的第一界面。终端200可以接收用户对播放器插件的第一操作,响应于第一操作播放对应的语音数据。An embodiment of the present application provides a method for displaying information during communication. After the voice data recording ends, the terminal 200 displays a first interface including a player plug-in for playing voice data. The terminal 200 may receive a first operation of the player plug-in by the user, and play corresponding voice data in response to the first operation.
在实现方式(1)中,终端200在语音数据录制结束后,响应于语音通信结束,可以自动显示第一界面。In implementation manner (1), after the voice data recording ends, the terminal 200 may automatically display the first interface in response to the end of the voice communication.
示例性的,如图7所示,终端200响应于语音通信结束,可以显示图7所示的第一界面701。该第一界面701中可以包括用于播放终端200录制的语音数据的播放器插件。例如,第一界面701中播放器插件702、播放器插件703和播放器插件704。其中,播放器插件702用于播放上述语音数据3。播放器插件703用于播放上述语音数据4。播放器插件704用于播放上述语音数据5。当然,第一界面701中也可以只包括一个播放器插件,该播放器插件可以用于播放终端200在语音通信过程中录制的所有语音数据,如语音数据3、语音数据4和语音数据5。For example, as shown in FIG. 7, the terminal 200 may display the first interface 701 shown in FIG. 7 in response to the end of the voice communication. The first interface 701 may include a player plug-in for playing voice data recorded by the terminal 200. For example, the player plug-in 702, the player plug-in 703, and the player plug-in 704 in the first interface 701. The player plug-in 702 is configured to play the voice data 3 described above. The player plug-in 703 is configured to play the voice data 4 described above. The player plug-in 704 is configured to play the voice data 5 described above. Of course, the first interface 701 may also include only one player plug-in, and the player plug-in may be used to play all the voice data recorded by the terminal 200 during the voice communication, such as voice data 3, voice data 4, and voice data 5.
在实现方式(2)中,终端200在语音数据录制结束后,可以自动显示第一界面。终端200语音数据录制结束后,终端200可能还在与终端300进行语音通信。这种情况下,终端200在语音数据录制结束后,在语音通信过程中,显示第一界面。In implementation manner (2), the terminal 200 may automatically display the first interface after the voice data recording ends. After the recording of the voice data of the terminal 200 ends, the terminal 200 may still be performing voice communication with the terminal 300. In this case, after the voice data recording ends, the terminal 200 displays the first interface during the voice communication.
在实现方式(3)中,在语音数据录制结束后,终端200响应于用户输入第二操作, 显示第一界面。该第二操作用于触发终端200显示终端200的通话记录界面,即第一界面。其中,终端200可以接收用户输入的第二操作。响应于第二操作,终端200显示终端200的通话记录界面。该通话记录界面中可以包括一个或多个通话记录项。一个通话记录项可以对应于一条通话记录。一条通话记录可以记录一次语音通信的对端通信号码、联系人信息(如联系人姓名或者备注名)、通话开始或者结束时间、以及通话时长等。例如,本申请实施例中的通话记录界面中包括终端200与终端300进行语音通信的通话记录项。该通话记录项用于记录终端200与终端300进行语音通信的通话记录信息,如终端300的电话号码138****5678。In implementation manner (3), after the voice data recording ends, the terminal 200 displays the first interface in response to the user inputting the second operation. This second operation is used to trigger the terminal 200 to display the call history interface of the terminal 200, that is, the first interface. The terminal 200 may receive a second operation input by a user. In response to the second operation, the terminal 200 displays a call history interface of the terminal 200. The call history interface may include one or more call history items. A call log entry may correspond to a call log. A call record can record the peer communication number of a voice communication, contact information (such as contact name or remark name), the start or end time of a call, and the duration of a call. For example, the call record interface in the embodiment of the present application includes a call record item in which the terminal 200 and the terminal 300 perform voice communication. The call record item is used to record call record information of the terminal 200 and the terminal 300 for voice communication, such as the phone number 138 **** 5678 of the terminal 300.
示例性的,第二操作可以是用户对图8中的(a)所示的手机桌面801中的“电话”图标802的点击操作(如单击操作)。Exemplarily, the second operation may be a user's click operation (such as a click operation) on the "phone" icon 802 in the mobile phone desktop 801 shown in (a) of FIG. 8.
本申请实施例中,终端200与终端300进行语音通信的通话记录项中还可以包括播放器插件。该播放器插件用于播放终端200录制的语音数据。In the embodiment of the present application, the call record items of the voice communication between the terminal 200 and the terminal 300 may further include a player plug-in. The player plug-in is used to play voice data recorded by the terminal 200.
假设终端200中保持了三条通话记录,即图8中的(b)所示的通话记录项804对应的通话记录、通话记录项805对应的通话记录和通话记录项806对应的通话记录。响应于用户对“电话”图标802的点击操作,终端200可以显示图8中的(b)所示的通话记录界面803。通话记录界面803中包括通话记录项804、通话记录项805和通话记录项806。如表1所示,终端200中保存了终端200与电话号码138****5678对应的终端进行语音通信时录制的语音数据。那么,通话记录项804则可以包括用于播放表2中的语音数据3、语音数据4和语音数据5的播放器插件807。终端200响应于用户对播放器插件807的点击操作(如单击操作),可以依次播放表1中的语音数据3、语音数据4和语音数据5。如表1所示,终端200中保存了终端200与电话号码159****7986对应的终端进行语音通信时录制的语音数据。那么,通话记录项804则可以包括用于播放表2中的语音数据7的播放器插件808。终端200响应于用户对播放器插件808的点击操作(如单击操作),可以依次播放表1中的语音数据7。It is assumed that three call records are maintained in the terminal 200, that is, a call record corresponding to the call record item 804 shown in FIG. 8 (b), a call record corresponding to the call record item 805, and a call record corresponding to the call record item 806. In response to the user's click operation on the "phone" icon 802, the terminal 200 may display the call history interface 803 shown in (b) of FIG. The call history interface 803 includes a call history entry 804, a call history entry 805, and a call history entry 806. As shown in Table 1, the terminal 200 stores voice data recorded during the voice communication between the terminal 200 and the terminal corresponding to the phone number 138 **** 5678. Then, the call record entry 804 may include a player plug-in 807 for playing the voice data 3, the voice data 4, and the voice data 5 in the table 2. In response to a user's click operation (such as a click operation) on the player plug-in 807, the terminal 200 can sequentially play the voice data 3, voice data 4, and voice data 5 in Table 1. As shown in Table 1, the terminal 200 stores voice data recorded during the voice communication between the terminal 200 and the terminal corresponding to the phone number 159 **** 7986. Then, the call record entry 804 may include a player plug-in 808 for playing the voice data 7 in the table 2. In response to the user's click operation (such as a click operation) on the player plug-in 808, the terminal 200 can sequentially play the voice data 7 in Table 1.
可以理解,终端200在与电话号码为159****7986对应的终端进行语音通信的过程中,没有录制语音数据。因此,通话记录项806中不显示播放器插件。It can be understood that, during the voice communication between the terminal 200 and the terminal corresponding to the phone number 159 **** 7986, no voice data is recorded. Therefore, the player plug-in is not displayed in the call log item 806.
其中,在本申请中,通话记录界面中的通话记录项中包括用于播放对应语音通信过程中录制的语音数据的播放器插件。如此,终端200可以响应于用户对播放器插件的点击操作,播放对应的语音数据。并且,终端200可以在通话记录界面向用户展示播放器插件播放的语音数据与通话记录的关系,提高了终端200录制的语音数据与通话记录的关联性。Wherein, in the present application, the call record item in the call record interface includes a player plug-in for playing voice data recorded during a corresponding voice communication process. In this way, the terminal 200 can play the corresponding voice data in response to a user's click operation on the player plug-in. In addition, the terminal 200 can show the user the relationship between the voice data played by the player plug-in and the call history on the call history interface, thereby improving the correlation between the voice data recorded by the terminal 200 and the call history.
在实现方式(4)中,通话记录界面中的通话记录项中可以不包括上述播放器插件。在这种实现方式中,在语音数据录制结束后,终端200响应于用户对通话记录界面中语音通话的通话记录项的第三操作,显示第一界面。该第三操作用于触发终端200显示该通话记录项对应的记录详情界面,即第一界面。该记录详情界面中包括至少一个播放器插件。该至少一个播放器插件与上述至少两段语音数据一一对应。该至少一个播放器插件按照录制对应语音数据的时间先后顺序,以及对应语音数据的来源信息,显示在记录详情界面中。In implementation manner (4), the call record item in the call record interface may not include the above-mentioned player plug-in. In this implementation manner, after the recording of the voice data ends, the terminal 200 displays the first interface in response to the user's third operation on the call record item of the voice call in the call record interface. The third operation is used to trigger the terminal 200 to display a record detail interface corresponding to the call record item, that is, a first interface. The record details interface includes at least one player plug-in. The at least one player plug-in corresponds one-to-one with the at least two pieces of voice data. The at least one player plug-in is displayed in the record details interface according to the chronological order of recording the corresponding voice data and the source information of the corresponding voice data.
例如,响应于用户对图8中的(a)所示的手机桌面801中的“电话”图标802的点 击操作,终端200可以显示图9中的(a)所示的通话记录界面901。通话记录界面901中包括通话记录项902、通话记录项903和通话记录项904。其中,通话记录项902、通话记录项903和通话记录项904中均不包括播放器插件。但是,响应于用户对通话记录项902的第三操作,终端200可以显示图9中的(b)所示的记录详情界面905。该记录详情界面905中包括播放器插件906、播放器插件907和播放器插件908。其中,播放器插件906、播放器插件907和播放器插件908在记录详情界面905中,按照其播放的语音数据被录制的先后顺序排列。响应于用户对播放器插件906的单击操作,终端200可以播放表1所示的语音数据3“深圳市南山区科发路1号”。响应于用户对播放器插件907的单击操作,终端200可以播放表1所示的语音数据4“嗯”。响应于用户对播放器插件908的单击操作,终端200可以播放表1所示的语音数据5“电话号码是88776655”。For example, in response to a user's click operation on the "phone" icon 802 in the mobile phone desktop 801 shown in (a) of Fig. 8, the terminal 200 may display the call record interface 901 shown in (a) of Fig. 9. The call history interface 901 includes a call history entry 902, a call history entry 903, and a call history entry 904. The call log item 902, the call log item 903, and the call log item 904 do not include a player plug-in. However, in response to the user's third operation on the call record item 902, the terminal 200 may display a record detail interface 905 shown in (b) of FIG. 9. The record details interface 905 includes a player plug-in 906, a player plug-in 907, and a player plug-in 908. Among them, the player plug-in 906, the player plug-in 907, and the player plug-in 908 are arranged in the recording detail interface 905 according to the sequence in which the voice data that they play is recorded. In response to the user's click operation on the player plug-in 906, the terminal 200 can play the voice data 3 "No. 1, Kefa Road, Nanshan District, Shenzhen" shown in Table 1. In response to the user's click operation on the player plug-in 907, the terminal 200 can play the voice data 4 "hmm" shown in Table 1. In response to the user's single-click operation on the player plug-in 908, the terminal 200 can play the voice data 5 "phone number is 88767655" shown in Table 1.
其中,在本申请中,通话记录界面中的通话记录项中包括用于播放对应语音通信过程中录制的语音数据的播放器插件。如此,终端200可以响应于用户对播放器插件的点击操作,播放对应的语音数据。并且,终端200可以直观的在通话记录界面向用户展示播放器插件播放的语音数据与通话记录的关系,提高了终端200录制的语音数据与通话记录的关联性。Wherein, in the present application, the call record item in the call record interface includes a player plug-in for playing voice data recorded during a corresponding voice communication process. In this way, the terminal 200 can play the corresponding voice data in response to a user's click operation on the player plug-in. In addition, the terminal 200 can intuitively show the user the relationship between the voice data played by the player plug-in and the call history on the call history interface, thereby improving the correlation between the voice data recorded by the terminal 200 and the call history.
在本申请实施例中,终端200可以将录制的语音数据转换为文本信息,即录制的语音数据的文本。终端200录制的语音数据可以为至少两段语音数据。录制的语音数据的文本可以包括至少两段文本信息。至少两段文本信息与至少两段语音数据一一对应。其中,终端200可以按照录制文本信息对应的语音数据的时间先后顺序,以及文本信息对应的语音数据的来源信息,保存至少两段文本信息。In the embodiment of the present application, the terminal 200 may convert the recorded voice data into text information, that is, the text of the recorded voice data. The voice data recorded by the terminal 200 may be at least two pieces of voice data. The text of the recorded voice data may include at least two pieces of text information. At least two pieces of text information correspond to at least two pieces of voice data. The terminal 200 may store at least two pieces of text information according to the chronological order of the voice data corresponding to the recorded text information and the source information of the voice data corresponding to the text information.
示例性的,终端200可以采用表格的方式保存上述语音通信过程中录制的语音数据和每段语音数据对应的文本信息。结合表1,如表4所示,为本申请实施例示出的一种语音数据和文本信息表实例:Exemplarily, the terminal 200 may save the voice data recorded during the above-mentioned voice communication and the text information corresponding to each piece of voice data in a table manner. With reference to Table 1, as shown in Table 4, an example of a voice data and text information table shown in this embodiment of the present application:
表4Table 4
Figure PCTCN2018105571-appb-000004
Figure PCTCN2018105571-appb-000004
如表2所示,语音数据和文本信息表中保存了终端200与终端300(电话号码为138****5678)通话过程中录制的语音数据及其对应的文本信息。As shown in Table 2, the voice data and text information table stores the voice data recorded during the call between the terminal 200 and the terminal 300 (phone number is 138 **** 5678) and the corresponding text information.
如图10A所示,本申请实施例提供的一种通信过程中信息的显示方法可以包括S1001-S1002。如图10B所示,本申请实施例提供的一种通信过程中信息的记录及显示 方法可以包括S401-S404、S404'和S1001-S1002。As shown in FIG. 10A, a method for displaying information during a communication process provided by an embodiment of the present application may include S1001-S1002. As shown in FIG. 10B, a method for recording and displaying information in a communication process provided by the embodiment of the present application may include S401-S404, S404 ', and S1001-S1002.
S1001、在语音数据录制结束后,终端200显示包括录制的语音数据对应的文本的第一界面。该语音数据是终端200与终端300进行语音通信的过程中终端200录制的语音数据。S1001. After the recording of the voice data ends, the terminal 200 displays a first interface including text corresponding to the recorded voice data. The voice data is voice data recorded by the terminal 200 during the voice communication between the terminal 200 and the terminal 300.
其中,第一界面,以及终端200响应于第一事件显示第一界面的方法可以参考上述实现方式(1)-实现方式(4)中的详细描述,本申请实施例这里不再赘述。不同的是,该实施例中第一界面中可以语音数据对应的文本。在本实施例中,第一界面中可以包括播放器插件,也可以不包括播放器插件。For the first interface and the method for the terminal 200 to display the first interface in response to the first event, reference may be made to the detailed descriptions in the implementation manner (1) to the implementation manner (4), which are not repeatedly described in the embodiment of the present application. The difference is that the text corresponding to the voice data can be used in the first interface in this embodiment. In this embodiment, the first interface may or may not include a player plug-in.
其中,上述语音数据包括至少两段语音数据。例如,该语音数据可以包括上述语音数据3、语音数据4和语音数据5。上述语音数据对应的文本包括至少两段文本信息。至少两段语音数据与至少两段文本信息一一对应。本申请实施例中,终端200录制至少两段语音数据,保存至少两段语音数据与至少两段文本信息的方式可以参考上述实施例中的详细描述,本申请实施例这里不再赘述。The voice data includes at least two pieces of voice data. For example, the voice data may include the aforementioned voice data 3, voice data 4, and voice data 5. The text corresponding to the voice data includes at least two pieces of text information. At least two pieces of voice data correspond to at least two pieces of text information. In the embodiment of the present application, for a manner in which the terminal 200 records at least two pieces of voice data, and stores at least two pieces of voice data and at least two pieces of text information, reference may be made to the detailed description in the foregoing embodiment, which is not repeatedly described in this embodiment of the application.
示例性的,以第一界面是上述记录详情界面为例。终端200可以接收用户对终端200与终端300进行语音通信的通话记录项的第三操作。该第三操作用于触发终端200显示该通话记录项对应的记录详情界面。该记录详情界面中包括至少两段文本信息。该至少两段文本信息与上述至少两段语音数据一一对应。例如,响应于用户对通话记录项902的第三操作,终端200可以显示图10C所示的记录详情界面1001。该记录详情界面1001包括表2所示的语音数据3对应的文本信息“深圳市南山区科发路1号”1002、语音数据4对应的文本信息“嗯”1003和语音数据5对应的文本信息“电话号码是88776655”1004。并且,文本信息“深圳市南山区科发路1号”1002、文本信息“嗯”1003和文本信息“电话号码是88776655”1004按照其对应的语音数据被录制的先后顺序排列。并且,记录详情界面1001中还指出了每个文本信息的来源。例如,文本信息“深圳市南山区科发路1号”1002来自对方,即文本信息“深圳市南山区科发路1号”1002是由接收自终端300的音频电信号转换而来的语音数据对应的文本信息。文本信息“嗯”1003来自本机,即文本信息“嗯”1003是麦克风270C捕获的语音数据对应的文本信息。文本信息“电话号码是88776655”1004来自对方,即文本信息“电话号码是88776655”1004是由接收自终端300的音频电信号转换而来的语音数据对应的文本信息。Exemplarily, the first interface is the foregoing record detail interface as an example. The terminal 200 may receive a third operation of a call record item in which the user performs voice communication between the terminal 200 and the terminal 300. The third operation is used to trigger the terminal 200 to display a record detail interface corresponding to the call record item. The record details interface includes at least two pieces of text information. The at least two pieces of text information correspond to the at least two pieces of voice data one-to-one. For example, in response to the user's third operation on the call record item 902, the terminal 200 may display a record detail interface 1001 shown in FIG. 10C. The record details interface 1001 includes text information corresponding to the voice data 3 shown in Table 2 "No. 1, Kefa Road, Nanshan District, Shenzhen" 1002, text information corresponding to the voice data 4 "um" 1003, and text information corresponding to the voice data 5. "The phone number is 88767655" 1004. In addition, the text information "No. 1 Kefa Road, Nanshan District, Shenzhen" 1002, the text information "um" 1003, and the text information "phone number is 88767665" 1004 are arranged in the order in which their corresponding voice data were recorded. In addition, the record details interface 1001 also indicates the source of each text message. For example, the text message "No. 1 Kefa Road, Nanshan District, Shenzhen" 1002 comes from the other party, that is, the text message "No. 1 Kefa Road, Nanshan District, Shenzhen" 1002 is voice data converted from the audio and electrical signals received from the terminal 300 Corresponding text information. The text information "um" 1003 comes from the local machine, that is, the text information "um" 1003 is the text information corresponding to the voice data captured by the microphone 270C. The text message "phone number is 88767655" 1004 comes from the counterparty, that is, the text message "phone number is 88767655" 1004 is text information corresponding to voice data converted from audio and electrical signals received from the terminal 300.
本申请实施例中,终端200可以在通话记录的记录详情界面中显示终端200录制的语音数据对应的文本信息。即终端200可以直观的在通话记录界面向用户展示终端200录制的语音数据对应的文本信息,以及文本信息与通话记录的关系,提高了文本信息与通话记录的关联性。In the embodiment of the present application, the terminal 200 may display text information corresponding to the voice data recorded by the terminal 200 in a record detail interface of the call record. That is, the terminal 200 can intuitively display the text information corresponding to the voice data recorded by the terminal 200 and the relationship between the text information and the call record on the call record interface, thereby improving the relevance of the text information and the call record.
S1002、终端200可以接收用户对上述语音数据对应的文本的第一操作,响应于第一操作播放对应的语音数据。S1002. The terminal 200 may receive a user's first operation on the text corresponding to the voice data, and play the corresponding voice data in response to the first operation.
具体的,终端200还可以接收用户对记录详情界面中至少两段文本信息中任一段文本信息(如第一文本信息)的第一操作。该第一操作用户触发终端200播放第一文本信息对应的语音数据段。例如,该第一操作可以是用户对第一文本信息的单击操作、双击操作或者长按操作等任一种操作。响应于用户对第一文本信息的第一操作,终端 200可以播放第一文本信息对应的第一语音数据段。第一语音数据段是至少两段语音数据中的一段语音数据。Specifically, the terminal 200 may also receive a user's first operation on any piece of text information (such as the first text information) in at least two pieces of text information in the record detail interface. The first operation user triggers the terminal 200 to play a voice data segment corresponding to the first text information. For example, the first operation may be any operation such as a user's single-click operation, double-click operation, or long-press operation on the first text information. In response to the user's first operation on the first text information, the terminal 200 may play a first voice data segment corresponding to the first text information. The first speech data segment is a piece of speech data among at least two pieces of speech data.
示例性的,终端200可以接收用户对图10C所示的记录详情界面1001中的第一文本信息“深圳市南山区科发路1号”1002的第一操作。响应于用户对第一文本信息“深圳市南山区科发路1号”1002的第一操作,如图10C所示,终端200可以语音数据段“深圳市南山区科发路1号”。Exemplarily, the terminal 200 may receive a user's first operation on the first text information “No. 1 Kefa Road, Nanshan District, Shenzhen” 1002 in the record detail interface 1001 shown in FIG. 10C. In response to the user's first operation on the first text information "No. 1 Kefa Road, Nanshan District, Shenzhen" 1002, as shown in Fig. 10C, the terminal 200 may have a voice data segment "No. 1 Kefa Road, Nanshan District, Shenzhen".
可选的,上述记录详情界面中不仅可以包括至少两段文本信息,还可以包括至少两个播放器插件。至少两个播放器插件与至少两段文本信息一一对应。该至少两个播放器插件用于播放至少两段文本信息对应的语音数据。例如,如图11所示,记录详情界面1101中不仅可以包括文本信息“深圳市南山区科发路1号”1102、文本信息“嗯”1103和文本信息“电话号码是88776655”1104,还可以包括播放器插件1105、播放器插件1106和播放器插件1107。其中,播放器插件1105用于播放文本信息1102对应的语音数据“深圳市南山区科发路1号”。播放器插件1106用于播放文本信息1103对应的语音数据“嗯”。播放器插件1107用于播放文本信息1104对应的语音数据“电话号码是88776655”。Optionally, the above record detail interface may include not only at least two pieces of text information, but also at least two player plug-ins. At least two player plug-ins correspond one-to-one with at least two pieces of text information. The at least two player plug-ins are used to play voice data corresponding to at least two pieces of text information. For example, as shown in FIG. 11, the record details interface 1101 may include not only the text information "No. 1, Kefa Road, Nanshan District, Shenzhen" 1102, the text information "um" 1103, and the text information "phone number is 88767655" 1104, but also Including player plug-in 1105, player plug-in 1106, and player plug-in 1107. The player plug-in 1105 is used to play the voice data "No. 1, Kefa Road, Nanshan District, Shenzhen" corresponding to the text information 1102. The player plug-in 1106 is used to play the voice data "um" corresponding to the text information 1103. The player plug-in 1107 is used to play the voice data "phone number 88876655" corresponding to the text information 1104.
在一些情况下,终端200将终端200录制的语音数据转换为文本信息时,转换得到的文本信息与终端200录制的语音数据的文本可能并不完全一致。即终端200转换得到的文本信息可能会出现一些错误。本申请实施例中,终端200可以接收用户对至少两段文本信息中任一段文本信息(如第二文本信息)的第四操作(即修改操作),修改终端200中保存的第二文本信息。例如,该第四操作可以是用户对第二文本信息的单击操作、双击操作或者长按操作等任一种操作。该第四操作与上述第二操作不同。第二文本信息是至少两段文本信息中的一段文本信息。第二文本信息与至少两段语音数据中的第二语音数据段对应。如此,用户便可以控制终端200播放第二文本信息对应的第二语音数据段,对比终端200播放的第二语音数据段和终端200显示的第二文本信息。当第二文本信息与终端200播放的第二语音数据段的文本不一致时,操作终端200修改第二文本信息。In some cases, when the terminal 200 converts the voice data recorded by the terminal 200 into text information, the converted text information may not be completely consistent with the text of the voice data recorded by the terminal 200. That is, some errors may occur in the text information converted by the terminal 200. In the embodiment of the present application, the terminal 200 may receive a fourth operation (that is, a modification operation) performed by a user on at least two pieces of text information (such as second text information), and modify the second text information stored in the terminal 200. For example, the fourth operation may be any operation such as a user's single-click operation, double-click operation, or long-press operation on the second text information. This fourth operation is different from the second operation described above. The second text information is a piece of text information among at least two pieces of text information. The second text information corresponds to a second voice data segment of at least two pieces of voice data. In this way, the user can control the terminal 200 to play the second voice data segment corresponding to the second text information, and compare the second voice data segment played by the terminal 200 with the second text information displayed by the terminal 200. When the second text information is inconsistent with the text of the second voice data segment played by the terminal 200, the operation terminal 200 modifies the second text information.
其中,响应于第四操作,第一终端可以将第二文本信息修改为第三文本信息;在第一界面上显示第三文本信息;接收用户对第三文本信息的第一操作,播放第二语音数据段。In response to the fourth operation, the first terminal may modify the second text information to the third text information; display the third text information on the first interface; receive the first operation of the user on the third text information, and play the second Voice data segment.
示例性的,如图12中的(a)所示,记录详情界面1201中不仅可以包括文本信息“电话号码是8877665”1202,还可以包括播放器插件1203。播放器插件1203用于播放表2所示的语音数据5“电话号码是88776655”。响应于用户对播放器插件1203的单击操作,终端200播放语音数据5“电话号码是88776655”后,用户210发现文本信息“电话号码是8877665”1202与语音数据5不同。此时,用户可以按照语音数据5修改文本信息“电话号码是8877665”1202。Exemplarily, as shown in (a) of FIG. 12, the record details interface 1201 may include not only the text information "phone number is 8877665" 1202, but also a player plug-in 1203. The player plug-in 1203 is used to play the voice data 5 "phone number is 88767655" shown in Table 2. In response to the user's click operation on the player plug-in 1203, after the terminal 200 plays the voice data 5 "phone number is 88767665", the user 210 finds that the text message "phone number is 8877665" 1202 is different from the voice data 5. At this time, the user can modify the text information "phone number is 8877665" 1202 according to the voice data 5.
例如,终端200可以接收用户对第二文本信息“电话号码是8877665”1202的第四操作(如双击操作)。响应于该修改操作,终端200可以显示图12中的(b)所示的记录详情界面1204。记录详情界面1204中包括显示有第二文本信息的修改框1205和键盘1206。如图12中的(c)所示,用户在修改框1205中输入了正确的文本信息“电 话号码是88776655”(即第三文本信息)。响应于用户对修改框1205中“OK”按钮的点击操作,终端200可以显示图12中的(d)所示的记录详情界面1208。记录详情界面1208中包括文本信息“电话号码是88776655”1207(即第三文本信息)。此后,终端200接收用户对文本信息“电话号码是88776655”1207的第一操作,可以播放第二语音数据段(即与第二文本信息对应的语音数据段)。For example, the terminal 200 may receive a fourth operation (such as a double-click operation) performed by the user on the second text information "phone number is 8877665" 1202. In response to the modification operation, the terminal 200 may display a record detail interface 1204 shown in (b) of FIG. 12. The record details interface 1204 includes a modification box 1205 and a keyboard 1206 displaying the second text information. As shown in (c) of FIG. 12, the user entered correct text information "phone number is 88767655" (i.e., third text information) in the modification box 1205. In response to the user's click operation on the “OK” button in the modification box 1205, the terminal 200 may display the record details interface 1208 shown in (d) in FIG. 12. The record details interface 1208 includes text information "phone number is 88767655" 1207 (ie, third text information). Thereafter, the terminal 200 receives the first operation of the user on the text information "phone number is 88767665" 1207, and can play the second voice data segment (that is, the voice data segment corresponding to the second text information).
本申请实施例中,终端200可以响应于用户对文本信息的修改操作,采用修改后的文本信息替换修改前的文本信息。如此,用户则可以根据终端200中保存的语音数据修改终端200转换语音数据得到的文本信息,可以修正终端200转换语音数据得到文本信息时出现的错误。In the embodiment of the present application, the terminal 200 may respond to a user's modification operation on the text information, and replace the text information before the modification with the modified text information. In this way, the user can modify the text information obtained by the terminal 200 by converting the voice data according to the voice data stored in the terminal 200, and can correct an error that occurs when the terminal 200 converts the voice data to obtain the text information.
可以理解,通话记录界面中的通话记录项中的位置有限,可能并不足以完全显示上述至少两段文本信息。在这种情况下,通话记录项中可以包括上述至少两段文本信息的关键字。终端200响应于用户对该关键字的点击操作(如单击操作)可以显示上述至少两段文本信息及其对应的播放器插件。例如,终端200可以显示图13中的(a)所示的通话记录界面1301。通话记录界面1301中包括通话记录项1302。通话记录项1302中包括关键字“地址、电话”1303。响应于用户对关键字“地址、电话”1303的点击操作,终端200可以显示图13中的(b)所示的至少两段文本信息及其对应的播放器插件1304。It can be understood that the positions of the call log items in the call log interface are limited, and may not be sufficient to fully display the above at least two pieces of text information. In this case, the call record entry may include keywords of the at least two pieces of text information. The terminal 200 may display the at least two pieces of text information and the corresponding player plug-in in response to a user's click operation (such as a click operation) on the keyword. For example, the terminal 200 may display a call history interface 1301 shown in (a) of FIG. 13. The call history interface 1301 includes a call history item 1302. The call log item 1302 includes a keyword "address, phone" 1303. In response to a user's click operation on the keyword "address, phone" 1303, the terminal 200 may display at least two pieces of text information shown in (b) in FIG. 13 and their corresponding player plug-ins 1304.
可选的,在本申请实施例的一种实现方式中,终端200录制的至少两段语音数据仅包括由接收自终端300的音频电信号转换而来的语音数据,不包括麦克风270C捕获的语音数据。换言之,终端200仅录制来自对方的语音数据。终端200可以按照录制每段语音数据的时间先后顺序保存上述至少两段语音数据。Optionally, in an implementation manner of the embodiment of the present application, the at least two pieces of voice data recorded by the terminal 200 include only voice data converted from audio and electrical signals received from the terminal 300, and do not include voice captured by the microphone 270C. data. In other words, the terminal 200 records only voice data from the counterpart. The terminal 200 may save the at least two pieces of voice data in sequence according to the time sequence of recording each piece of voice data.
或者,在另一种实现方式中,终端200录制的至少两段语音数据仅包括麦克风270C捕获的语音数据,不包括由接收自终端300的音频电信号转换而来的语音数据。换言之,终端200仅录制本机机主发出的语音数据。终端200可以按照录制每段语音数据的时间先后顺序保存上述至少两段语音数据。Or, in another implementation manner, the at least two pieces of voice data recorded by the terminal 200 include only voice data captured by the microphone 270C, and do not include voice data converted from audio and electrical signals received from the terminal 300. In other words, the terminal 200 records only the voice data sent by the owner of the local machine. The terminal 200 may save the at least two pieces of voice data in sequence according to the time sequence of recording each piece of voice data.
在上述两种实现方式中,终端200在通话记录界面中的通话记录项中显示语音数据的文本信息或者播放器插件的方式,终端200在通话记录项对应的记录详情界面中显示语音数据的文本信息或者播放器插件的方式,可以参考上述实施例中的详细介绍,本申请实施例这里不再赘述。In the above two implementation manners, the terminal 200 displays the text information of the voice data or the player plug-in in the call record item in the call record interface, and the terminal 200 displays the text of the voice data in the record details interface corresponding to the call record item For information or the manner of the player plug-in, reference may be made to the detailed description in the foregoing embodiment, which will not be repeated here in the embodiment of the present application.
可以理解的是,上述终端200为了实现上述功能,其包含了执行各个功能相应的硬件结构和/或软件模块。本领域技术人员应该很容易意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,本申请实施例能够以硬件或硬件和计算机软件的结合形式来实现。某个功能究竟以硬件还是计算机软件驱动硬件的方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本申请实施例的范围。It can be understood that, in order to implement the foregoing functions, the terminal 200 includes a hardware structure and / or a software module corresponding to each function. Those skilled in the art should easily realize that, in combination with the units and algorithm steps of each example described in the embodiments disclosed herein, the embodiments of the present application can be implemented in the form of hardware or a combination of hardware and computer software. Whether a certain function is performed by hardware or computer software-driven hardware depends on the specific application of the technical solution and design constraints. Professional technicians can use different methods to implement the described functions for each specific application, but such implementation should not be considered to be beyond the scope of the embodiments of the present application.
本申请实施例可以根据上述方法示例对上述终端200进行功能模块的划分,例如,可以对应各个功能划分各个功能模块,也可以将两个或两个以上的功能集成在一个处理模块中。上述集成的模块既可以采用硬件的形式实现,也可以采用软件功能模块的形式实现。需要说明的是,本申请实施例中对模块的划分是示意性的,仅仅为一种逻 辑功能划分,实际实现时可以有另外的划分方式。In the embodiment of the present application, functional modules of the terminal 200 may be divided according to the foregoing method example. For example, each functional module may be divided corresponding to each function, or two or more functions may be integrated into one processing module. The above integrated modules may be implemented in the form of hardware or software functional modules. It should be noted that the division of the modules in the embodiments of the present application is schematic, and is only a logical function division. In actual implementation, there may be another division manner.
示例性的,在采用对应各个功能划分各个功能模块的情况下,图14示出了上述实施例中所涉及的终端的一种可能的结构示意图,该终端1400包括:控制模块1401、监听模块1402、自动语音识别模块1403、监听模块1404、回声抑制模块1405、录音模块1406、录音模块1407、文字记录模块1408和放音模块1409和显示模块1410。Exemplarily, in a case where each functional module is divided according to each function, FIG. 14 shows a possible structural diagram of a terminal involved in the foregoing embodiment. The terminal 1400 includes a control module 1401 and a monitoring module 1402. , Automatic speech recognition module 1403, monitoring module 1404, echo suppression module 1405, recording module 1406, recording module 1407, text recording module 1408, playback module 1409, and display module 1410.
其中,控制模块1401用于在终端1400与其他终端进行语音通信时,控制监听模块1402监听本机机主发出的语音数据。监听模块1402用于将监听到的语音数据传输至自动语音识别模块1403。自动语音识别模块1403将监听模块1402监听到的语音数据转换为文本信息。The control module 1401 is configured to control the monitoring module 1402 to monitor the voice data sent by the host when the terminal 1400 performs voice communication with other terminals. The monitoring module 1402 is configured to transmit the monitored voice data to the automatic voice recognition module 1403. The automatic voice recognition module 1403 converts the voice data monitored by the monitoring module 1402 into text information.
控制模块1402还用于判断自动语音识别模块1403识别到的文本信息(即监听模块1402监听到的语音数据的文本)是否与预置启动词匹配。控制模块1402还用于在监听模块1402监听到的语音数据的文本与预置启动词匹配时,启动监听模块1404、录音模块1406、录音模块1407和文字记录模块1408。The control module 1402 is further configured to determine whether the text information recognized by the automatic voice recognition module 1403 (that is, the text of the voice data monitored by the monitoring module 1402) matches the preset start word. The control module 1402 is further configured to start the monitoring module 1404, the recording module 1406, the recording module 1407, and the text recording module 1408 when the text of the voice data monitored by the monitoring module 1402 matches a preset start word.
其中,监听模块1404用于监听来自对端的语音数据,并将监听到的语音数据传输至自动语音识别模块1403。自动语音识别模块1403用于将监听模块1404监听到的语音数据转换为文本信息。控制模块1402还用于在监听模块1402监听到的语音数据的文本与预置启动词匹配时,控制自动语音识别模块1403将转换得到的文本信息及其来源信息传输至文字记录模块1408。文字记录模块1408用于记录来自自动语音识别模块1403的文本信息及其来源信息。录音模块1406用于记录本机机主发出的语音数据。录音模块1407用于记录来自对端的语音数据。The monitoring module 1404 is configured to monitor the voice data from the peer end and transmit the monitored voice data to the automatic voice recognition module 1403. The automatic speech recognition module 1403 is configured to convert the speech data monitored by the monitoring module 1404 into text information. The control module 1402 is further configured to control the automatic speech recognition module 1403 to transmit the converted text information and source information to the text recording module 1408 when the text of the voice data monitored by the monitoring module 1402 matches a preset start word. The text recording module 1408 is used to record text information and its source information from the automatic speech recognition module 1403. The recording module 1406 is used to record the voice data sent by the owner of the machine. The recording module 1407 is configured to record voice data from a peer end.
进一步的,控制模块1402还用于在监听模块1402监听到的语音数据的文本与预置结束词匹配时,关闭监听模块1404、录音模块1406、录音模块1407和文字记录模块1408。Further, the control module 1402 is further configured to turn off the monitoring module 1404, the recording module 1406, the recording module 1407, and the text recording module 1408 when the text of the voice data monitored by the monitoring module 1402 matches a preset ending word.
可选的,控制模块1402还用于在监听模块1402监听到的语音数据的文本与预置结束词匹配时,控制放音模块1409播放上述实施例中所述的第二提示信息,如提示音。控制模块1402还用于在监听模块1402监听到的语音数据的文本与预置结束词匹配时,启动回声抑制模块1405。控制模块1402还用于在监听模块1402监听到的语音数据的文本与预置结束词匹配时,控制放音模块1409播放上述实施例所示的第二提示信息,如提示音。回声抑制模块1405用于根据放音模块1409播放的提示音对监听模块1402监听到的语音数据对回声抑制。控制模块1402还用于在放音模块1409播放第二提示信息后,关闭放音模块1409。Optionally, the control module 1402 is further configured to control the playback module 1409 to play the second prompt information described in the foregoing embodiment, such as a prompt sound, when the text of the voice data monitored by the monitoring module 1402 matches a preset ending word. . The control module 1402 is further configured to start the echo suppression module 1405 when the text of the voice data monitored by the monitoring module 1402 matches a preset ending word. The control module 1402 is further configured to control the playback module 1409 to play the second prompt information, such as a prompt sound, shown in the foregoing embodiment when the text of the voice data monitored by the monitoring module 1402 matches a preset ending word. The echo suppression module 1405 is configured to suppress the echo of the voice data monitored by the monitoring module 1402 according to the prompt sound played by the playback module 1409. The control module 1402 is further configured to close the playback module 1409 after the playback module 1409 plays the second prompt message.
上述控制模块1401还用于控制显示模块1410显示上述实施例所述的来电提醒界面和通话记录界面等。The control module 1401 is further configured to control the display module 1410 to display an incoming call reminder interface, a call history interface, and the like according to the embodiment.
当然,终端1300包括但不限于上述所列举的单元模块。例如,终端300还可以包括接收模块和发送模块。接收模块用于接收其他终端发送的数据或者指令。发送模块用于向其他终端发送数据或者指令。并且,上述功能单元的具体所能够实现的功能也包括但不限于上述实例所述的方法步骤对应的功能,终端1400的其他单元的详细描述可以参考其所对应方法步骤的详细描述,本申请实施例这里不再赘述。Of course, the terminal 1300 includes, but is not limited to, the unit modules listed above. For example, the terminal 300 may further include a receiving module and a sending module. The receiving module is used to receive data or instructions sent by other terminals. The sending module is used to send data or instructions to other terminals. In addition, the specific functions that can be implemented by the above functional units also include but are not limited to the functions corresponding to the method steps described in the above examples. For detailed descriptions of other units of the terminal 1400, please refer to the detailed description of the corresponding method steps. Examples are not repeated here.
在采用集成的单元的情况下,图15示出了上述实施例中所涉及的终端的一种可能 的结构示意图。该终端1500包括:处理模块1501、存储模块1502、显示模块1503、通信模块1504和音频模块1505。处理模块1501用于对终端1500的动作进行控制管理。例如,处理模块1501可以用于支持终端1500执行上述方法实施例中的S402,S403,S404,S404',S405,S406,S701,S702,生成S1001中的第一界面,S1002中“接收第一操作”,和/或用于本文所描述的技术的其它过程。显示模块1503用于显示处理模块1501生成的图像。例如,显示模块1503用于支持终端1500执行上述方法实施例中的S1001,和/或用于本文所描述的技术的其它过程。存储模块1502,用于保存终端的程序代码和数据。例如,存储模块1502用于保存处理器执行S404和S404'录制的语音数据以及录制的语音数据的文本信息。通信模块1504用于支持终端200与其他网络实体(如终端300)的通信。例如,通信模块1504用于支持终端1500执行上述方法实施例中的S401,和/或用于本文所描述的技术的其它过程。音频模块1505用于采集终端200的用户发出的语音数据,以及播放语音数据。例如,音频模块1505(如音频模块1505中的麦克风)用于支持终端1500执行“捕获语音数据”的操作。音频模块1505(如音频模块1505中的扬声器和受话器)用于支持终端1500执行“播放语音数据”的操作。终端1500包括的各个单元的详细描述可以参考上述各方法实施例中的描述,这里不再赘述。In the case where an integrated unit is used, FIG. 15 shows a possible structural diagram of a terminal involved in the foregoing embodiment. The terminal 1500 includes a processing module 1501, a storage module 1502, a display module 1503, a communication module 1504, and an audio module 1505. The processing module 1501 is configured to control and manage the actions of the terminal 1500. For example, the processing module 1501 may be used to support the terminal 1500 to execute S402, S403, S404, S404 ', S405, S406, S701, and S702 in the above method embodiment to generate a first interface in S1001, and "receive a first operation in S1002" ", And / or other processes for the techniques described herein. The display module 1503 is configured to display an image generated by the processing module 1501. For example, the display module 1503 is configured to support the terminal 1500 to execute S1001 in the foregoing method embodiment, and / or other processes used in the technology described herein. The storage module 1502 is configured to store program codes and data of the terminal. For example, the storage module 1502 is configured to store the speech data recorded by the processor in S404 and S404 ′ and the text information of the recorded speech data. The communication module 1504 is configured to support communication between the terminal 200 and other network entities (such as the terminal 300). For example, the communication module 1504 is used to support the terminal 1500 to perform S401 in the foregoing method embodiment, and / or other processes used in the technology described herein. The audio module 1505 is configured to collect voice data sent by a user of the terminal 200 and play the voice data. For example, the audio module 1505 (such as a microphone in the audio module 1505) is used to support the terminal 1500 to perform an operation of "capturing voice data". The audio module 1505 (such as a speaker and a receiver in the audio module 1505) is used to support the terminal 1500 to perform an operation of "playing voice data." For a detailed description of each unit included in the terminal 1500, reference may be made to the description in the foregoing method embodiments, and details are not described herein again.
其中,处理模块1501可以是处理器或控制器,例如可以是中央处理器(Central Processing Unit,CPU),通用处理器,数字信号处理器(Digital Signal Processor,DSP),专用集成电路(Application-Specific Integrated Circuit,ASIC),现场可编程门阵列(Field Programmable Gate Array,FPGA)或者其他可编程逻辑器件、晶体管逻辑器件、硬件部件或者其任意组合。其可以实现或执行结合本申请公开内容所描述的各种示例性的逻辑方框,模块和电路。所述处理器也可以是实现计算功能的组合,例如包含一个或多个微处理器组合,DSP和微处理器的组合等等。通信模块可以是收发器、收发电路或通信接口等。存储模块1502可以是存储器。The processing module 1501 may be a processor or a controller. For example, the processing module 1501 may be a central processing unit (CPU), a general-purpose processor, a digital signal processor (DSP), and an application-specific integrated circuit (Application-Specific Integrated Circuit (ASIC), Field Programmable Gate Array (FPGA), or other programmable logic devices, transistor logic devices, hardware components, or any combination thereof. It may implement or execute various exemplary logical blocks, modules, and circuits described in connection with the present disclosure. The processor may also be a combination that implements computing functions, such as a combination including one or more microprocessors, a combination of a DSP and a microprocessor, and so on. The communication module may be a transceiver, a transceiver circuit, or a communication interface. The storage module 1502 may be a memory.
处理模块1501为处理器(如图2所示的处理器210),通信模块1504包括射频模块(如图2所示的射频模块250)。通信模块还可以包括Wi-Fi模块和蓝牙模块等。射频模块250、Wi-Fi模块和蓝牙模块等通信模块可以统称为通信接口。存储模块1502为存储器(如图2所示的内部存储器221)。显示模块1503为触摸屏(包括图2所示的显示屏294,该显示屏294中集成了显示面板和触控面板)。音频模块270可以包括麦克风(如图2所示的麦克风270C)、扬声器(如图2所示的扬声器270A)、受话器(如图2所示的受话器270B)和耳机接口(如图2所示的耳机接口270D)。本申请实施例所提供的终端可以为图2所示的终端200。其中,上述处理器、通信接口、触摸屏、存储器、麦克风、受话器和扬声器可以通过总线耦合在一起。The processing module 1501 is a processor (such as the processor 210 shown in FIG. 2), and the communication module 1504 includes a radio frequency module (such as the radio frequency module 250 shown in FIG. 2). The communication module may also include a Wi-Fi module and a Bluetooth module. Communication modules such as the radio frequency module 250, the Wi-Fi module, and the Bluetooth module may be collectively referred to as a communication interface. The storage module 1502 is a memory (the internal memory 221 shown in FIG. 2). The display module 1503 is a touch screen (including a display screen 294 shown in FIG. 2, in which a display panel and a touch panel are integrated). The audio module 270 may include a microphone (such as the microphone 270C shown in FIG. 2), a speaker (such as the speaker 270A shown in FIG. 2), a receiver (such as the receiver 270B shown in FIG. 2), and a headphone interface (as shown in FIG. 2). Headphone jack 270D). The terminal provided in this embodiment of the present application may be the terminal 200 shown in FIG. 2. The processor, the communication interface, the touch screen, the memory, the microphone, the receiver, and the speaker may be coupled together through a bus.
本申请实施例还提供一种计算机存储介质,该计算机存储介质中存储有计算机程序代码,当上述处理器执行该计算机程序代码时,该终端执行图4、图10A或图10B任一附图中的相关方法步骤实现上述实施例中的方法。An embodiment of the present application further provides a computer storage medium. The computer storage medium stores computer program code. When the processor executes the computer program code, the terminal executes any of the drawings in FIG. 4, FIG. 10A, or FIG. 10B. The related method steps implement the method in the above embodiment.
本申请实施例还提供了一种计算机程序产品,当该计算机程序产品在计算机上运行时,使得计算机执行图4、图10A或图10B任一附图中的相关方法步骤实现上述实施例中的方法。The embodiment of the present application also provides a computer program product, and when the computer program product is run on a computer, the computer is caused to execute the related method steps in any one of FIG. 4, FIG. 10A, or FIG. method.
其中,本申请实施例提供的终端1400、终端1500、计算机存储介质或者计算机程序产品均用于执行上文所提供的对应的方法,因此,其所能达到的有益效果可参考上文所提供的对应的方法中的有益效果,此处不再赘述。The terminal 1400, the terminal 1500, the computer storage medium, or the computer program product provided in the embodiment of the present application are all used to execute the corresponding methods provided above. Therefore, for the beneficial effects that can be achieved, refer to the foregoing provided. The beneficial effects in the corresponding method are not repeated here.
通过以上的实施方式的描述,所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,仅以上述各功能模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能模块完成,即将装置的内部结构划分成不同的功能模块,以完成以上描述的全部或者部分功能。Through the description of the above embodiments, those skilled in the art can clearly understand that, for the convenience and brevity of the description, only the division of the above functional modules is used as an example. In practical applications, the above functions can be allocated as required. Completed by different functional modules, that is, the internal structure of the device is divided into different functional modules to complete all or part of the functions described above.
在本申请所提供的几个实施例中,应该理解到,所揭露的装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述模块或单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个装置,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided in this application, it should be understood that the disclosed apparatus and method may be implemented in other ways. For example, the device embodiments described above are only schematic. For example, the division of the modules or units is only a logical function division. In actual implementation, there may be another division manner. For example, multiple units or components may be divided. The combination can either be integrated into another device, or some features can be ignored or not implemented. In addition, the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, which may be electrical, mechanical or other forms.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是一个物理单元或多个物理单元,即可以位于一个地方,或者也可以分布到多个不同地方。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and the components displayed as units may be one physical unit or multiple physical units, that is, may be located in one place, or may be distributed to multiple different places. . Some or all of the units may be selected according to actual needs to achieve the objective of the solution of this embodiment.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以使用硬件的形式实现,也可以使用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present application may be integrated into one processing unit, or each of the units may exist separately physically, or two or more units may be integrated into one unit. The above integrated unit may be implemented in the form of hardware or in the form of software functional unit.
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个可读取存储介质中。基于这样的理解,本申请实施例的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该软件产品存储在一个存储介质中,包括若干指令用以使得一个设备(可以是单片机,芯片等)或处理器(processor)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、ROM、RAM、磁碟或者光盘等各种可以存储程序代码的介质。When the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a readable storage medium. Based on such an understanding, the technical solution of the embodiments of the present application is essentially a part that contributes to the existing technology or all or part of the technical solution may be embodied in the form of a software product that is stored in a storage medium. Included are several instructions for causing a device (which can be a single-chip microcomputer, a chip, etc.) or a processor to execute all or part of the steps of the method described in the embodiments of the present application. The foregoing storage medium includes various media that can store program codes, such as a U disk, a mobile hard disk, a ROM, a RAM, a magnetic disk, or an optical disk.
以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何在本申请揭露的技术范围内的变化或替换,都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以所述权利要求的保护范围为准。The above description is only a specific implementation of this application, but the scope of protection of this application is not limited to this. Any changes or replacements within the technical scope disclosed in this application shall be covered by the scope of protection of this application. . Therefore, the protection scope of this application shall be subject to the protection scope of the claims.

Claims (26)

  1. 一种通信过程中信息的记录及显示方法,其特征在于,应用于第一终端与第二终端进行语音通信的过程中,所述方法包括:A method for recording and displaying information in a communication process, which is characterized in that the method is applied to a process in which a first terminal performs voice communication with a second terminal, and the method includes:
    所述第一终端识别所述第一终端的麦克风捕获的语音数据;The first terminal recognizes voice data captured by a microphone of the first terminal;
    若所述麦克风捕获的第一语音数据对应的文本与预置启动词匹配,则所述第一终端开始录制所述第二终端的语音数据,所述第二终端的语音数据是由接收自所述第二终端的音频电信号转换而来的;If the text corresponding to the first voice data captured by the microphone matches a preset start word, the first terminal starts recording the voice data of the second terminal, and the voice data of the second terminal is received by the user. The audio and electric signals of the second terminal are converted;
    在所述语音数据录制结束后,所述第一终端显示第一界面;所述第一界面包括录制的语音数据对应的文本;After recording of the voice data, the first terminal displays a first interface; the first interface includes text corresponding to the recorded voice data;
    所述第一终端接收用户对所述录制的语音数据对应的文本的第一操作,响应于所述第一操作播放所述录制的语音数据。The first terminal receives a user's first operation on the text corresponding to the recorded voice data, and plays the recorded voice data in response to the first operation.
  2. 根据权利要求1所述的方法,其特征在于,所述在所述语音数据录制结束后,所述第一终端显示第一界面,包括:The method according to claim 1, wherein after the recording of the voice data is finished, the first terminal displaying the first interface comprises:
    在所述语音数据录制结束后,所述第一终端自动显示所述第一界面;After the voice data recording ends, the first terminal automatically displays the first interface;
    或者,or,
    响应于所述语音通信结束,所述第一终端显示所述第一界面;In response to the end of the voice communication, the first terminal displays the first interface;
    或者,or,
    在所述语音数据录制结束后,响应于用户输入第二操作,所述第一终端显示所述第一界面,所述第二操作用于指示所述第一终端显示所述第一终端的通话记录界面,所述第一界面为所述通话记录界面,所述通话记录界面包括所述语音通信的通话记录项,所述通话记录项用于记录所述语音通信的通话记录信息,所述通话记录项中包括所述录制的语音数据对应的文本;After the voice data recording ends, in response to a user inputting a second operation, the first terminal displays the first interface, and the second operation is used to instruct the first terminal to display the call of the first terminal Record interface, the first interface is the call record interface, the call record interface includes a call record item of the voice communication, the call record item is used to record call record information of the voice communication, and the call The record includes text corresponding to the recorded voice data;
    或者,or,
    在所述语音数据录制结束后,响应于用户对通话记录界面中所述语音通话的通话记录项的第三操作,所述第一终端显示所述第一界面,所述第三操作用于指示所述第一终端显示所述语音通信的记录详情界面,所述第一界面是所述记录详情界面,所述记录详情界面用于显示所述语音通信的通话记录信息,所述记录详情界面中包括所述录制的语音数据对应的文本。After the recording of the voice data is completed, in response to a user's third operation on the call record item of the voice call in the call history interface, the first terminal displays the first interface, and the third operation is used to indicate The first terminal displays a record detail interface of the voice communication, the first interface is the record detail interface, the record detail interface is used to display call record information of the voice communication, and the record detail interface Including text corresponding to the recorded voice data.
  3. 根据权利要求1或2所述的方法,其特征在于,所述方法还包括:The method according to claim 1 or 2, further comprising:
    若所述麦克风捕获的第一语音数据对应的文本与所述预置启动词匹配,则所述第一终端开始录制所述麦克风捕获的语音数据。If the text corresponding to the first voice data captured by the microphone matches the preset start word, the first terminal starts recording the voice data captured by the microphone.
  4. 根据权利要求1-3中任意一项所述的方法,其特征在于,所述录制的语音数据对应的文本包括至少两段文本信息,所述录制的语音数据包括至少两段语音数据,所述至少两段文本信息与所述至少两段语音数据一一对应;The method according to any one of claims 1-3, wherein the text corresponding to the recorded voice data includes at least two pieces of text information, the recorded voice data includes at least two pieces of voice data, and At least two pieces of text information correspond one-to-one with the at least two pieces of voice data;
    所述第一终端接收用户对所述录制的语音数据对应的文本的第一操作,响应于所述第一操作播放所述语音数据,包括:Receiving, by the first terminal, a first operation performed by a user on text corresponding to the recorded voice data, and playing the voice data in response to the first operation includes:
    所述第一终端接收用户对第一文本信息的所述第一操作;响应于所述第一操作,所述第一终端播放第一语音数据段;Receiving, by the first terminal, the user's first operation on the first text information; in response to the first operation, the first terminal plays a first voice data segment;
    其中,所述第一文本信息为所述至少两段文本信息中的一段文本信息;所述第一 文本信息与所述第一语音数据段对应。The first text information is a piece of text information in the at least two pieces of text information; the first text information corresponds to the first voice data segment.
  5. 根据权利要求4所述的方法,其特征在于,所述方法还包括:The method according to claim 4, further comprising:
    所述第一终端接收用户对第二文本信息的第四操作,所述第四操作用于将所述第二文本信息修改为第三文本信息;所述第二文本信息为所述至少两段文本信息中的一段文本信息,所述第二文本信息与第二语音数据段对应;Receiving, by the first terminal, a fourth operation performed by the user on the second text information, the fourth operation being used to modify the second text information into third text information; the second text information is the at least two segments A piece of text information in the text information, and the second text information corresponds to a second voice data segment;
    响应于所述第四操作,所述第一终端将所述第二文本信息修改为所述第三文本信息;In response to the fourth operation, the first terminal modifies the second text information into the third text information;
    所述第一终端在所述第一界面上显示所述第三文本信息;Displaying, by the first terminal, the third text information on the first interface;
    所述第一终端接收用户对所述第三文本信息的所述第一操作,播放所述第二语音数据段。Receiving, by the first terminal, the user's first operation on the third text information, and playing the second voice data segment.
  6. 根据权利要求4或5所述的方法,其特征在于,所述第一终端显示第一界面;所述第一界面包括所述录制的语音数据对应的文本,包括:The method according to claim 4 or 5, wherein the first terminal displays a first interface; the first interface includes text corresponding to the recorded voice data, and includes:
    所述第一终端按照录制文本信息对应的语音数据段的时间先后顺序,以及文本信息对应的语音数据段的来源信息,在所述第一界面显示所述至少两段文本信息;Displaying, by the first terminal, the at least two pieces of text information on the first interface in accordance with the chronological order of the voice data segments corresponding to the recorded text information and the source information of the voice data segments corresponding to the text information;
    其中,所述来源信息用于指示所述语音数据段是所述麦克风捕获的语音数据,或者所述第二终端的语音数据。The source information is used to indicate that the voice data segment is voice data captured by the microphone or voice data of the second terminal.
  7. 根据权利要求1-6中任意一项所述的方法,其特征在于,所述方法还包括:The method according to any one of claims 1-6, wherein the method further comprises:
    若所述第一语音数据对应的文本与所述预置启动词匹配,则所述第一终端发出第一提示信息,所述第一提示信息用于提示用户所述第一终端开始录制语音数据,所述第一提示信息为提示音或者振动提示。If the text corresponding to the first voice data matches the preset start word, the first terminal sends a first prompt message, and the first prompt information is used to prompt a user that the first terminal starts to record voice data The first prompt information is a prompt sound or a vibration prompt.
  8. 根据权利要求1-7中任意一项所述的方法,其特征在于,在所述第一终端显示第一界面之前,所述方法还包括:The method according to any one of claims 1 to 7, wherein before the first terminal displays a first interface, the method further comprises:
    在录制所述语音数据过程中,若所述麦克风捕获的第二语音数据对应的文本与预置结束醒词匹配,则所述第一终端停止录制语音数据。During the recording of the voice data, if the text corresponding to the second voice data captured by the microphone matches a preset end word, the first terminal stops recording the voice data.
  9. 根据权利要求8所述的方法,其特征在于,所述方法还包括:The method according to claim 8, further comprising:
    在录制所述语音数据过程中,若所述麦克风捕获的第二语音数据对应的文本与预置结束醒词匹配,则所述第一终端发出第二提示信息,所述第二提示信息用于提示用户所述第一终端停止录制语音数据,所述第二提示信息为提示音或者振动提示。During the recording of the voice data, if the text corresponding to the second voice data captured by the microphone matches a preset ending wake-up word, the first terminal sends a second prompt message, where the second prompt information is used for Prompting the user that the first terminal stops recording voice data, and the second prompt information is a prompt sound or a vibration prompt.
  10. 根据权利要求9所述的方法,其特征在于,所述第一提示信息和第二提示信息为提示音,所述方法还包括:The method according to claim 9, wherein the first prompt information and the second prompt information are prompt sounds, and the method further comprises:
    所述第一终端在所述第一语音数据对应的文本与所述预置启动词匹配时,确定所述第一终端使用扬声器播放语音数据;Determining, by the first terminal when the text corresponding to the first voice data matches the preset start word, that the first terminal uses a speaker to play voice data;
    所述第一终端根据所述扬声器播放的语音数据对所述麦克风采集的语音数据进行回声抑制;The first terminal performs echo suppression on the voice data collected by the microphone according to the voice data played by the speaker;
    所述第一终端在播放所述第二提示信息后,停止对所述麦克风采集的语音数据进行回声抑制。After the first terminal plays the second prompt information, the first terminal stops performing echo suppression on the voice data collected by the microphone.
  11. 根据权利要求4-6中任意一项所述的方法,其特征在于,所述第一终端按照录制每段语音数据的时间先后顺序,以及每段语音数据的来源信息,保存所述至少两段语音数据;The method according to any one of claims 4 to 6, wherein the first terminal saves the at least two segments in accordance with a chronological order of recording each segment of voice data and a source information of each segment of voice data. Voice data
    所述来源信息用于指示语音数据段是所述麦克风捕获,或者所述来源信息用于指示语音数据段是所述第二终端的语音数据。The source information is used to indicate that the voice data segment is captured by the microphone, or the source information is used to indicate that the voice data segment is voice data of the second terminal.
  12. 根据权利要求4-6或11中任意一项所述的方法,其特征在于,所述第一界面中还包括至少两个播放器插件;The method according to any one of claims 4-6 or 11, wherein the first interface further includes at least two player plug-ins;
    所述至少两个播放插件用于播放所述至少两段语音数据,所述至少两个播放插件与所述至少两段文本信息一一对应。The at least two playback plug-ins are used to play the at least two pieces of voice data, and the at least two playback plug-ins correspond to the at least two pieces of text information on a one-to-one basis.
  13. 一种终端,其特征在于,所述终端是第一终端,所述终端包括:一个或多个处理器、存储器、触摸屏、麦克风、通信接口、受话器和扬声器;所述存储器、所述显示器、所述通信接口与所述处理器耦合;所述触摸屏用于显示所述处理器生成的图像;所述麦克风用于捕获语音数据;所述存储器用于存储计算机程序代码;所述计算机程序代码包括计算机指令,当所述处理器执行上述计算机指令时,A terminal, wherein the terminal is a first terminal, and the terminal includes: one or more processors, a memory, a touch screen, a microphone, a communication interface, a receiver, and a speaker; the memory, the display, and The communication interface is coupled with the processor; the touch screen is used to display an image generated by the processor; the microphone is used to capture voice data; the memory is used to store computer program code; the computer program code includes a computer Instructions, when the processor executes the computer instructions described above,
    所述处理器,用于通过所述通信接口与第二终端进行语音通信;识别所述麦克风捕获的语音数据;若识别到所述麦克风捕获的第一语音数据对应的文本与预置启动词匹配,则开始录制所述第二终端的语音数据,所述第二终端的语音数据是由接收自所述第二终端的音频电信号转换而来的;所述处理器,还用于在所述存储器中保存录制的语音数据;The processor is configured to perform voice communication with the second terminal through the communication interface; identify voice data captured by the microphone; if it is recognized that the text corresponding to the first voice data captured by the microphone matches a preset start word , The recording of the voice data of the second terminal is started, and the voice data of the second terminal is converted from the audio and electrical signals received from the second terminal; the processor is further configured to: Recorded voice data stored in memory;
    所述处理器,还用于在所述语音数据录制结束后,控制所述触摸屏显示第一界面;所述第一界面包括所述录制的语音数据对应的文本;The processor is further configured to control the touch screen to display a first interface after the recording of the voice data is finished; the first interface includes text corresponding to the recorded voice data;
    所述处理器,还用于接收用户对所述触摸屏显示的所述文本的第一操作;响应于所述第一操作控制所述受话器或者所述扬声器播放所述语音数据。The processor is further configured to receive a first operation performed by the user on the text displayed on the touch screen; and control the receiver or the speaker to play the voice data in response to the first operation.
  14. 根据权利要求13所述的终端,其特征在于,所述处理器,用于在所述语音数据录制结束后,控制所述触摸屏显示第一界面,包括:The terminal according to claim 13, wherein the processor, configured to control the touch screen to display a first interface after the recording of the voice data is completed, comprises:
    所述处理器,用于在所述语音数据录制结束后,自动控制所述触摸屏显示所述第一界面;The processor is configured to automatically control the touch screen to display the first interface after the recording of the voice data is finished;
    或者,or,
    所述处理器,用于响应于所述语音通信结束,控制所述触摸屏显示所述第一界面;The processor is configured to control the touch screen to display the first interface in response to the end of the voice communication;
    或者,or,
    所述处理器,用于在所述语音数据录制结束后,响应于用户输入第二操作,控制所述触摸屏显示所述第一界面,所述第二操作用于指示所述终端显示所述终端的通话记录界面,所述第一界面为所述通话记录界面,所述通话记录界面包括所述语音通信的通话记录项,所述通话记录项用于记录所述语音通信的通话记录信息,所述通话记录项中包括所述录制的语音数据对应的文本;The processor is configured to control the touch screen to display the first interface in response to a user inputting a second operation after the recording of the voice data is finished, and the second operation is used to instruct the terminal to display the terminal The call record interface, the first interface is the call record interface, and the call record interface includes a call record item of the voice communication, and the call record item is used to record the call record information of the voice communication. The call record item includes text corresponding to the recorded voice data;
    或者,or,
    所述处理器,用于在所述语音数据录制结束后,响应于用户对通话记录界面中所述语音通话的通话记录项的第三操作,控制所述触摸屏显示所述第一界面,所述第三操作用于指示所述终端显示所述语音通信的记录详情界面,所述第一界面是所述记录详情界面,所述记录详情界面用于显示所述语音通信的通话记录信息,所述记录详情界面中包括所述录制的语音数据对应的文本。The processor is configured to control the touch screen to display the first interface in response to a third operation of a call record item of the voice call in the call record interface by the user after the recording of the voice data ends, and A third operation is used to instruct the terminal to display a record detail interface of the voice communication, the first interface is the record detail interface, the record detail interface is used to display call record information of the voice communication, and the The record details interface includes text corresponding to the recorded voice data.
  15. 根据权利要求13或14所述的终端,其特征在于,所述处理器,还用于若识 别到所述麦克风捕获的第一语音数据对应的文本与所述预置启动词匹配,则所述第一终端开始录制所述麦克风捕获的语音数据。The terminal according to claim 13 or 14, wherein the processor is further configured to: if it is recognized that the text corresponding to the first voice data captured by the microphone matches the preset start word, the processor is configured to: The first terminal starts recording voice data captured by the microphone.
  16. 根据权利要求13-15任意一项所述的终端,其特征在于,所述录制的语音数据对应的文本包括至少两段文本信息,所述录制的语音数据包括至少两段语音数据,所述至少两段文本信息与所述至少两段语音数据一一对应;The terminal according to any one of claims 13-15, wherein the text corresponding to the recorded voice data includes at least two pieces of text information, the recorded voice data includes at least two pieces of voice data, and the at least Two pieces of text information correspond one-to-one with the at least two pieces of voice data;
    所述处理器,用于接收用户对所述触摸屏显示的所述文本的第一操作,响应于所述第一操作控制所述受话器或者所述扬声器播放所述语音数据,包括:The processor, configured to receive a user's first operation on the text displayed on the touch screen, and control the receiver or the speaker to play the voice data in response to the first operation, includes:
    所述处理器,用于接收用户对所述触摸屏显示的第一文本信息的所述第一操作;响应于所述第一操作,控制所述受话器或者所述扬声器播放第一语音数据段;The processor is configured to receive the user's first operation on the first text information displayed on the touch screen; and in response to the first operation, control the receiver or the speaker to play a first voice data segment;
    其中,所述第一文本信息为所述至少两段文本信息中的一段文本信息;所述第一文本信息与所述第一语音数据段对应。The first text information is a piece of text information among the at least two pieces of text information; the first text information corresponds to the first voice data segment.
  17. 根据权利要求16所述的终端,其特征在于,所述处理器,还用于接收用户对所述触摸屏显示的第二文本信息的第四操作,所述第四操作用于将所述第二文本信息修改为第三文本信息;所述第二文本信息为所述至少两段文本信息中的一段文本信息,所述第二文本信息与第二语音数据段对应;响应于所述第四操作,将所述第二文本信息修改为所述第三文本信息;在所述存储器中保存所述第三文本信息,以及所述第三文本信息与所述第二语音数据段的对应关系;The terminal according to claim 16, wherein the processor is further configured to receive a fourth operation performed by the user on the second text information displayed on the touch screen, and the fourth operation is configured to use the second operation The text information is modified into third text information; the second text information is a piece of text information in the at least two pieces of text information, and the second text information corresponds to a second voice data segment; in response to the fourth operation Modify the second text information to the third text information; save the third text information in the memory, and a correspondence between the third text information and the second voice data segment;
    所述处理器,还用于控制所述触摸屏在所述第一界面上显示所述第三文本信息;接收用户对所述触摸屏显示的所述第三文本信息的所述第一操作,控制所述受话器或者扬声器播放所述第二语音数据段。The processor is further configured to control the touch screen to display the third text information on the first interface; receive the user's first operation on the third text information displayed on the touch screen, and control The handset or speaker plays the second voice data segment.
  18. 根据权利要求16或17所述的终端,其特征在于,所述处理器,用于控制所述触摸屏显示所述第一界面,包括:The terminal according to claim 16 or 17, wherein the processor, configured to control the touch screen to display the first interface, comprises:
    所述处理器,用于控制所述触摸屏按照录制文本信息对应的语音数据段的时间先后顺序,以及文本信息对应的语音数据段的来源信息,在所述第一界面显示所述至少两段文本信息;The processor is configured to control the touch screen to display the at least two pieces of text on the first interface in accordance with a chronological order of voice data segments corresponding to the recorded text information and source information of the voice data segments corresponding to the text information. information;
    其中,所述来源信息用于指示所述语音数据段是所述麦克风捕获的语音数据,或者所述第二终端的语音数据。The source information is used to indicate that the voice data segment is voice data captured by the microphone or voice data of the second terminal.
  19. 根据权利要求13-18中任意一项所述的终端,其特征在于,所述处理器,还用于若识别到所述麦克风捕获的所述第一语音数据对应的文本与所述预置启动词匹配,则发出第一提示信息,所述第一提示信息用于提示用户所述终端开始录制语音数据,所述第一提示信息为提示音或者振动提示。The terminal according to any one of claims 13 to 18, wherein the processor is further configured to, if the text corresponding to the first voice data captured by the microphone and the preset activation are recognized When the words match, a first prompt message is sent. The first prompt message is used to prompt the user to start recording voice data on the terminal. The first prompt message is a prompt sound or a vibration prompt.
  20. 根据权利要求13-19中任意一项所述的终端,其特征在于,所述处理器,还用于控制所述触摸屏显示第一界面之前,在录制所述语音数据过程中,若识别到所述麦克风捕获的第二语音数据对应的文本与预置结束醒词匹配,则停止录制语音数据。The terminal according to any one of claims 13 to 19, wherein the processor is further configured to control the touch screen to display the first interface, if the voice data is recognized during recording of the voice data, If the text corresponding to the second voice data captured by the microphone matches the preset end word, stop recording the voice data.
  21. 根据权利要求20所述的终端,其特征在于,所述处理器,还用于在录制所述语音数据过程中,若识别到所述麦克风捕获的第二语音数据对应的文本与所述预置结束醒词匹配,则发出第二提示信息,所述第二提示信息用于提示用户所述第一终端停止录制语音数据,所述第二提示信息为提示音或者振动提示。The terminal according to claim 20, wherein the processor is further configured to, during the recording of the voice data, if the text corresponding to the second voice data captured by the microphone and the preset are recognized After the wake-up word matching is ended, a second prompt message is issued, the second prompt message is used to prompt the user to stop recording the voice data by the first terminal, and the second prompt message is a prompt sound or a vibration prompt.
  22. 根据权利要求21所述的终端,其特征在于,所述第一提示信息和第二提示信 息为提示音,所述处理器,还用于在所述第一语音数据对应的文本与所述预置启动词匹配时,确定由所述扬声器播放语音数据;根据所述扬声器播放的语音数据对所述麦克风采集的语音数据进行回声抑制;在所述受话器或者扬声器播放所述第二提示信息后,停止对所述麦克风采集的语音数据进行回声抑制。The terminal according to claim 21, wherein the first prompt information and the second prompt information are prompt sounds, and the processor is further configured to add a text corresponding to the first voice data to the pre- When the start word is matched, it is determined that the speaker plays voice data; the voice data collected by the microphone is echo suppressed according to the voice data played by the speaker; after the second prompt message is played by the receiver or speaker, Stop performing echo suppression on the voice data collected by the microphone.
  23. 根据权利要求16-18中任意一项所述的终端,其特征在于,所述存储器按照录制每段语音数据的时间先后顺序,以及每段语音数据的来源信息,保存所述至少两段语音数据;The terminal according to any one of claims 16 to 18, wherein the memory stores the at least two pieces of voice data according to a chronological order of recording each piece of voice data and a source information of each piece of voice data ;
    所述来源信息用于指示语音数据段是所述麦克风捕获,或者所述来源信息用于指示语音数据段是所述第二终端的语音数据。The source information is used to indicate that the voice data segment is captured by the microphone, or the source information is used to indicate that the voice data segment is voice data of the second terminal.
  24. 根据权利要求16-18或23中任意一项所述的终端,其特征在于,所述触摸屏显示的所述第一界面中还包括至少两个播放器插件,所述至少两个播放插件与所述至少两段文本信息一一对应,所述至少两个播放插件与所述至少两段语音数据一一对应;The terminal according to any one of claims 16-18 or 23, wherein the first interface displayed on the touch screen further includes at least two player plug-ins, and the at least two playback plug-ins and the The at least two pieces of text information correspond one-to-one, and the at least two playback plug-ins correspond to the at least two pieces of voice data one-to-one;
    所述处理器,还用于接收用户对所述至少两个播放器插件中第一播放器插件的点击操作,控制所述受话器或者所述扬声器播放所述第一播放器插件对应的语音数据段。The processor is further configured to receive a user's click operation on a first player plug-in of the at least two player plug-ins, and control the receiver or the speaker to play a voice data segment corresponding to the first player plug-in. .
  25. 一种计算机存储介质,其特征在于,所述计算机存储介质包括计算机指令,当所述计算机指令在终端上运行时,使得所述终端执行如权利要求1-12中任意一项所述的通信过程中信息的记录及显示方法。A computer storage medium, characterized in that the computer storage medium includes computer instructions, and when the computer instructions are run on a terminal, cause the terminal to execute the communication process according to any one of claims 1-12 Information recording and display methods.
  26. 一种计算机程序产品,其特征在于,当所述计算机程序产品在计算机上运行时,使得所述计算机执行如权利要求1-12中任意一项所述的通信过程中信息的记录及显示方法。A computer program product, characterized in that when the computer program product is run on a computer, the computer is caused to execute a method for recording and displaying information during a communication process according to any one of claims 1-12.
PCT/CN2018/105571 2018-09-13 2018-09-13 Method for recording and displaying information in communication process, and terminals WO2020051852A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2018/105571 WO2020051852A1 (en) 2018-09-13 2018-09-13 Method for recording and displaying information in communication process, and terminals
CN201880090760.2A CN111819830B (en) 2018-09-13 2018-09-13 Information recording and displaying method and terminal in communication process

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2018/105571 WO2020051852A1 (en) 2018-09-13 2018-09-13 Method for recording and displaying information in communication process, and terminals

Publications (1)

Publication Number Publication Date
WO2020051852A1 true WO2020051852A1 (en) 2020-03-19

Family

ID=69777304

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/105571 WO2020051852A1 (en) 2018-09-13 2018-09-13 Method for recording and displaying information in communication process, and terminals

Country Status (2)

Country Link
CN (1) CN111819830B (en)
WO (1) WO2020051852A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114079809A (en) * 2020-08-20 2022-02-22 华为技术有限公司 Terminal and input method and device thereof
CN116087930A (en) * 2022-08-18 2023-05-09 荣耀终端有限公司 Audio ranging method, device, storage medium, and program product

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103581400A (en) * 2013-11-26 2014-02-12 广东欧珀移动通信有限公司 Method for storing phone numbers in communication process
CN103888581A (en) * 2014-03-28 2014-06-25 深圳市中兴移动通信有限公司 Communication terminal and method for recording communication information thereof
CN104754100A (en) * 2013-12-25 2015-07-01 深圳桑菲消费通信有限公司 Call recording method and device and mobile terminal
CN106357932A (en) * 2016-11-22 2017-01-25 奇酷互联网络科技(深圳)有限公司 Call information recording method and mobile terminal
CN107040640A (en) * 2017-05-26 2017-08-11 黄晓咏 A kind of communication module for being used to record message

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101211615B (en) * 2006-12-31 2012-10-03 于柏泉 System for automatic recording for specific human voice
CN101277338A (en) * 2007-03-29 2008-10-01 西门子(中国)有限公司 Method for recording downstream voice signal of communication terminal as well as the communication terminal
CN103577965B (en) * 2012-07-30 2018-10-16 腾讯科技(深圳)有限公司 A kind of affairs prompt method and apparatus
CN105657149B (en) * 2015-08-25 2019-05-14 南京酷派软件技术有限公司 A kind of voice communication method, system and communication terminal
CN106385485A (en) * 2016-08-25 2017-02-08 广东欧珀移动通信有限公司 Call recording method, call recording device and mobile terminal
CN106131288A (en) * 2016-08-25 2016-11-16 深圳市金立通信设备有限公司 The recording method of a kind of call-information and terminal
CN107295174B (en) * 2017-06-22 2020-11-10 深圳传音通讯有限公司 Method, device and terminal for recording call content
CN107591150A (en) * 2017-08-16 2018-01-16 珠海市魅族科技有限公司 Audio recognition method and device, computer installation and computer-readable recording medium
CN107464557B (en) * 2017-09-11 2021-05-07 Oppo广东移动通信有限公司 Call recording method and device, mobile terminal and storage medium
CN108172247A (en) * 2017-12-22 2018-06-15 北京壹人壹本信息科技有限公司 Record playing method, mobile terminal and the device with store function

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103581400A (en) * 2013-11-26 2014-02-12 广东欧珀移动通信有限公司 Method for storing phone numbers in communication process
CN104754100A (en) * 2013-12-25 2015-07-01 深圳桑菲消费通信有限公司 Call recording method and device and mobile terminal
CN103888581A (en) * 2014-03-28 2014-06-25 深圳市中兴移动通信有限公司 Communication terminal and method for recording communication information thereof
CN106357932A (en) * 2016-11-22 2017-01-25 奇酷互联网络科技(深圳)有限公司 Call information recording method and mobile terminal
CN107040640A (en) * 2017-05-26 2017-08-11 黄晓咏 A kind of communication module for being used to record message

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114079809A (en) * 2020-08-20 2022-02-22 华为技术有限公司 Terminal and input method and device thereof
CN116087930A (en) * 2022-08-18 2023-05-09 荣耀终端有限公司 Audio ranging method, device, storage medium, and program product
CN116087930B (en) * 2022-08-18 2023-10-20 荣耀终端有限公司 Audio ranging method, device, storage medium, and program product

Also Published As

Publication number Publication date
CN111819830B (en) 2022-05-17
CN111819830A (en) 2020-10-23

Similar Documents

Publication Publication Date Title
WO2021213120A1 (en) Screen projection method and apparatus, and electronic device
WO2020019176A1 (en) Method for updating wake-up voice of voice assistant by terminal, and terminal
CN110401767B (en) Information processing method and apparatus
WO2022100610A1 (en) Screen projection method and apparatus, and electronic device and computer-readable storage medium
US11848016B2 (en) Voice control command generation method and terminal
US20220179827A1 (en) File Sharing Method of Mobile Terminal and Device
WO2020019355A1 (en) Touch control method for wearable device, and wearable device and system
WO2020056684A1 (en) Method and device employing multiple tws earpieces connected in relay mode to realize automatic interpretation
WO2021175266A1 (en) Identity verification method and apparatus, and electronic devices
CN112543447A (en) Device discovery method based on address list, audio and video communication method and electronic device
CN114422340A (en) Log reporting method, electronic device and storage medium
CN113676339B (en) Multicast method, device, terminal equipment and computer readable storage medium
CN114115770A (en) Display control method and related device
CN111492678B (en) File transmission method and electronic equipment
WO2020051852A1 (en) Method for recording and displaying information in communication process, and terminals
CN113467735A (en) Image adjusting method, electronic device and storage medium
CN115119336B (en) Earphone connection system, earphone connection method, earphone, electronic device and readable storage medium
CN113467747B (en) Volume adjusting method, electronic device and storage medium
CN114120987B (en) Voice wake-up method, electronic equipment and chip system
CN115525366A (en) Screen projection method and related device
CN114822525A (en) Voice control method and electronic equipment
CN113867520A (en) Device control method, electronic device, and computer-readable storage medium
CN114554012A (en) Incoming call answering method, electronic equipment and storage medium
WO2023020420A1 (en) Volume display method, electronic device, and storage medium
CN114071055B (en) Method for rapidly joining conference and related equipment

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18933381

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18933381

Country of ref document: EP

Kind code of ref document: A1