WO2016184109A1 - Method and terminal for saving a recording and for displaying and playing a recording in picture form - Google Patents

Method and terminal for saving a recording and for displaying and playing a recording in picture form

Info

Publication number
WO2016184109A1
Authority
WO
WIPO (PCT)
Prior art keywords
file
recording
audio data
picture
information
Prior art date
Application number
PCT/CN2015/098960
Other languages
English (en)
French (fr)
Inventor
袁强
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司
Publication of WO2016184109A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/64 Automatic arrangements for answering calls; Automatic arrangements for recording messages for absent subscribers; Arrangements for recording conversations
    • H04M1/65 Recording arrangements for recording a message from the calling party
    • H04M1/6505 Recording arrangements for recording a message from the calling party storing speech in digital form
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/64 Automatic arrangements for answering calls; Automatic arrangements for recording messages for absent subscribers; Arrangements for recording conversations
    • H04M1/65 Recording arrangements for recording a message from the calling party
    • H04M1/656 Recording arrangements for recording a message from the calling party for recording conversations

Definitions

  • This document relates to, but is not limited to, data storage technology, and in particular to a method for saving a recording, a method for playing a recording, and a terminal.
  • The recording mode is relatively simple.
  • The commonly used method is to save the recording as an audio file and to distinguish recordings by file name: the file name records either the call time or the call number. A call recording saved this way carries information that is often too simple and too sparse. Because of limits such as the length of the file name, detailed call information cannot be saved, and confidentiality is poor. After a long time has passed, it is often impossible to recall the specific scene of the recorded call. For example, dual-card mobile terminals are now very popular, and users commonly switch between multiple cards on one mobile terminal, yet call recording on a mobile terminal currently cannot display multiple kinds of call information.
  • the embodiment of the invention provides a method for saving a recording, playing a recording, and a terminal, to solve the technical problem of how to display a plurality of types of call information.
  • a method of saving a recording applied to a terminal including:
  • the audio data and the plurality of information are saved to a file, wherein the plurality of information is saved into the display content of the file.
  • Saving the audio data and the plurality of information into the file includes: saving the audio data and the plurality of information into the same file.
  • the file is a picture file
  • Saving the audio data and the plurality of information into the same file includes: saving the plurality of information into the display content of the picture file, and filling the audio data into a specified reserved segment of the picture file.
  • Saving the audio data and the plurality of information into the same file further includes: adding a recording identifier at a specified position of the specified reserved segment, wherein the recording identifier identifies the picture file as a picture file that saves a recording.
  • Filling the audio data into a specified reserved segment of the picture file includes: filling the whole piece of audio data into one specified reserved segment of the picture file; or slicing the audio data and filling the slices separately into multiple specified reserved segments of the picture file.
  • The picture file adopts a picture format formulated by the Joint Photographic Experts Group (JPEG), and the specified reserved segment includes one or more APP15 segments.
  • the file is a video file
  • Saving the audio data and the plurality of information into the same file includes: saving the plurality of information into the display content of one or more frames of images, and then splicing the one or more frames of images with the audio data to generate the video file.
  • Saving the audio data and the plurality of information into the file includes: saving the audio data and the plurality of information into a plurality of files.
  • the file names of the plurality of files are at least partially identical.
  • the plurality of information includes at least two of the following: a call start time, a call end time, a calling number, a called number, a call duration, a calling location, a called location, a calling contact, a called contact, and all or part of the call content obtained by voice recognition.
  • the recording is a call recording.
  • a method for displaying and playing a picture format recording, applied to a terminal comprising:
  • if the picture file is a picture file that saves a recording, displaying the display content of the picture file, and extracting the audio data of the recording from other content of the picture file for playing; wherein the display content includes multiple kinds of information about the recording.
  • Parsing the content of the picture file includes: parsing a specified reserved segment of the picture file, and if the specified position of the specified reserved segment has a recording identifier, the picture file is a picture file for saving the recording;
  • Extracting the recorded audio data from the other content of the picture file for playing comprising: extracting the recorded audio data from the specified reserved segment for playing.
  • Extracting the recorded audio data from the specified reserved segment for playing including:
  • if one specified reserved segment has a recording identifier at its specified position, the audio data of the recording is extracted from that segment and played;
  • if multiple specified reserved segments have a recording identifier at their specified positions, the audio data is extracted from each of those segments, spliced into a single piece of data, and then played.
  • The picture file adopts a picture format formulated by the Joint Photographic Experts Group (JPEG), and the specified reserved segment includes one or more APP15 segments.
  • Parsing the specified reserved segment of the picture file, and if the specified position of the data in the specified reserved segment has a recording identifier, determining that the picture file is a picture file that saves a recording;
  • the multiple kinds of information about the recording include at least two of the following: a call start time, a call end time, a calling number, a called number, a call duration, a calling location, a called location, a calling contact, a called contact, and all or part of the call content obtained by voice recognition.
  • the recording is a call recording.
  • a terminal that can save recordings including:
  • a recording module configured to record audio data and various information of the recording during recording
  • the recording save module is configured to save the audio data and the plurality of information into the file, wherein the plurality of information is saved into the display content of the file.
  • the recording save module is configured to save the audio data and the plurality of information into the file by: saving the audio data and the plurality of information into the same file.
  • the file is a picture file
  • the recording save module is configured to save the audio data and the plurality of information into the same file by: saving the plurality of information into display content of the image file, and the audio The data is populated into the specified reserved section of the picture file.
  • the recording save module is further configured, when saving the audio data and the plurality of information into the same file, to add a recording identifier at the specified position of the specified reserved segment, where the recording identifier identifies the picture file as a picture file that saves a recording.
  • the recording save module is configured to fill the audio data into a specified reserved segment of the picture file by: filling the audio data into one specified reserved segment of the picture file; or slicing the audio data and filling the slices into multiple specified reserved segments of the picture file.
  • The picture file adopts a picture format formulated by the Joint Photographic Experts Group (JPEG), and the specified reserved segment includes one or more APP15 segments.
  • the file is a video file
  • the recording save module is configured to save the audio data and the plurality of information into the same file by: saving the plurality of information into the display content of one or more frames of images, and then splicing the one or more frames of images with the audio data to generate the video file.
  • the recording save module is configured to save the audio data and various information by saving the audio data and various information into a plurality of files.
  • the file names of the plurality of files are at least partially identical.
  • the multiple kinds of information about the recording include at least two of the following: a call start time, a call end time, a calling number, a called number, a call duration, a calling location, a called location, a calling contact, a called contact, and all or part of the call content obtained by voice recognition.
  • the recording is a call recording.
  • a terminal capable of playing a picture format recording comprising:
  • a picture parsing module configured to parse the content of the picture file after receiving an instruction to open the picture file
  • a picture display and play module configured to: when the picture parsing module determines that the picture file is a picture file that saves a recording, display the display content of the picture file, and extract the audio data of the recording from other content of the picture file for playing; wherein the display content contains multiple kinds of information about the recording.
  • the picture parsing module is configured to parse the content of the picture file by: parsing a specified reserved segment of the picture file, and determining that the picture file is a picture file that saves a recording if the specified position of the specified reserved segment has a recording identifier;
  • the picture display and play module is configured to extract audio data of the recorded sound from other content of the picture file for playing by: extracting the recorded audio data from the specified reserved segment for playing.
  • the picture display and play module is configured to perform playback of the audio data of the recorded sound from the designated reserved segment by:
  • if one specified reserved segment has a recording identifier at its specified position, the audio data of the recording is extracted from that segment and played;
  • if multiple specified reserved segments have a recording identifier at their specified positions, the audio data is extracted from each of those segments, spliced into a single piece of data, and then played.
  • The picture file adopts a picture format formulated by the Joint Photographic Experts Group (JPEG), and the reserved segment includes an APP15 segment.
  • the picture parsing module is configured to perform the following processing through the picture parsing library of the operating system framework layer or through an application: parsing the specified reserved segment of the picture file, and if the specified position of the specified reserved segment has a recording identifier, determining that the picture file is a picture file that saves a recording; and/or
  • the picture display and play module is configured to perform the following processing through the picture parsing library of the operating system framework layer or through an application: extracting the audio data of the recording from the specified reserved segment for playing; wherein playing the audio data means transferring the audio data to the audio player for playback.
  • the multiple kinds of information about the recording include at least two of the following: a call start time, a call end time, a calling number, a called number, a call duration, a calling location, a called location, a calling contact, a called contact, and all or part of the call content obtained by voice recognition.
  • the recording is a call recording.
  • the embodiment of the invention further provides a computer storage medium, wherein the computer storage medium stores computer executable instructions, and the computer executable instructions are used to execute the above method.
  • the embodiment of the invention expands the information saved by the call recording, saves it in the display content of the file, improves the user experience, and can also have a certain security effect.
  • FIG. 1 is a flowchart of a method for saving a call recording according to an embodiment of the present invention
  • FIG. 2 is a block diagram of a terminal capable of saving a call recording according to an embodiment of the present invention
  • FIG. 3 is a flowchart of a method for saving a call recording according to Embodiment 2 of the present invention.
  • FIG. 5 is a flowchart of a method for displaying and playing a picture form call recording according to Embodiment 4 of the present invention.
  • FIG. 6 is a block diagram of a terminal capable of playing a call recording in a picture form according to Embodiment 4 of the present invention.
  • FIG. 7 is a schematic diagram of application example 1 of the present invention for filling data in a JPEG picture file APP15 segment.
  • various information of the call recording is saved in the display content to implement extension of the call recording.
  • the various information of the recording refers to a variety of information related to the recording except the audio data
  • the recorded audio includes the call recording
  • the various information of the call recording includes: the call time, the call number, the call duration, the call position, and the call content.
  • the method for saving a call recording in this embodiment is applied to a terminal, as shown in FIG. 1, and includes:
  • Step 110 Record audio data and various information of the call recording during the call recording process
  • This embodiment takes a mobile terminal as an example.
  • the mobile terminal makes a call
  • the user manually selects or automatically starts the call recording.
  • the mobile terminal records the audio data of the call recording in real time, such as being stored in the cache.
  • the recorded audio data may be PCM (Pulse Code Modulation) audio data, but the present invention is not limited thereto.
  • the mobile terminal can obtain a variety of information of the call recording.
  • the plurality of information may include at least two of the following: a call start time, a call end time, a calling number, a called number, a call duration, a calling location, a called location, a calling contact, a called contact, and all or part of the call content obtained by voice recognition.
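The kinds of information listed above can be pictured as a small record whose populated fields become the text later drawn into the display content. The following is only a sketch: the class name `CallInfo`, its field names, and the "label: value" formatting are assumptions made for illustration and are not taken from the patent text.

```python
from dataclasses import dataclass, asdict
from typing import List, Optional


@dataclass
class CallInfo:
    """Hypothetical container for the kinds of call information listed above."""
    start_time: Optional[str] = None
    end_time: Optional[str] = None
    calling_number: Optional[str] = None
    called_number: Optional[str] = None
    duration_s: Optional[int] = None
    calling_location: Optional[str] = None
    called_location: Optional[str] = None
    calling_contact: Optional[str] = None
    called_contact: Optional[str] = None
    transcript: Optional[str] = None  # call content obtained by voice recognition

    def display_lines(self) -> List[str]:
        """Return one 'label: value' line per recorded field, in field order."""
        present = {k: v for k, v in asdict(self).items() if v is not None}
        # The text requires at least two kinds of information.
        if len(present) < 2:
            raise ValueError("at least two kinds of information are required")
        return [f"{k.replace('_', ' ')}: {v}" for k, v in present.items()]
```

A renderer (not shown) could draw these lines as text, or replace some of them with a clock pattern or a map as the description suggests.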
  • Step 120 Save the audio data and various information into a file, where the multiple information is saved into the display content of the file.
  • the audio data and various information can be saved in the same file.
  • the audio data and the plurality of information are saved to a plurality of files.
  • the file names of the plurality of files are at least partially identical.
  • The audio data and the plurality of information may be saved to the plurality of files in either of two ways: the plurality of information is saved into the display content of a picture file while the audio data is saved in an audio file; or each of the plurality of files saves part of the audio data of the recording, and the plurality of files contain the plurality of information of the recording.
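One way to make the file names of a picture file and an audio file "at least partially identical" is to derive both from a shared stem. The sketch below assumes a stem built from the call start time and the calling number; the `rec_` prefix, the stem layout, and the `.jpg`/`.pcm` extensions are illustrative choices, not something the patent specifies.

```python
from datetime import datetime
from typing import Tuple


def recording_file_names(start: datetime, calling_number: str) -> Tuple[str, str]:
    """Return (picture_name, audio_name) that share the same stem."""
    # Shared stem: hypothetical naming scheme, start time plus calling number.
    stem = f"rec_{start:%Y%m%d_%H%M%S}_{calling_number}"
    return f"{stem}.jpg", f"{stem}.pcm"
```

Because only the extension differs, a player that opens the picture can locate the matching audio file by swapping the suffix.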
  • the file that saves the audio data and the plurality of types of information may be a file that has the display content and can store the audio data.
  • the image file and the video file are respectively taken as an example for description.
  • The plurality of information is usually displayed as text, but can also be displayed in other forms: for example, the call time can be displayed as a corresponding clock pattern, the call location as a corresponding map, and so on.
  • A file may be generated and the buffered audio data and the plurality of information saved into it; alternatively, a temporary file may be generated after the call recording starts, and the collected audio data and information saved into it,
  • with the official file generated after the call recording is completed.
  • the embodiment of the invention further provides a computer storage medium, wherein the computer storage medium stores computer executable instructions, and the computer executable instructions are used to execute the above method.
  • the embodiment further provides a terminal that can save the call recording, and can be a mobile terminal such as a mobile phone, but is not limited thereto.
  • the terminal includes:
  • the call recording module 10 is configured to record audio data and various information of the call recording during the call recording process
  • the recording save module 20 is configured to save the audio data and various information into a file, wherein the plurality of information is saved into the display content of the file.
  • the plurality of information of the call recording includes at least two of the following: a call start time, a call end time, a calling number, a called number, a call duration, a calling location, a called location, a calling contact, a called contact, and all or part of the call content obtained by voice recognition.
  • The display content (such as picture display content or video display content) can by itself record much of the relevant information about the call at the time it took place,
  • which enriches the content of the call recording. It helps users recall the specific scene of the recorded call, thus improving the user experience.
  • the method for saving a call recording in this embodiment is as shown in FIG. 3, and includes:
  • Step 210 Record audio data and various information of the call recording during the call recording process
  • This step is the same as step 110.
  • Step 220 Save the plurality of information into a display content of the picture file, and fill the audio data into a designated reserved segment of the picture file.
  • If the specified reserved segment is dedicated to saving the audio data, whether audio data exists can be determined by whether the specified reserved segment is empty. If the segment is not dedicated to audio data, a recording identifier must be added at a specified position of the specified reserved segment to identify the picture file as a picture file that saves a call recording.
  • The specified position may be the head and/or the tail of the data in the specified reserved segment.
  • When filling the audio data into the specified reserved segment of the picture file, the size of the audio data is compared with the size of the specified reserved segment. If the audio data fits into one specified reserved segment, it is filled directly into that segment; if the audio data is larger than the specified reserved segment, it must be sliced and filled into multiple specified reserved segments of the picture file.
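The size comparison, slicing, and head-position identifier described above can be sketched as follows. The identifier bytes `REC_TAG` are purely hypothetical (the patent does not define the identifier's value), and the default capacity assumes a JPEG-style segment whose 16-bit length field counts its own two bytes, leaving at most 65533 bytes of payload.

```python
from typing import List

REC_TAG = b"CALLREC\x00"   # hypothetical recording identifier placed at the head
MAX_PAYLOAD = 65533        # 16-bit length field (65535) minus the 2 bytes of the field itself


def slice_for_segments(audio: bytes,
                       tag: bytes = REC_TAG,
                       max_payload: int = MAX_PAYLOAD) -> List[bytes]:
    """Return one payload per reserved segment, each starting with the identifier.

    If the audio fits in one segment the list has a single entry;
    otherwise the audio is sliced in order across several entries.
    """
    room = max_payload - len(tag)          # bytes left for audio in each segment
    if room <= 0:
        raise ValueError("identifier leaves no room for audio data")
    chunks = [audio[i:i + room] for i in range(0, len(audio), room)] or [b""]
    return [tag + chunk for chunk in chunks]
```

Placing the identifier in every slice, rather than only the first, lets a reader recognise each segment independently; that is a design choice of this sketch.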
  • The picture file is in a picture format developed by the Joint Photographic Experts Group (JPEG), such as JPEG or JPEG2000.
  • The specified reserved segment includes one or more APP15 segments; that is, the audio data of the call recording is filled into the APP15 segment(s) of the JPEG picture.
  • the display content of the image can be static or dynamic.
  • the picture file generated in this embodiment needs to be processed for extracting and playing audio data after being opened, which will be explained in Embodiment 4.
  • the call recording can be saved in a particular picture file (non-generic format), opened using a particular application.
  • the position where the image file saves the audio data can be separately defined, and does not need to be saved in the reserved segment.
  • the embodiment of the invention further provides a computer storage medium, wherein the computer storage medium stores computer executable instructions, and the computer executable instructions are used to execute the above method.
  • the embodiment further provides a terminal that can save a call recording, including:
  • a call recording module configured to record audio data and various information of the call recording during a call recording process
  • the recording save module is configured to save the plurality of information into the display content of the picture file, and fill the audio data into a designated reserved segment of the picture file.
  • When saving the audio data and the plurality of information into a file, the recording save module further adds a recording identifier at the specified position of the specified reserved segment, where the recording identifier identifies the picture file as a picture file that saves a call recording.
  • The recording save module fills the audio data into a specified reserved segment of the picture file by: filling the audio data into one specified reserved segment of the picture file; or slicing the audio data and filling the slices separately into multiple specified reserved segments of the picture file.
  • The picture file adopts a picture format formulated by the Joint Photographic Experts Group (JPEG), and the specified reserved segment includes one or more APP15 segments.
  • the plurality of information of the call recording includes at least two of the following: a call start time, a call end time, a calling number, a called number, a call duration, a calling location, a called location, a calling contact, a called contact, and all or part of the call content obtained by voice recognition.
  • various information of the call recording is saved in the display content of the picture file, and information such as the time of the call, the location of the call, the number of the called party, and the duration of the call can be visually displayed in the form of a picture. Thereby helping the user to recall the specific scene of the call recording and improving the user experience.
  • the method for saving call recording in this embodiment is as shown in FIG. 4, and includes:
  • Step 310 Record audio data and various information of the call recording during the call recording process
  • This step is the same as step 110.
  • Step 320 Save the plurality of information to display content of one or more frames of images, and then splicing the one or more frames of images with the audio data to generate the video file.
  • the audio data of the video file generated in this step is the audio data of the call recording, and the image data includes various information such as the call time, the call number, the call duration, and the call position of the call recording. This not only expands the call recording, but also preserves the versatility of the call recording.
  • the video file generated in this embodiment is an ordinary video file, and can be played normally even if the mobile terminal is replaced.
  • During playback, the image frames containing information such as the call time, the calling number and the call duration are displayed while the audio content of the call recording is played.
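The splicing of an information image with the recorded audio could, for instance, be delegated to an external encoder. The sketch below only builds a command line for the `ffmpeg` tool; using ffmpeg at all, the H.264/AAC codecs, and the 8 kHz mono 16-bit PCM input parameters are assumptions of this example, since the patent names no tool, codec, or sample format.

```python
from typing import List


def build_ffmpeg_command(image_path: str, pcm_path: str, out_path: str,
                         sample_rate: int = 8000, channels: int = 1) -> List[str]:
    """Build (but do not run) an ffmpeg command that loops one still image
    over raw 16-bit little-endian PCM audio and writes a video file."""
    return [
        "ffmpeg",
        "-loop", "1", "-i", image_path,            # repeat the single info frame
        "-f", "s16le", "-ar", str(sample_rate),    # describe the raw PCM input
        "-ac", str(channels), "-i", pcm_path,
        "-c:v", "libx264", "-pix_fmt", "yuv420p",  # assumed video codec settings
        "-c:a", "aac",                             # assumed audio codec
        "-shortest",                               # stop when the audio ends
        out_path,
    ]
```

The resulting list can be passed to `subprocess.run` on a system where ffmpeg is installed.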
  • the embodiment of the invention further provides a computer storage medium, wherein the computer storage medium stores computer executable instructions, and the computer executable instructions are used to execute the above method.
  • the embodiment further provides a terminal that can save a call recording, including:
  • a call recording module configured to record audio data and various information of the call recording during a call recording process
  • the recording and saving module saves the plurality of pieces of information into display content of one or more frames of images, and then splices the one or more frames of images with the audio data to generate the video file.
  • the plurality of information of the call recording includes at least two of the following: a call start time, a call end time, a calling number, a called number, a call duration, a calling location, a called location, a calling contact, a called contact, and all or part of the call content obtained by voice recognition.
  • the embodiment provides a method for displaying and playing a call recording in the form of a picture, and can display and play the picture file for saving the call recording in the second embodiment.
  • the flow of the method in this embodiment is as shown in FIG. 5, and includes:
  • Step 410 After receiving an instruction to open a picture file, parsing the content of the picture file;
  • Step 420 Determine whether the picture file is a picture file that saves a call recording; if so, perform step 440; if not, perform step 430;
  • Step 430 Display the content of the picture, and end;
  • Step 440 Display display content of the picture file, and extract audio data of the call recording from other content of the picture file for playing; wherein the display content includes various information of the call recording.
  • The audio data of the call recording is extracted from the specified reserved segment for playing. If one specified reserved segment has a recording identifier at its specified position, the audio data of the call recording is extracted from that segment and played; if multiple specified reserved segments have a recording identifier at their specified positions, the audio data is extracted from each of those segments, spliced into a single piece of data, and then played.
  • The picture file adopts a picture format developed by the Joint Photographic Experts Group (JPEG), and the specified reserved segment includes one or more APP15 segments.
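The parsing and extraction just described can be sketched as a walk over the JPEG header segments. In JPEG, the file starts with the SOI marker `FF D8`, each header segment is `FF` plus a marker byte followed by a 2-byte big-endian length that includes itself, and APP15 uses marker byte `0xEF`. The identifier value `REC_TAG` and its position at the head of the segment data are assumptions of this sketch, and padding bytes between markers are not handled.

```python
import struct
from typing import Optional

REC_TAG = b"CALLREC\x00"   # hypothetical recording identifier


def extract_recording(jpeg: bytes, tag: bytes = REC_TAG) -> Optional[bytes]:
    """Return the spliced audio data from tagged APP15 segments,
    or None if the picture does not save a recording."""
    if jpeg[:2] != b"\xff\xd8":
        raise ValueError("not a JPEG stream (missing SOI marker)")
    pos, parts = 2, []
    while pos + 4 <= len(jpeg) and jpeg[pos] == 0xFF:
        marker = jpeg[pos + 1]
        if marker in (0xDA, 0xD9):          # SOS or EOI: header segments are over
            break
        (length,) = struct.unpack(">H", jpeg[pos + 2:pos + 4])
        payload = jpeg[pos + 4:pos + 2 + length]
        if marker == 0xEF and payload.startswith(tag):
            parts.append(payload[len(tag):])  # keep slices in file order
        pos += 2 + length
    return b"".join(parts) if parts else None
```

A `None` result corresponds to step 430 (display only), while a bytes result corresponds to step 440 (display and play).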
  • One or more of these processes may be performed on the terminal by the picture parsing library of the operating system framework layer.
  • a common image application can be used to open the image file for saving the call recording and to play the audio.
  • the present invention is not limited thereto.
  • the foregoing processing may also be performed by an application, that is, providing a specific application, which may implement picture display and audio playback of a picture file for saving a call recording. There is no need to change the image parsing library of the operating system framework layer.
  • the plurality of information of the call recording includes at least two of the following: a call start time, a call end time, a calling number, a called number, a call duration, a calling location, a called location, a calling contact, a called contact, and all or part of the call content obtained by voice recognition.
  • the information such as the call time, the calling number and the duration of the call is displayed in the form of a picture, and the content of the call recording is played in the form of audio.
  • a particular picture file is used as a picture file for saving call recordings.
  • the image file is a picture file for saving the call recording, and the storage location of the audio data in the image file can be agreed in the file format, and is not necessarily saved in the reserved segment.
  • the embodiment of the invention further provides a computer storage medium, wherein the computer storage medium stores computer executable instructions, and the computer executable instructions are used to execute the above method.
  • the embodiment further provides a terminal capable of playing a call recording in the form of a picture, as shown in FIG. 6, comprising:
  • the picture parsing module 50 is configured to parse the content of the picture file after receiving an instruction to open the picture file;
  • a picture display and play module 60 configured to: when the picture parsing module determines that the picture file is a picture file that saves a call recording, display the display content of the picture file, and extract the audio data of the call recording from other content of the picture file for playing; wherein the display content includes multiple kinds of information about the call recording.
  • the picture parsing module parses the content of the picture file by: parsing a specified reserved segment of the picture file, and if the specified position of the specified reserved segment has a recording identifier, determining that the picture file is a picture file that saves a call recording;
  • the picture display and playback module extracts the audio data of the call recording from the other content of the picture file for playing, including: extracting audio data of the call recording from the specified reserved segment for playing.
  • the picture display and play module extracts the audio data of the call recording from the specified reserved segment for playing, including:
  • the designated location has a recording identifier, and the audio data is separately extracted from the plurality of designated reserved segments, and is spliced into a whole piece of data and then played.
  • The picture file adopts a picture format formulated by the Joint Photographic Experts Group (JPEG), and the reserved segment includes an APP15 segment.
  • the picture parsing module performs the following processing through the picture parsing library of the operating system framework layer or through an application: parsing the specified reserved segment of the picture file, and determining that the picture file is a picture file that saves a call recording if the specified position of the specified reserved segment has a recording identifier; and/or
  • the picture display and play module performs the following processing through the picture parsing library of the operating system framework layer or through an application: extracting the audio data of the call recording from the specified reserved segment for playing; wherein playing the audio data means transferring the audio data to the audio player for playback.
  • the plurality of information of the call recording includes at least two of the following: a call start time, a call end time, a calling number, a called number, a call duration, a calling location, a called location, a calling contact, a called contact, and all or part of the call content obtained by voice recognition.
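Handing extracted audio data to an audio player usually requires telling the player how to interpret the raw samples. One possible approach, sketched below, wraps raw PCM in a WAV container using Python's standard `wave` module. The 8 kHz, mono, 16-bit parameters are assumptions; the patent does not state the sample format of the recorded PCM data.

```python
import io
import wave


def pcm_to_wav(pcm: bytes, sample_rate: int = 8000,
               channels: int = 1, sample_width: int = 2) -> bytes:
    """Wrap raw PCM samples in a WAV container held in memory."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as w:
        w.setnchannels(channels)       # assumed mono
        w.setsampwidth(sample_width)   # assumed 16-bit samples (2 bytes)
        w.setframerate(sample_rate)    # assumed 8 kHz telephone-quality audio
        w.writeframes(pcm)
    return buf.getvalue()
```

The returned bytes can be written to a temporary `.wav` file or streamed to any player that accepts WAV input.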
  • the present invention is not limited to the call recording.
  • When other speech is recorded (not necessarily a call the user makes through the terminal), the methods of the above embodiments may likewise be used to save and open the recording, and much information related to the recording can be saved, enriching the recorded content and helping users recall the specific scene of the recording.
  • This example saves the call recording as a JEPG image format, including:
  • Step 1 When the user is in a call, select recording, and at this time, the mobile terminal records the PCM audio data of the start recording time and the call recording;
  • Step 2 At the end of the call, generate a JPEG picture file, and save various information of the recorded call such as the call time, the calling number and the call duration in the display content of the picture;
  • for example, on Android the Canvas class can be called to save information such as the call time, the calling and called numbers, and the call duration in the display content of the JPEG picture;
  • on iOS, Core Graphics can be called to save this information in the display content of the JPEG picture.
  • Step 3 Fill the PCM audio data in the APP15 segment of the generated JPEG picture file.
  • the partial markers (Marker) defined by JPEG are shown in Table 1:
  • the JPEG standard has a large number of reserved segments, only a few of which are listed above.
  • the APP0-APP15 segments are markers that the JPEG standard reserves for applications' own use, and are well suited to holding audio data.
  • the definition of the JPEG APP1-APP15 partitions is shown in Table 2:
  • a partition's data segment cannot exceed 64K bytes.
  • when the recording data exceeds 64K bytes, multiple APP15 segments can be defined, and the audio data is split into fragments that are saved in those APP15 segments.
  • a threshold can be set that is smaller than the 64K-byte maximum size of an APP15 partition. The size of the PCM audio data of the call recording is compared with the threshold: if it does not exceed the threshold, the PCM data is stored in a single APP15 segment of the JPEG picture file; if it exceeds the threshold, the PCM audio data is split in order into fragments, each no larger than the threshold, and each fragment is saved in turn into its own APP15 segment.
  • in each APP15 segment, the PCM audio data is marked with the set recording identifier.
  • the recording identifier that marks the PCM audio data distinguishes the audio data from other data.
  • This example provides a method for displaying and playing a call recording in the form of a picture, including:
  • Step 1 Open a JPEG picture file on the terminal, and check whether a recording identifier exists in the APP15 segment of the JPEG picture file, to determine whether the JPEG picture file is one that saves a call recording;
  • if the recording identifier is present at both the head and the tail of the "details" part of the APP15 partition, this picture is a call recording.
  • on Android, special handling of the APP15 partition can be added to the JPEG picture parsing library of the operating system framework layer.
  • when the partition marker of an APP15 segment is read, the recording identifier is searched for in that segment.
  • if the recording identifier is found at both the head and the tail of the "details" part of the APP15 partition, it can be determined that the JPEG picture file is one that saves a call recording.
  • the JPEG picture parsing library of the operating system framework layer can notify the upper-layer application that the JPEG picture file is one that saves a call recording.
  • if it is not a call recording, the JPEG picture file is opened normally and the picture is displayed.
  • Step 2 If a recording identifier is present, the JPEG picture file is one that saves a call recording: the display content of the picture (which here includes information such as the call time, the calling and called numbers, and the call duration) is first displayed as usual, and at the same time the recording data hidden in the APP15 segment is extracted and played to the user as audio.
  • if there are multiple APP15 segments, the PCM audio data is extracted segment by segment and spliced into one whole piece of data before being played.
  • on Android, the splicing can be done in the JPEG parsing logic of the operating system framework layer. After parsing is complete, the PCM audio data stream is passed to the mediaplayer for playback. Doing this work in the framework layer reduces the work of the upper-layer application.
  • the method includes:
  • Step 1 When the user is in a call, select recording, at this time, the mobile terminal records the recording time of starting the call and starts recording the PCM audio data of the call recording;
  • Step 2 At the end of the call, generate a frame image, and save information such as the call time, the call number, the call duration, and the call position into the frame image;
  • multiple frames of images may also be generated to hold the information.
  • Step 3 splicing the frame image generated in step 2 and the PCM audio data of the call recording to generate a video file.
  • in one example, the result is spliced into an MPEG4 video file: the generated frame image is used as the first frame and the PCM audio data as the audio track.
  • a video of the same duration as the PCM data is generated, interpolating frame by frame along the time dimension.
  • the audio data of the call recording is used as the audio content of the generated video file, and the video content is filled with image frames built from information such as the call time, the calling and called numbers, and the call duration; once the two are spliced, playing the video plays the audio of the call recording as usual and also visibly displays that information.
  • each module/unit in the above embodiments may be implemented in hardware, for example by an integrated circuit that implements its function, or as a software function module, for example by a processor executing programs/instructions stored in memory to implement its function.
  • the invention is not limited to any specific form of combination of hardware and software.
  • the above technical solution avoids the limitation that the related technology suffers when the call recording information is saved in the file name, and the display content (such as the picture display content and the video display content) itself can record a lot of related information during the call, enriching the content of the call recording. It helps users to recall the specific scene of this call recording, thus improving the user experience.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Television Signal Processing For Recording (AREA)

Abstract

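A method and a terminal for saving a recording, and for displaying and playing a recording in picture form. During recording, the terminal records the audio data and multiple items of information of the recording (110), and saves the audio data and the information, the information being saved in the display content of a file (120). After receiving an instruction to open a picture file, the terminal parses the content of the picture file; if the picture file is one that stores a recording, the terminal displays the display content of the picture file and extracts the audio data of the recording from the picture file for playback, the display content containing the multiple items of information of the recording. The method extends the information saved with a call recording by storing it in the display content of the file, which improves the user experience and can also provide a degree of confidentiality.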

Description

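Method and terminal for saving a recording and for displaying and playing a recording in picture form
Technical Field
This document relates to, but is not limited to, data storage technology, and in particular to a method and a terminal for saving and playing recordings.
Background
At present, recordings are saved in a fairly uniform way: the recording is generally saved as an audio file, and files are then told apart by their file names, which record either the call time or the call number. Saved this way, a call recording carries very little information and is thin when presented. Because of limits such as file-name length, detailed information about the call recording cannot be saved, and confidentiality is poor. When a long time has passed since the call, the user often cannot recall the specific circumstances of the recording. For example, dual-SIM mobile terminals are now very common, and users often switch between several cards in one terminal. The technical solutions currently used for call recording on mobile terminals cannot present multiple kinds of call information.
Summary
The following is an overview of the subject matter described in detail herein. This overview is not intended to limit the scope of protection of the claims.
Embodiments of the present invention provide a method and a terminal for saving and playing recordings, to solve the technical problem of how to present multiple kinds of call information.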
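A method for saving a recording, applied to a terminal, comprises:
during recording, recording the audio data and multiple items of information of the recording;
saving the audio data and the multiple items of information to a file, wherein the multiple items of information are saved in the display content of the file.
Optionally,
saving the audio data and the multiple items of information to a file comprises: saving the audio data and the multiple items of information to the same file.
Optionally,
the file is a picture file;
saving the audio data and the multiple items of information to the same file comprises: saving the multiple items of information in the display content of the picture file, and filling the audio data into a designated reserved segment of the picture file.
Optionally,
saving the audio data and the multiple items of information to the same file further comprises: adding a recording identifier at a designated position of the designated reserved segment, the recording identifier identifying the picture file as a picture file that stores a recording.
Optionally,
filling the audio data into the designated reserved segment of the picture file comprises: filling the audio data into one designated reserved segment of the picture file; or splitting the audio data into fragments and filling them into multiple designated reserved segments of the picture file.
Optionally,
the picture file uses a picture format defined by the Joint Photographic Experts Group (JPEG), and the designated reserved segment comprises one or more APP15 segments.
Optionally,
the file is a video file;
saving the audio data and the multiple items of information to the same file comprises: saving the multiple items of information in the display content of one or more frames of images, and splicing the frame or frames with the audio data to generate the video file.
Optionally,
saving the audio data and the multiple items of information to a file comprises: saving the audio data and the multiple items of information to multiple files. The file names of the multiple files are at least partly the same.
Optionally,
the multiple items of information include at least two of the following: call start time, call end time, calling number, called number, call duration, calling location, called location, calling contact, called contact, and all or part of the call content obtained by speech recognition.
Optionally,
the recording is a call recording.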
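A method for displaying and playing a recording in picture form, applied to a terminal, comprises:
after receiving an instruction to open a picture file, parsing the content of the picture file;
if the picture file is a picture file that stores a recording, displaying the display content of the picture file and extracting the audio data of the recording from the picture file for playback, wherein the display content contains multiple items of information of the recording.
Optionally,
parsing the content of the picture file comprises: parsing a designated reserved segment of the picture file; if a recording identifier is present at a designated position of the designated reserved segment, the picture file is a picture file that stores a recording;
extracting the audio data of the recording from the other content of the picture file for playback comprises: extracting the audio data of the recording from the designated reserved segment for playback.
Optionally,
extracting the audio data of the recording from the designated reserved segment for playback comprises:
if one designated reserved segment has a recording identifier at its designated position, extracting the audio data of the recording from that segment and playing it;
if multiple designated reserved segments have a recording identifier at their designated positions, extracting the audio data from each of those segments, splicing it into one whole piece of data, and playing it.
Optionally,
the picture file uses a picture format defined by the Joint Photographic Experts Group (JPEG), and the designated reserved segment comprises one or more APP15 segments.
Optionally,
one or more of the following operations are performed by the picture parsing library of the terminal's operating system framework layer, or by an application on the terminal:
parsing the designated reserved segment of the picture file and, if a recording identifier is present at the designated position of the segment data, determining that the picture file is a picture file that stores a recording;
extracting the audio data of the recording from the designated reserved segment for playback, wherein playing the audio data means passing the audio data to an audio player for playback.
Optionally,
the multiple items of information of the recording include at least two of the following: call start time, call end time, calling number, called number, call duration, calling location, called location, calling contact, called contact, and all or part of the call content obtained by speech recognition.
Optionally,
the recording is a call recording.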
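A terminal capable of saving a recording comprises:
a recording module, configured to record the audio data and multiple items of information of a recording during recording;
a recording saving module, configured to save the audio data and the multiple items of information to a file, wherein the multiple items of information are saved in the display content of the file.
Optionally,
the recording saving module is configured to save the audio data and the multiple items of information to a file by saving them to the same file.
Optionally,
the file is a picture file;
the recording saving module is configured to save the audio data and the multiple items of information to the same file by saving the multiple items of information in the display content of the picture file and filling the audio data into a designated reserved segment of the picture file.
Optionally,
the recording saving module is further configured, when saving the audio data and the multiple items of information to the same file, to add a recording identifier at a designated position of the designated reserved segment, the recording identifier identifying the picture file as a picture file that stores a recording.
Optionally,
the recording saving module is configured to fill the audio data into the designated reserved segment of the picture file by: filling the audio data into one designated reserved segment of the picture file; or splitting the audio data into fragments and filling them into multiple designated reserved segments of the picture file.
Optionally,
the picture file uses a picture format defined by the Joint Photographic Experts Group (JPEG), and the designated reserved segment comprises one or more APP15 segments.
Optionally,
the file is a video file;
the recording saving module is configured to save the audio data and the multiple items of information to the same file by saving the multiple items of information in the display content of one or more frames of images, and splicing the frame or frames with the audio data to generate the video file.
Optionally,
the recording saving module is configured to save the audio data and the multiple items of information by saving them to multiple files. The file names of the multiple files are at least partly the same.
Optionally,
the multiple items of information of the recording include at least two of the following: call start time, call end time, calling number, called number, call duration, calling location, called location, calling contact, called contact, and all or part of the call content obtained by speech recognition.
Optionally,
the recording is a call recording.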
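A terminal capable of playing a recording in picture form comprises:
a picture parsing module, configured to parse the content of a picture file after receiving an instruction to open the picture file;
a picture display and playback module, configured, when the picture parsing module determines that the picture file is a picture file that stores a recording, to display the display content of the picture file and to extract the audio data of the recording from the other content of the picture file for playback, wherein the display content contains multiple items of information of the recording.
Optionally,
the picture parsing module is configured to parse the content of the picture file by: parsing a designated reserved segment of the picture file and, if a recording identifier is present at a designated position of the designated reserved segment, determining that the picture file is a picture file that stores a recording;
the picture display and playback module is configured to extract the audio data of the recording from the other content of the picture file for playback by extracting the audio data of the recording from the designated reserved segment for playback.
Optionally,
the picture display and playback module is configured to extract the audio data of the recording from the designated reserved segment for playback as follows:
if one designated reserved segment has a recording identifier at its designated position, extracting the audio data of the recording from that segment and playing it;
if multiple designated reserved segments have a recording identifier at their designated positions, extracting the audio data from each of those segments, splicing it into one whole piece of data, and playing it.
Optionally,
the picture file uses a picture format defined by the Joint Photographic Experts Group (JPEG), and the reserved segment comprises an APP15 segment.
Optionally,
the picture parsing module is configured to perform the following processing through the picture parsing library of the operating system framework layer or through an application: parsing the designated reserved segment of the picture file and, if a recording identifier is present at the designated position of the designated reserved segment, determining that the picture file is a picture file that stores a recording; and/or
the picture display and playback module is configured to perform the following processing through the picture parsing library of the operating system framework layer or through an application: extracting the audio data of the recording from the designated reserved segment for playback, wherein playing the audio data means passing the audio data to an audio player for playback.
Optionally,
the multiple items of information of the recording include at least two of the following: call start time, call end time, calling number, called number, call duration, calling location, called location, calling contact, called contact, and all or part of the call content obtained by speech recognition.
Optionally,
the recording is a call recording.
An embodiment of the present invention further provides a computer storage medium storing computer-executable instructions for performing the above methods.
Embodiments of the present invention extend the information saved with a call recording by storing it in the display content of a file, which improves the user experience and can also provide a degree of confidentiality.
Other aspects will become apparent upon reading and understanding the drawings and the detailed description.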
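Brief Description of the Drawings
FIG. 1 is a flowchart of the method for saving a call recording according to Embodiment 1 of the present invention;
FIG. 2 is a block diagram of the terminal capable of saving a call recording according to Embodiment 1 of the present invention;
FIG. 3 is a flowchart of the method for saving a call recording according to Embodiment 2 of the present invention;
FIG. 4 is a flowchart of the method for saving a call recording according to Embodiment 3 of the present invention;
FIG. 5 is a flowchart of the method for displaying and playing a call recording in picture form according to Embodiment 4 of the present invention;
FIG. 6 is a block diagram of the terminal capable of playing a call recording in picture form according to Embodiment 4 of the present invention;
FIG. 7 is a schematic diagram of filling data into the APP15 segment of a JPEG picture file according to Application Example 1 of the present invention.
Detailed Description
Embodiments of the present invention are described in detail below with reference to the drawings. It should be noted that, where no conflict arises, the embodiments of this application and the features in them may be combined with one another arbitrarily.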
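Embodiment 1
This embodiment extends a call recording by saving multiple items of information about it in display content. Herein, the multiple items of information of a recording are the various pieces of information related to the recording other than the audio data; recordings include call recordings, and the multiple items of information of a call recording include the call time, call number, call duration, call location, call content, and so on.
The method of this embodiment for saving a call recording is applied to a terminal and, as shown in FIG. 1, comprises:
Step 110: during call recording, record the audio data and multiple items of information of the call recording.
This embodiment takes a mobile terminal as an example. While the mobile terminal is in a call, call recording is started manually by the user or automatically; during call recording the mobile terminal records the audio data in real time, for example in a buffer. The recorded audio data may be PCM (Pulse Code Modulation) audio data, but the invention is not limited to this.
During and at the end of call recording, the mobile terminal can obtain multiple items of information about the call recording. These may include, for example, at least two of the following: call start time, call end time, calling number, called number, call duration, calling location, called location, calling contact, called contact, and all or part of the call content obtained by speech recognition.
Step 120: save the audio data and the multiple items of information to a file, wherein the multiple items of information are saved in the display content of the file.
In this embodiment, the audio data and the multiple items of information may be saved to the same file.
In another embodiment, however, the audio data and the multiple items of information are saved to multiple files. To associate the audio data with the information, the file names of the multiple files are at least partly the same. Ways of saving to multiple files include: saving the information in the display content of a picture file while saving the audio data in an audio file; or having each of the multiple files hold part of the recording's data, with every file holding the multiple items of information of the recording.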
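The multi-file variant can be sketched as follows. This is only an illustration of "file names at least partly the same": the naming pattern, the `.pcm` extension, and the helper names `companion_names` and `belong_together` are assumptions, not something the application specifies.

```python
from datetime import datetime
from pathlib import PurePath
from typing import Tuple


def companion_names(start: datetime, number: str) -> Tuple[str, str]:
    """Return (picture_name, audio_name) that share one stem (hypothetical scheme)."""
    stem = f"call_{start:%Y%m%d_%H%M%S}_{number}"
    return f"{stem}.jpg", f"{stem}.pcm"


def belong_together(a: str, b: str) -> bool:
    """Two files are treated as one recording when their stems match."""
    return PurePath(a).stem == PurePath(b).stem
```

With a shared stem, the picture that carries the display content and the audio file can be paired again without any index file.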
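The file that holds the audio data and the multiple items of information may be any file that has display content and can hold audio data; Embodiments 2 and 3 below use a picture file and a video file, respectively, as examples.
In the display content of a picture, the information is usually shown as text, but may also be shown in other forms: the call time may be shown as a corresponding clock pattern, the call location as a corresponding map, and so on.
A file may be generated after call recording ends and the buffered audio data and information saved in it; alternatively, a temporary file may be generated as soon as call recording begins, the captured audio data and information saved in it, and the final file generated after call recording ends.
An embodiment of the present invention further provides a computer storage medium storing computer-executable instructions for performing the above method.
This embodiment further provides a terminal capable of saving a call recording, which may be, but is not limited to, a mobile terminal such as a mobile phone. As shown in FIG. 2, the terminal comprises:
a call recording module 10, configured to record the audio data and multiple items of information of a call recording during call recording;
a recording saving module 20, configured to save the audio data and the multiple items of information to a file, wherein the multiple items of information are saved in the display content of the file.
Optionally,
the multiple items of information of the call recording include at least two of the following: call start time, call end time, calling number, called number, call duration, calling location, called location, calling contact, called contact, and all or part of the call content obtained by speech recognition.
Saving the multiple items of information of a call recording in the display content of a file avoids the limitations the related art faces when storing call-recording information in the file name; the display content (such as picture or video display content) can itself record a great deal of information about the call, enriching the content of the call recording. This helps the user recall the specific circumstances of the call recording and so improves the user experience.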
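Embodiment 2
This embodiment saves the multiple items of information of a call recording in the same picture file as the audio data.
The method of this embodiment for saving a call recording, as shown in FIG. 3, comprises:
Step 210: during call recording, record the audio data and multiple items of information of the call recording.
This step is the same as step 110.
Step 220: save the multiple items of information in the display content of the picture file, and fill the audio data into a designated reserved segment of the picture file.
In this step, if the designated reserved segment is dedicated to holding the audio data, the presence of audio data can be determined simply by checking whether the segment is empty. If the segment is not dedicated to audio data, a recording identifier must be added at a designated position of the segment to mark the picture file as one that stores a call recording. The designated position may be the head and/or the tail of the segment's data.
When the audio data is filled into the designated reserved segment of the picture file, the size of the audio data is compared with the size of the segment. If the audio data fits in one designated reserved segment, it is filled directly into one such segment of the picture file; if it is larger than the segment, it must be split into fragments that are filled into multiple designated reserved segments of the picture file.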
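The size comparison and fragmentation just described can be sketched as below. The function name `split_audio` and the `segment_capacity` parameter are illustrative assumptions; the application only says that oversized audio is split across several designated reserved segments.

```python
from typing import List


def split_audio(audio: bytes, segment_capacity: int) -> List[bytes]:
    """Return the audio as one fragment if it fits, else as ordered fragments.

    segment_capacity is the number of audio bytes one reserved segment can
    hold (an assumed parameter; it must already account for any identifier).
    """
    if segment_capacity <= 0:
        raise ValueError("segment_capacity must be positive")
    if len(audio) <= segment_capacity:
        return [audio]  # fits in a single designated reserved segment
    # Otherwise cut the data in order so it can be re-joined by concatenation.
    return [audio[i:i + segment_capacity]
            for i in range(0, len(audio), segment_capacity)]
```

Because the fragments are cut in order, joining them in the same order restores the original audio, which is what the playback side in Embodiment 4 relies on.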
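In one example, the picture file uses a picture format defined by the Joint Photographic Experts Group (JPEG), such as JPEG or JPEG 2000. The designated reserved segment comprises one or more APP15 segments; that is, the audio data of the call recording is filled into the APP15 segment or segments of the JPEG picture. It should be noted, however, that the invention places no restriction on the picture format, and other formats may be used. The display content of the picture may be static or dynamic.
Once the call recording has been saved, the multiple items of information need to be presented to the user and the audio data played. After being opened, the picture file generated in this embodiment also requires its audio data to be extracted and played, as described in Embodiment 4.
In another embodiment, the call recording may be saved in a special (non-generic) picture file format that is opened with a specific application. In that case, the location at which the picture file holds the audio data can be defined separately, and need not be a reserved segment.
An embodiment of the present invention further provides a computer storage medium storing computer-executable instructions for performing the above method.
This embodiment further provides a terminal capable of saving a call recording, comprising:
a call recording module, configured to record the audio data and multiple items of information of a call recording during call recording;
a recording saving module, configured to save the multiple items of information in the display content of the picture file and to fill the audio data into a designated reserved segment of the picture file.
Optionally,
the recording saving module saving the audio data and the multiple items of information to one file further comprises: adding a recording identifier at a designated position of the designated reserved segment, the recording identifier identifying the picture file as a picture file that stores a call recording.
Optionally,
the recording saving module filling the audio data into the designated reserved segment of the picture file comprises: filling the audio data into one designated reserved segment of the picture file; or splitting the audio data into fragments and filling them into multiple designated reserved segments of the picture file.
Optionally,
the picture file uses a picture format defined by the Joint Photographic Experts Group (JPEG), and the designated reserved segment comprises one or more APP15 segments.
Optionally,
the multiple items of information of the call recording include at least two of the following: call start time, call end time, calling number, called number, call duration, calling location, called location, calling contact, called contact, and all or part of the call content obtained by speech recognition.
This embodiment saves the multiple items of information of a call recording in the display content of a picture file, so that information such as the call time, call location, calling and called numbers, and call duration can be presented visually as a picture. This helps the user recall the specific circumstances of the call recording and improves the user experience.
Because the picture file of this embodiment differs from an ordinary picture file, the recording cannot be played normally even if the file is copied to another handset. Someone unfamiliar with handsets that use this scheme may simply take a call recording for an ordinary picture, which amounts to a light form of encryption.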
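Embodiment 3
This embodiment saves the multiple items of information of a call recording in the same video file as the audio data.
The method of this embodiment for saving a call recording, as shown in FIG. 4, comprises:
Step 310: during call recording, record the audio data and multiple items of information of the call recording.
This step is the same as step 110.
Step 320: save the multiple items of information in the display content of one or more frames of images, and splice the frame or frames with the audio data to generate the video file.
The audio data of the video file generated in this step is the audio data of the call recording, and the image data contains information such as the call time, call number, call duration, and call location. This both extends the call recording and preserves its portability: the video file generated in this embodiment is an ordinary video file and plays normally even on a different mobile terminal.
With the method of this embodiment, when the user opens the video file that stores the call recording, the image frames containing information such as the call time, calling and called numbers, and call duration are played, and the audio content of the call recording is played at the same time.
An embodiment of the present invention further provides a computer storage medium storing computer-executable instructions for performing the above method.
This embodiment further provides a terminal capable of saving a call recording, comprising:
a call recording module, configured to record the audio data and multiple items of information of a call recording during call recording;
a recording saving module, configured to save the multiple items of information in the display content of one or more frames of images and to splice the frame or frames with the audio data to generate the video file.
Optionally,
the multiple items of information of the call recording include at least two of the following: call start time, call end time, calling number, called number, call duration, calling location, called location, calling contact, called contact, and all or part of the call content obtained by speech recognition.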
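Embodiment 4
This embodiment provides a method for displaying and playing a call recording in picture form, which can display and play the picture file in which Embodiment 2 saves a call recording.
The flow of the method of this embodiment, as shown in FIG. 5, comprises:
Step 410: after receiving an instruction to open a picture file, parse the content of the picture file.
Step 420: determine whether the picture file is a picture file that stores a call recording; if so, perform step 440; if not, perform step 430.
In this embodiment, whether the picture file stores a call recording is determined as follows: the designated reserved segment of the picture file is parsed; if a recording identifier is present at the designated position of the segment, the picture file is determined to be one that stores a call recording; if no recording identifier is present there, the picture file is determined not to be one that stores a call recording.
Step 430: display the content of the picture, and end.
Step 440: display the display content of the picture file, and extract the audio data of the call recording from the other content of the picture file for playback, wherein the display content contains the multiple items of information of the call recording.
In this embodiment, the audio data of the call recording is extracted from the designated reserved segment for playback. If one designated reserved segment has a recording identifier at its designated position, the audio data of the call recording is extracted from that segment and played; if multiple designated reserved segments have a recording identifier at their designated positions, the audio data is extracted from each of those segments, spliced into one whole piece of data, and played.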
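The decision in steps 420 to 440 can be sketched on already-parsed segment payloads as follows. This is a minimal sketch under stated assumptions: `RECORDING_ID` is a placeholder value, the identifier is assumed to sit at both the head and the tail of each payload, and the function names are hypothetical.

```python
from typing import Iterable, Optional

RECORDING_ID = b"REC-ID"  # placeholder identifier, assumed for illustration


def unwrap(payload: bytes, ident: bytes = RECORDING_ID) -> Optional[bytes]:
    """Return the audio inside one reserved-segment payload, or None if unmarked."""
    marked = (len(payload) >= 2 * len(ident)
              and payload.startswith(ident)
              and payload.endswith(ident))
    if not marked:
        return None
    return payload[len(ident):len(payload) - len(ident)]


def extract_recording(payloads: Iterable[bytes],
                      ident: bytes = RECORDING_ID) -> Optional[bytes]:
    """None means an ordinary picture (step 430); bytes means play them (step 440)."""
    parts = [audio for audio in (unwrap(p, ident) for p in payloads)
             if audio is not None]
    # One marked segment yields its audio; several are spliced in file order.
    return b"".join(parts) if parts else None
```

A caller would display the picture in either case and hand the returned bytes to an audio player only when the result is not `None`.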
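In one example, the picture file uses a picture format defined by the Joint Photographic Experts Group (JPEG), and the designated reserved segment comprises one or more APP15 segments.
Optionally, in the above method of this embodiment, one or more of the following operations are performed by the picture parsing library of the terminal's operating system framework layer:
parsing the designated reserved segment of the picture file and, if a recording identifier is present at the designated position of the designated reserved segment, determining that the picture file is a picture file that stores a call recording;
extracting the audio data of the call recording from the designated reserved segment for playback, wherein playing the audio data means passing the audio data to an audio player for playback.
When the picture parsing library of the system framework layer performs these operations, an ordinary picture application can open a picture file that stores a call recording and play its audio. The invention is not limited to this, however: in another implementation the above processing may be performed by an application, that is, a specific application is provided that displays the picture and plays the audio of a picture file that stores a call recording, in which case the picture parsing library of the operating system framework layer need not be modified.
The multiple items of information of the call recording include at least two of the following: call start time, call end time, calling number, called number, call duration, calling location, called location, calling contact, called contact, and all or part of the call content obtained by speech recognition.
With the method of this embodiment, when the user opens a picture file that stores a call recording, information such as the call time, calling and called numbers, and call duration is presented as a picture, and the content of the call recording is played as audio.
In another embodiment, a special picture file format is used for the picture file that stores the call recording. In that case, parsing the picture file format itself reveals that the file stores a call recording, and the location of the audio data within the file can be fixed by the file format and need not be a reserved segment.
An embodiment of the present invention further provides a computer storage medium storing computer-executable instructions for performing the above method.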
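This embodiment further provides a terminal capable of playing a call recording in picture form which, as shown in FIG. 6, comprises:
a picture parsing module 50, configured to parse the content of a picture file after receiving an instruction to open the picture file;
a picture display and playback module 60, configured, when the picture parsing module determines that the picture file is a picture file that stores a call recording, to display the display content of the picture file and to extract the audio data of the call recording from the other content of the picture file for playback, wherein the display content contains the multiple items of information of the call recording.
Optionally,
the picture parsing module parsing the content of the picture file comprises: parsing the designated reserved segment of the picture file and, if a recording identifier is present at the designated position of the designated reserved segment, determining that the picture file is a picture file that stores a call recording;
the picture display and playback module extracting the audio data of the call recording from the other content of the picture file for playback comprises: extracting the audio data of the call recording from the designated reserved segment for playback.
Optionally,
the picture display and playback module extracting the audio data of the call recording from the designated reserved segment for playback comprises:
if one designated reserved segment has a recording identifier at its designated position, extracting the audio data of the call recording from that segment and playing it;
if multiple designated reserved segments have a recording identifier at their designated positions, extracting the audio data from each of those segments, splicing it into one whole piece of data, and playing it.
Optionally,
the picture file uses a picture format defined by the Joint Photographic Experts Group (JPEG), and the reserved segment comprises an APP15 segment.
Optionally,
the picture parsing module performs the following processing through the picture parsing library of the operating system framework layer or through an application: parsing the designated reserved segment of the picture file and, if a recording identifier is present at the designated position of the designated reserved segment, determining that the picture file is a picture file that stores a call recording; and/or
the picture display and playback module performs the following processing through the picture parsing library of the operating system framework layer or through an application: extracting the audio data of the call recording from the designated reserved segment for playback, wherein playing the audio data means passing the audio data to an audio player for playback.
Optionally,
the multiple items of information of the call recording include at least two of the following: call start time, call end time, calling number, called number, call duration, calling location, called location, calling contact, called contact, and all or part of the call content obtained by speech recognition.
Although the above embodiments all use call recordings as the example, the invention is not limited to call recordings. Other recordings can also be saved and opened with the methods of the above embodiments, and a good deal of information about the speech (not necessarily a call the user makes between terminals) can be recorded to enrich the content of the recording and help the user recall the specific circumstances of the recording.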
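Several examples from concrete applications are described below.
Example 1
This example saves a call recording in the JPEG picture format, and comprises:
Step 1: the user selects recording during a call; the mobile terminal then records the time at which call recording started and the PCM audio data of the call recording.
Step 2: when the call ends, a JPEG picture file is generated, and multiple items of information about the recorded call, such as the call time, the calling and called numbers, and the call duration, are saved in the display content of the picture.
For example, on Android the Canvas class can be called to save information such as the call time, the calling and called numbers, and the call duration in the display content of the JPEG picture; on iOS, Core Graphics can be called to save this information in the display content of the JPEG picture.
Step 3: the PCM audio data is filled into the APP15 segment of the generated JPEG picture file.
Some of the markers defined by JPEG are shown in Table 1:
Table 1
Figure PCTCN2015098960-appb-000001
Figure PCTCN2015098960-appb-000002
The JPEG standard has many reserved segments, and only a few are listed above. Among them, the APP0 to APP15 segments are markers that the JPEG standard reserves for applications' own use, and are well suited to holding audio data.
The definition of the JPEG APP1 to APP15 partitions is shown in Table 2:
Table 2
Figure PCTCN2015098960-appb-000003
As can be seen, a partition's data segment cannot exceed 64K bytes. When the recording data exceeds 64K bytes, multiple APP15 segments can be defined, and the audio data is split into fragments that are saved in those APP15 segments. Optionally, after the JPEG picture file has been generated, a threshold can be set that is smaller than the 64K-byte maximum size of an APP15 partition. The size of the PCM audio data of the call recording is compared with the threshold: if it does not exceed the threshold, the PCM data is stored in a single APP15 segment of the JPEG picture file; if it exceeds the threshold, the PCM audio data is split in order into fragments, each no larger than the threshold, and each fragment is saved in turn into its own APP15 segment.
In each APP15 segment, the PCM audio data is marked with the set recording identifier. The recording identifier may be a specific character string, =_Record_B38787EHK79EH67IKJ, which appears at both the head and the tail of the "details" part of the APP15 partition, with the PCM audio data filled in between the two identifiers, as shown in FIG. 7. The recording identifier that marks the PCM audio data distinguishes the audio data from other data.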
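Step 3 can be sketched at the byte level as follows. The sketch relies on two facts about JPEG marker segments: APP15 is the marker `0xFFEF`, and the two-byte big-endian length field counts itself, so a payload holds at most 65533 bytes. Everything else is an assumption made for illustration: the function names, the default threshold of 60000 PCM bytes per segment, and the choice to place the new segments after any APPn segments that already follow the start-of-image marker.

```python
import struct

APP15 = b"\xff\xef"                              # JPEG APP15 marker
RECORDING_ID = b"=_Record_B38787EHK79EH67IKJ"    # identifier string from Example 1
MAX_PAYLOAD = 0xFFFF - 2                         # length field includes its own 2 bytes


def build_app15_segments(pcm: bytes, threshold: int = 60000) -> bytes:
    """Wrap PCM data in one or more APP15 segments, identifier at head and tail."""
    if threshold <= 0 or threshold + 2 * len(RECORDING_ID) > MAX_PAYLOAD:
        raise ValueError("threshold does not fit in one APP15 segment")
    chunks = [pcm[i:i + threshold] for i in range(0, len(pcm), threshold)] or [b""]
    out = bytearray()
    for chunk in chunks:
        payload = RECORDING_ID + chunk + RECORDING_ID
        out += APP15 + struct.pack(">H", len(payload) + 2) + payload
    return bytes(out)


def embed(jpeg: bytes, pcm: bytes, threshold: int = 60000) -> bytes:
    """Insert the APP15 segments after the leading APPn segments of a JPEG."""
    if jpeg[:2] != b"\xff\xd8":
        raise ValueError("not a JPEG: missing SOI marker")
    pos = 2
    # Skip existing APP0..APP15 segments so JFIF/Exif headers keep their place.
    while pos + 4 <= len(jpeg) and jpeg[pos] == 0xFF and 0xE0 <= jpeg[pos + 1] <= 0xEF:
        (length,) = struct.unpack(">H", jpeg[pos + 2:pos + 4])
        pos += 2 + length
    return jpeg[:pos] + build_app15_segments(pcm, threshold) + jpeg[pos:]
```

Because marker segments are length-delimited, the PCM bytes need no escaping even when they contain `0xFF`; decoders that do not know the identifier simply skip the APP15 segments and show the picture.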
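Example 2
This example provides a method for displaying and playing a call recording in picture form, comprising:
Step 1: a JPEG picture file is opened on the terminal, and the APP15 segment of the file is checked for a recording identifier to determine whether the file is a JPEG picture file that stores a call recording.
In this example, if the recording identifier is present at both the head and the tail of the "details" part of the APP15 partition, the picture is a call recording. On Android, special handling of the APP15 partition can be added to the JPEG picture parsing library of the operating system framework layer: when the partition marker of an APP15 segment is read, the recording identifier is searched for in that segment, and if it is found at both the head and the tail of the "details" part of the APP15 partition, the JPEG picture file is determined to be one that stores a call recording. The JPEG picture parsing library of the operating system framework layer can notify the upper-layer application that the JPEG picture file stores a call recording.
If the file is not a call recording, the JPEG picture file is opened normally and the picture is displayed.
Step 2: if a recording identifier is present, the JPEG picture file is one that stores a call recording; the display content of the picture (which here includes information such as the call time, the calling and called numbers, and the call duration) is first displayed as usual, and at the same time the recording data hidden in the APP15 segment is extracted and played to the user as audio.
When extracting the recording data hidden in the APP15 segment, the number of APP15 segments must be determined; if there are several, the PCM audio data is extracted segment by segment and spliced into one whole piece of data before being played. On Android, the splicing can be done in the JPEG parsing logic of the operating system framework layer. After parsing is complete, the PCM audio data stream is passed to the mediaplayer for playback. Doing this work in the framework layer reduces the work of the upper-layer application.
It should be noted that the above special handling of the APP15 segment can also be done in the upper-layer application.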
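The lookup and splicing in steps 1 and 2 can be sketched as a walk over the JPEG marker segments. The sketch assumes the layout of Example 1 (identifier at the head and tail of each APP15 payload), stops at the start-of-scan marker `0xFFDA`, and ignores rarer cases such as fill bytes between segments; the function name `extract_pcm` is hypothetical.

```python
import struct
from typing import Optional

RECORDING_ID = b"=_Record_B38787EHK79EH67IKJ"  # identifier string from Example 1


def extract_pcm(jpeg: bytes) -> Optional[bytes]:
    """Return the spliced PCM data, or None if the JPEG holds no recording."""
    if jpeg[:2] != b"\xff\xd8":
        raise ValueError("not a JPEG: missing SOI marker")
    pos, parts = 2, []
    while pos + 4 <= len(jpeg) and jpeg[pos] == 0xFF:
        marker = jpeg[pos + 1]
        if marker == 0xDA:          # SOS: compressed image data follows, stop here
            break
        (length,) = struct.unpack(">H", jpeg[pos + 2:pos + 4])
        if length < 2:              # malformed segment, give up scanning
            break
        payload = jpeg[pos + 4:pos + 2 + length]
        if (marker == 0xEF          # APP15
                and len(payload) >= 2 * len(RECORDING_ID)
                and payload.startswith(RECORDING_ID)
                and payload.endswith(RECORDING_ID)):
            parts.append(payload[len(RECORDING_ID):len(payload) - len(RECORDING_ID)])
        pos += 2 + length
    return b"".join(parts) if parts else None
```

A `None` result corresponds to opening the file as an ordinary picture; otherwise the returned bytes are the PCM stream that would be handed to the audio player.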
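Example 3
This example saves a call recording as a video file. The method comprises:
Step 1: the user selects recording during a call; the mobile terminal then records the time at which call recording started and begins recording the PCM audio data of the call recording.
Step 2: when the call ends, one frame of image is generated, and information such as the call time, call number, call duration, and call location is saved into that frame.
In another example, multiple frames of images may be generated to hold this information.
Step 3: the frame generated in step 2 and the PCM audio data of the call recording are spliced to generate a video file.
In one example, the result is an MPEG4 video file: the generated frame is used as the first frame and the PCM audio data as the audio track, and a video of the same duration as the PCM data is generated by interpolating frame by frame along the time dimension.
This example uses the audio data of the call recording as the audio content of the generated video file, and fills the video content with image frames built from information such as the call time, the calling and called numbers, and the call duration. Once the two are spliced, playing the video plays the audio of the call recording as usual and also visibly displays the call time, calling and called numbers, call duration, and other information.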
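How long the generated video must be, and how many frames it needs, follows from the size of the PCM data. The sketch below works this out under assumed parameters that the application does not specify: 8 kHz, mono, 16-bit PCM and 25 frames per second. It also assumes one plausible reading of "interpolating frame by frame", namely that the still frame is repeated for every frame slot.

```python
import math
from typing import Tuple


def video_frame_count(pcm_len: int, sample_rate: int = 8000, channels: int = 1,
                      bytes_per_sample: int = 2, fps: int = 25) -> Tuple[float, int]:
    """Return (duration in seconds, number of frames) matching the PCM track."""
    bytes_per_second = sample_rate * channels * bytes_per_sample
    duration = pcm_len / bytes_per_second
    # At least one frame so the information picture is always present.
    frames = max(1, math.ceil(duration * fps))
    return duration, frames
```

For instance, 960000 bytes of PCM at the assumed settings is 60 seconds of audio, so the information frame would be repeated 1500 times.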
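Those of ordinary skill in the art will understand that all or some of the steps of the above methods may be carried out by a program instructing the relevant hardware (for example, a processor), and that the program may be stored in a computer-readable storage medium such as a read-only memory, a magnetic disk, or an optical disc. Optionally, all or some of the steps of the above embodiments may also be implemented with one or more integrated circuits. Accordingly, each module/unit in the above embodiments may be implemented in hardware, for example by an integrated circuit that implements its function, or as a software function module, for example by a processor executing programs/instructions stored in memory to implement its function. The invention is not limited to any specific combination of hardware and software.
Those of ordinary skill in the art should understand that modifications or equivalent substitutions may be made to the technical solutions of the present invention without departing from their spirit and scope, and that all such modifications and substitutions fall within the scope of the claims of the present invention.
Industrial Applicability
The above technical solution avoids the limitations the related art faces when storing call-recording information in the file name; the display content (such as picture or video display content) can itself record a great deal of information about the call, enriching the content of the call recording. This helps the user recall the specific circumstances of the call recording and so improves the user experience.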

Claims (36)

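  1. A method for saving a recording, applied to a terminal, comprising:
    during recording, recording the audio data and multiple items of information of the recording;
    saving the audio data and the multiple items of information to a file, wherein the multiple items of information are saved in the display content of the file.
  2. The method according to claim 1, wherein:
    saving the audio data and the multiple items of information to a file comprises: saving the audio data and the multiple items of information to the same file.
  3. The method according to claim 2, wherein:
    the file is a picture file;
    saving the audio data and the multiple items of information to the same file comprises: saving the multiple items of information in the display content of the picture file, and filling the audio data into a designated reserved segment of the picture file.
  4. The method according to claim 3, wherein:
    saving the audio data and the multiple items of information to the same file further comprises: adding a recording identifier at a designated position of the designated reserved segment, the recording identifier identifying the picture file as a picture file that stores a recording.
  5. The method according to claim 3, wherein:
    filling the audio data into the designated reserved segment of the picture file comprises: filling the audio data into one designated reserved segment of the picture file; or splitting the audio data into fragments and filling them into multiple designated reserved segments of the picture file.
  6. The method according to claim 3, 4 or 5, wherein:
    the picture file uses a picture format defined by the Joint Photographic Experts Group (JPEG), and the designated reserved segment comprises one or more APP15 segments.
  7. The method according to claim 2, wherein:
    the file is a video file;
    saving the audio data and the multiple items of information to the same file comprises: saving the multiple items of information in the display content of one or more frames of images, and splicing the frame or frames with the audio data to generate the video file.
  8. The method according to claim 1, wherein:
    saving the audio data and the multiple items of information to a file comprises: saving the audio data and the multiple items of information to multiple files, the file names of the multiple files being at least partly the same.
  9. The method according to any one of claims 1 to 5, 7 and 8, wherein:
    the multiple items of information include at least two of the following: call start time, call end time, calling number, called number, call duration, calling location, called location, calling contact, called contact, and all or part of the call content obtained by speech recognition.
  10. The method according to any one of claims 1 to 5, 7 and 8, wherein:
    the recording is a call recording.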
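  11. A method for displaying and playing a recording in picture form, applied to a terminal, comprising:
    after receiving an instruction to open a picture file, parsing the content of the picture file;
    if the picture file is a picture file that stores a recording, displaying the display content of the picture file and extracting the audio data of the recording from the picture file for playback, wherein the display content contains multiple items of information of the recording.
  12. The method according to claim 11, wherein:
    parsing the content of the picture file comprises: parsing a designated reserved segment of the picture file; if a recording identifier is present at a designated position of the designated reserved segment, the picture file is a picture file that stores a recording;
    extracting the audio data of the recording from the other content of the picture file for playback comprises: extracting the audio data of the recording from the designated reserved segment for playback.
  13. The method according to claim 12, wherein:
    extracting the audio data of the recording from the designated reserved segment for playback comprises:
    if one designated reserved segment has a recording identifier at its designated position, extracting the audio data of the recording from that segment and playing it;
    if multiple designated reserved segments have a recording identifier at their designated positions, extracting the audio data from each of those segments, splicing it into one whole piece of data, and playing it.
  14. The method according to any one of claims 11 to 13, wherein:
    the picture file uses a picture format defined by the Joint Photographic Experts Group (JPEG), and the designated reserved segment comprises one or more APP15 segments.
  15. The method according to any one of claims 11 to 13, wherein:
    one or more of the following operations are performed by the picture parsing library of the terminal's operating system framework layer, or by an application on the terminal:
    parsing the designated reserved segment of the picture file and, if a recording identifier is present at the designated position of the segment data, determining that the picture file is a picture file that stores a recording;
    extracting the audio data of the recording from the designated reserved segment for playback, wherein playing the audio data means passing the audio data to an audio player for playback.
  16. The method according to any one of claims 11 to 13, wherein:
    the multiple items of information of the recording include at least two of the following: call start time, call end time, calling number, called number, call duration, calling location, called location, calling contact, called contact, and all or part of the call content obtained by speech recognition.
  17. The method according to any one of claims 11 to 13, wherein:
    the recording is a call recording.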
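  18. A terminal capable of saving a recording, comprising:
    a recording module, configured to record the audio data and multiple items of information of a recording during recording;
    a recording saving module, configured to save the audio data and the multiple items of information to a file, wherein the multiple items of information are saved in the display content of the file.
  19. The terminal according to claim 18, wherein:
    the recording saving module is configured to save the audio data and the multiple items of information to a file by saving them to the same file.
  20. The terminal according to claim 19, wherein:
    the file is a picture file;
    the recording saving module is configured to save the audio data and the multiple items of information to the same file by saving the multiple items of information in the display content of the picture file and filling the audio data into a designated reserved segment of the picture file.
  21. The terminal according to claim 20, wherein:
    the recording saving module is further configured, when saving the audio data and the multiple items of information to the same file, to add a recording identifier at a designated position of the designated reserved segment, the recording identifier identifying the picture file as a picture file that stores a recording.
  22. The terminal according to claim 20, wherein:
    the recording saving module is configured to fill the audio data into the designated reserved segment of the picture file by: filling the audio data into one designated reserved segment of the picture file; or splitting the audio data into fragments and filling them into multiple designated reserved segments of the picture file.
  23. The terminal according to claim 20, 21 or 22, wherein:
    the picture file uses a picture format defined by the Joint Photographic Experts Group (JPEG), and the designated reserved segment comprises one or more APP15 segments.
  24. The terminal according to claim 19, wherein:
    the file is a video file;
    the recording saving module is configured to save the audio data and the multiple items of information to the same file as follows:
    saving the multiple items of information in the display content of one or more frames of images, and splicing the frame or frames with the audio data to generate the video file.
  25. The terminal according to claim 18, wherein:
    the recording saving module is configured to save the audio data and the multiple items of information as follows:
    saving the audio data and the multiple items of information to multiple files, the file names of the multiple files being at least partly the same.
  26. The terminal according to any one of claims 18 to 22, 24 and 25, wherein:
    the multiple items of information of the recording include at least two of the following: call start time, call end time, calling number, called number, call duration, calling location, called location, calling contact, called contact, and all or part of the call content obtained by speech recognition.
  27. The terminal according to any one of claims 18 to 22, 24 and 25, wherein:
    the recording is a call recording.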
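  28. A terminal capable of playing a recording in picture form, comprising:
    a picture parsing module, configured to parse the content of a picture file after receiving an instruction to open the picture file;
    a picture display and playback module, configured, when the picture parsing module determines that the picture file is a picture file that stores a recording, to display the display content of the picture file and to extract the audio data of the recording from the other content of the picture file for playback, wherein the display content contains multiple items of information of the recording.
  29. The terminal according to claim 28, wherein:
    the picture parsing module is configured to parse the content of the picture file by: parsing a designated reserved segment of the picture file and, if a recording identifier is present at a designated position of the designated reserved segment, determining that the picture file is a picture file that stores a recording;
    the picture display and playback module is configured to extract the audio data of the recording from the other content of the picture file for playback by extracting the audio data of the recording from the designated reserved segment for playback.
  30. The terminal according to claim 29, wherein:
    the picture display and playback module is configured to extract the audio data of the recording from the designated reserved segment for playback as follows:
    if one designated reserved segment has a recording identifier at its designated position, extracting the audio data of the recording from that segment and playing it;
    if multiple designated reserved segments have a recording identifier at their designated positions, extracting the audio data from each of those segments, splicing it into one whole piece of data, and playing it.
  31. The terminal according to any one of claims 28 to 30, wherein:
    the picture file uses a picture format defined by the Joint Photographic Experts Group (JPEG), and the reserved segment comprises an APP15 segment.
  32. The terminal according to any one of claims 28 to 30, wherein:
    the picture parsing module is configured to perform the following processing through the picture parsing library of the operating system framework layer or through an application: parsing the designated reserved segment of the picture file and, if a recording identifier is present at the designated position of the designated reserved segment, determining that the picture file is a picture file that stores a recording; and/or
    the picture display and playback module is configured to perform the following processing through the picture parsing library of the operating system framework layer or through an application: extracting the audio data of the recording from the designated reserved segment for playback, wherein playing the audio data means passing the audio data to an audio player for playback.
  33. The terminal according to any one of claims 28 to 30, wherein:
    the multiple items of information of the recording include at least two of the following: call start time, call end time, calling number, called number, call duration, calling location, called location, calling contact, called contact, and all or part of the call content obtained by speech recognition.
  34. The terminal according to any one of claims 28 to 30, wherein:
    the recording is a call recording.
  35. A computer storage medium storing computer-executable instructions for performing the method according to any one of claims 1 to 10.
  36. A computer storage medium storing computer-executable instructions for performing the method according to any one of claims 11 to 17.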
PCT/CN2015/098960 2015-10-09 2015-12-25 一种保存录音、显示和播放图片形式录音的方法、终端 WO2016184109A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510648464.8A CN106572229A (zh) 2015-10-09 2015-10-09 一种保存录音、显示和播放图片形式录音的方法、终端
CN201510648464.8 2015-10-09

Publications (1)

Publication Number Publication Date
WO2016184109A1 true WO2016184109A1 (zh) 2016-11-24

Family

ID=57319294

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/098960 WO2016184109A1 (zh) 2015-10-09 2015-12-25 一种保存录音、显示和播放图片形式录音的方法、终端

Country Status (2)

Country Link
CN (1) CN106572229A (zh)
WO (1) WO2016184109A1 (zh)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111090757A (zh) * 2019-11-25 2020-05-01 维沃移动通信有限公司 一种多媒体文件显示方法、电子设备和存储介质
CN112019484A (zh) * 2019-05-31 2020-12-01 阿里巴巴集团控股有限公司 获取音源数据的方法及相关设备

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108737894A (zh) * 2018-06-06 2018-11-02 北京酷我科技有限公司 一种由图片合成视频的方法

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1285698A (zh) * 1999-08-20 2001-02-28 三星电子株式会社 在带有摄像头的移动电话中编辑带声音的照片的方法
US20070116199A1 (en) * 2005-11-17 2007-05-24 Juha Arrasvuori Method, mobile device, system and software for establishing an audio note journal
CN101282375A (zh) * 2008-04-07 2008-10-08 中兴通讯股份有限公司 一种基于通话记录映射多媒体文件的方法和移动终端
US20120063573A1 (en) * 2008-12-23 2012-03-15 Rockstar Bidco, LP Accessing recorded conference content
CN104333641A (zh) * 2014-09-26 2015-02-04 小米科技有限责任公司 通话方法及装置
CN105049582A (zh) * 2015-07-28 2015-11-11 努比亚技术有限公司 一种通话录音的保存装置、方法和显示方法

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101656814A (zh) * 2008-08-18 2010-02-24 爱思开电讯投资(中国)有限公司 用于将声音文件添加到jpeg文件中的方法及装置
CN102609968B (zh) * 2012-03-05 2015-06-24 深圳市优利麦克科技开发有限公司 实现有声图片的方法及系统
CN103327277A (zh) * 2013-07-05 2013-09-25 成都品果科技有限公司 留声照片生成方法及图片数据与声音数据合并存储方法
CN104751868B (zh) * 2013-12-31 2018-12-11 海能达通信股份有限公司 语音录制方法、通话录音回放方法以及相关装置和系统

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1285698A (zh) * 1999-08-20 2001-02-28 三星电子株式会社 在带有摄像头的移动电话中编辑带声音的照片的方法
US20070116199A1 (en) * 2005-11-17 2007-05-24 Juha Arrasvuori Method, mobile device, system and software for establishing an audio note journal
CN101282375A (zh) * 2008-04-07 2008-10-08 中兴通讯股份有限公司 一种基于通话记录映射多媒体文件的方法和移动终端
US20120063573A1 (en) * 2008-12-23 2012-03-15 Rockstar Bidco, LP Accessing recorded conference content
CN104333641A (zh) * 2014-09-26 2015-02-04 小米科技有限责任公司 通话方法及装置
CN105049582A (zh) * 2015-07-28 2015-11-11 努比亚技术有限公司 一种通话录音的保存装置、方法和显示方法

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112019484A (zh) * 2019-05-31 2020-12-01 阿里巴巴集团控股有限公司 获取音源数据的方法及相关设备
CN112019484B (zh) * 2019-05-31 2024-06-07 阿里巴巴集团控股有限公司 获取音源数据的方法及相关设备
CN111090757A (zh) * 2019-11-25 2020-05-01 维沃移动通信有限公司 一种多媒体文件显示方法、电子设备和存储介质

Also Published As

Publication number Publication date
CN106572229A (zh) 2017-04-19

Similar Documents

Publication Publication Date Title
US10939069B2 (en) Video recording method, electronic device and storage medium
US6944629B1 (en) Method and device for managing multimedia file
WO2017148442A1 (zh) 一种音视频处理方法和装置、计算机存储介质
CN111899322B (zh) 视频处理方法、动画渲染sdk和设备及计算机存储介质
CN106792152B (zh) 一种视频合成方法及终端
WO2017157276A1 (zh) 多媒体文件的拼接方法和装置
CN106507200B (zh) 视频播放内容插入方法和系统
WO2016184109A1 (zh) 一种保存录音、显示和播放图片形式录音的方法、终端
CN101106770A (zh) 一种手机上制作带背景音乐的拍照动画的方法
KR100604831B1 (ko) 오디오에 부가 영상과 문자를 동기시켜 재생하는오디오/비디오 재생 장치 및 그 방법
CN104219555A (zh) 一种安卓系统终端中的视频显示装置和方法
US9083786B2 (en) Electronic device for identifying a party
WO2015018119A1 (zh) 一种多媒体文件生成的方法及多媒体设备
CN111913641A (zh) 一种实现图片语音化的方法和系统
RU2006113931A (ru) Устройство и способ отображения мультимедийных данных, объединенных с текстовыми данными, и носитель записи, содержащий программу для выполнения этого способа
JP6871388B2 (ja) オーディオまたはビデオ内のカット間タイムバケットを決定するための方法および装置
CN108966000B (zh) 播放方法及其装置、介质、终端
KR100389851B1 (ko) 메뉴화면을위한섬네일영상을작성하기에적합한디스크기록매체
WO2018076899A1 (zh) 一种数据切换方法、装置、终端及计算机可读存储介质
MXPA04010659A (es) Aparato de reproduccion de imagen en movimiento en el que se coloca la informacion en el modo de reproductor, metodo de reproduccion que utiliza el mismo y medio de almacenamiento.
JP2008035535A (ja) 対話型光ディスクのアニメーションデータ管理方法及び装置
JP2005204338A (ja) 携帯電話や携行端末における静止画漫画の再生方法
US20230229689A1 (en) Media file generation apparatus, media file playback apparatus, media file generation method, media file playback method, program, and storage medium
TW587213B (en) Playing method of video and audio
JP6129085B2 (ja) 放送受信装置、番組内容確認用データ作成処理装置、及び番組録画装置

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15892483

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15892483

Country of ref document: EP

Kind code of ref document: A1