CN109325219A

CN109325219A - Method, apparatus, and system for generating a record document

Info

Publication number: CN109325219A
Application number: CN201810970787.2A
Authority: CN
Inventors: 黄滨
Original assignee: Vivo Mobile Communication Co Ltd
Current assignee: Vivo Mobile Communication Co Ltd
Priority date: 2018-08-24
Filing date: 2018-08-24
Publication date: 2019-02-12
Anticipated expiration: 2038-08-24
Also published as: CN109325219B

Abstract

Embodiments of the present invention provide a method, apparatus, and system for generating a record document. They relate to the field of terminal technology and address the problem that the process of obtaining a record document is cumbersome. The method comprises: obtaining M target images and M target audios from an audio-visual acquisition device, where each target audio is the audio that the audio-visual acquisition device obtains by recording ambient sound during a target time period, and the target time period runs from the moment the audio-visual acquisition device performs the current image acquisition to the moment it performs the next image acquisition or stops recording ambient sound; synthesizing the M target images with the M target audios respectively to generate M target documents; and generating a record document from the M target documents, where M is an integer greater than 1. Embodiments of the present invention are used to generate record documents.

Description

Method, apparatus, and system for generating a record document

Technical field

The present invention relates to the field of terminal technology, and in particular to a method, apparatus, and system for generating a record document.

Background art

With the development of terminal technology, terminal devices such as mobile phones are used more and more widely. Using the photographing and sound-recording functions of a terminal device to make records in scenarios such as classes, lectures, and meetings has become an important use of terminal devices.

In scenarios such as classes, lectures, and meetings, PPT presentations and large amounts of blackboard writing often require images to be captured frequently as a record. Currently, a user obtains a record document in such cases as follows: the terminal device records the ambient sound in real time, and the user captures an image with the terminal device each time a PPT slide changes or a piece of blackboard writing is completed; after the class, lecture, or meeting ends, the user lays out the recorded audio and the captured images to generate the corresponding record document. In this process, the user not only has to capture images frequently during the class, lecture, or meeting, but afterwards also has to do a great deal of work to integrate and lay out the captured images and the recorded audio. The process of obtaining a record document is therefore cumbersome.

Summary of the invention

Embodiments of the present invention provide a method, apparatus, and system for generating a record document, to solve the problem that the process of obtaining a record document is cumbersome.

To solve the above technical problem, the present invention is implemented as follows:

In a first aspect, an embodiment of the present invention provides a method for generating a record document, applied to a terminal device, the method comprising:

obtaining M target images and M target audios from an audio-visual acquisition device, wherein each target audio is the audio obtained by the audio-visual acquisition device recording ambient sound during a target time period, and the target time period is the period from the moment the audio-visual acquisition device performs the current image acquisition to the moment the audio-visual acquisition device performs the next image acquisition or stops recording ambient sound;

synthesizing the M target images with the M target audios respectively to generate M target documents, and generating a record document from the M target documents;

wherein M is an integer greater than 1.

In a second aspect, an embodiment of the present invention provides a terminal device, comprising:

an acquiring unit, configured to obtain M target images and M target audios from an audio-visual acquisition device, wherein each target audio is the audio obtained by the audio-visual acquisition device recording ambient sound during a target time period, and the target time period is the period from the moment the audio-visual acquisition device performs the current image acquisition to the moment it performs the next image acquisition or stops recording ambient sound;

a generation unit, configured to synthesize the M target images with the M target audios respectively to generate M target documents, and to generate a record document from the M target documents;

wherein M is an integer greater than 1.

In a third aspect, an embodiment of the present invention provides a terminal device comprising a processor, a memory, and a computer program stored in the memory and executable on the processor, wherein the computer program, when executed by the processor, implements the steps of the method for generating a record document according to the first aspect.

In a fourth aspect, an embodiment of the present invention provides a system for generating a record document, comprising an audio-visual acquisition device and at least one terminal device according to the second aspect.

In a fifth aspect, an embodiment of the present invention provides a computer-readable storage medium storing a computer program, wherein the computer program, when executed by a processor, implements the steps of the method for generating a record document according to the first aspect.

The method for generating a record document provided by the embodiments of the present invention first obtains, from an audio-visual acquisition device, M target images and M target audios, each target audio being obtained by recording ambient sound during the period from the moment the audio-visual acquisition device performs the current image acquisition to the moment it performs the next image acquisition or stops recording ambient sound. The method then synthesizes the M target images with the M target audios respectively to generate M target documents, and finally generates a record document from the M target documents. Because the method obtains, together with each target image, the target audio recorded between the moment that target image is acquired and the moment of the next image acquisition or the moment recording stops, and synthesizes that target image and target audio into a target document, a record document can be obtained simply by integrating the target documents in the order in which the target images were acquired. In the prior art, by contrast, all target images and the entire recorded audio are obtained first and then have to be integrated with one another. The embodiments of the present invention therefore reduce the workload of obtaining a record document and solve the problem that the process of obtaining a record document is cumbersome.

Brief description of the drawings

Fig. 1 is an architecture diagram of an Android operating system provided by an embodiment of the present invention;

Fig. 2 is a first flowchart of the steps of the method for generating a record document provided by an embodiment of the present invention;

Fig. 3 is a second flowchart of the steps of the method for generating a record document provided by an embodiment of the present invention;

Fig. 4 is a first schematic diagram of an application scenario of the method for generating a record document provided by an embodiment of the present invention;

Fig. 5 is a second schematic diagram of an application scenario of the method for generating a record document provided by an embodiment of the present invention;

Fig. 6 is a schematic structural diagram of the system for generating a record document provided by an embodiment of the present invention;

Fig. 7 is a schematic diagram of a terminal device provided by an embodiment of the present invention;

Fig. 8 is a schematic diagram of the hardware structure of a terminal device provided by an embodiment of the present invention.

Detailed description of the embodiments

The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings in the embodiments of the present invention. The described embodiments are evidently some, rather than all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative effort shall fall within the protection scope of the present invention.

The term "and/or" herein merely describes an association between associated objects and indicates that three relationships may exist. For example, "A and/or B" may indicate three cases: A exists alone, both A and B exist, and B exists alone.

The terms "first", "second", and the like in the description and claims of the present invention are used to distinguish between different objects, not to describe a particular order of the objects. For example, a first interface and a second interface are used to distinguish between different interfaces, not to describe a particular order of the interfaces.

In the embodiments of the present invention, words such as "exemplary" or "for example" are used to indicate an example, illustration, or explanation. Any embodiment or design described as "exemplary" or "for example" in the embodiments of the present invention should not be construed as preferred over, or more advantageous than, other embodiments or designs. Rather, the use of such words is intended to present a related concept in a concrete manner. In addition, unless otherwise specified, "a plurality of" in the description of the embodiments of the present invention means two or more.

In the prior-art process of obtaining a record document, the user not only has to capture images frequently during a class, lecture, or meeting, but afterwards also has to do a great deal of work to integrate the captured images and the recorded audio. The process of obtaining a record document is therefore cumbersome.

To solve this problem, the embodiments of the present invention provide a method, apparatus, and system for generating a record document. The method first obtains, from an audio-visual acquisition device, M target images and M target audios, each target audio being obtained by recording ambient sound during the period from the moment the audio-visual acquisition device performs the current image acquisition to the moment it performs the next image acquisition or stops recording ambient sound. The method then synthesizes the M target images with the M target audios respectively to generate M target documents, and finally generates a record document from the M target documents. Because the method obtains, together with each target image, the target audio recorded between the moment that target image is acquired and the moment of the next image acquisition or the moment recording stops, and synthesizes that target image and target audio into a target document, a record document can be obtained simply by integrating the target documents in the order in which the target images were acquired. In the prior art, by contrast, all target images and the entire recorded audio are obtained first and then have to be integrated with one another. The embodiments of the present invention therefore reduce the workload of obtaining a record document and solve the problem that the process of obtaining a record document is cumbersome.

The terminal device in the embodiments of the present invention may be a terminal device with an operating system. The operating system may be an Android operating system, an iOS operating system, or another possible operating system; the embodiments of the present invention impose no specific limitation.

Taking the Android operating system as an example, the following introduces the software environment to which the method for generating a record document provided by the embodiments of the present invention is applied.

Fig. 1 is a schematic architecture diagram of a possible Android operating system provided by an embodiment of the present invention. In Fig. 1, the architecture of the Android operating system comprises four layers: an application layer, an application framework layer, a system runtime library layer, and a kernel layer (which may specifically be a Linux kernel layer).

The application layer comprises the applications in the Android operating system (including system applications and target applications).

The application framework layer is the framework for applications. Developers can develop applications based on the application framework layer, provided that they follow the development principles of the application framework.

The system runtime library layer comprises libraries (also called system libraries) and the Android runtime environment. The libraries mainly provide the various resources that the Android operating system needs. The Android runtime environment provides the software environment for the Android operating system.

The kernel layer is the operating system layer of the Android operating system and is the lowest layer of the Android software stack. Based on the Linux kernel, the kernel layer provides core system services and hardware-related drivers for the Android operating system.

Taking the Android operating system as an example, in the embodiments of the present invention a developer can, based on the system architecture of the Android operating system shown in Fig. 1, develop a software program that implements the method for generating a record document provided by the embodiments of the present invention, so that the method can run on the Android operating system shown in Fig. 1. That is, a processor or terminal device can implement the method for generating a record document provided by the embodiments of the present invention by running the software program in the Android operating system.

The terminal device in the embodiments of the present invention may be a mobile terminal or a non-mobile terminal. The mobile terminal may be a mobile phone, tablet computer, notebook computer, palmtop computer, in-vehicle terminal, wearable device, ultra-mobile personal computer (UMPC), netbook, personal digital assistant (PDA), or the like; the non-mobile terminal may be a personal computer (PC), television (TV), automated teller machine, self-service machine, or the like. The embodiments of the present invention impose no specific limitation.

Embodiment one

This embodiment of the present invention provides a method for generating a record document. Referring to Fig. 2, the method comprises:

S11. Obtain M target images and M target audios from an audio-visual acquisition device.

Each target audio is the audio obtained by the audio-visual acquisition device recording ambient sound during a target time period. The target time period is the period from the moment the audio-visual acquisition device performs the current image acquisition to the moment the audio-visual acquisition device performs the next image acquisition or stops recording ambient sound. M is an integer greater than 1.

For example, suppose three target images and three target audios are obtained from the audio-visual acquisition device. The target images obtained in sequence are a first target image, a second target image, and a third target image, acquired at a first moment, a second moment, and a third moment respectively. The three target audios obtained in sequence are then: the audio obtained by recording the ambient sound in the period from the first moment to the second moment (the first target audio); the audio obtained by recording the ambient sound in the period from the second moment to the third moment (the second target audio); and the audio obtained by recording the ambient sound in the period from the third moment to the moment at which recording of ambient sound stops (the third target audio). Obtaining M target images and M target audios from the audio-visual acquisition device in step S11 therefore comprises obtaining the first target image with the first target audio, the second target image with the second target audio, and the third target image with the third target audio.
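The period boundaries in the example above can be sketched as follows. This is a minimal illustration, not the patented implementation: the function name `target_time_periods`, the use of plain seconds as timestamps, and the sample values are all assumptions introduced here.

```python
from typing import List, Tuple


def target_time_periods(capture_times: List[float],
                        stop_time: float) -> List[Tuple[float, float]]:
    """Return one (start, end) target time period per image acquisition.

    Hypothetical helper: timestamps are seconds since recording began.
    """
    if not capture_times:
        return []
    times = sorted(capture_times)
    if stop_time < times[-1]:
        raise ValueError("stop_time must not precede the last image acquisition")
    # Each period ends at the next image acquisition; the last period ends
    # at the moment recording of ambient sound stops.
    ends = times[1:] + [stop_time]
    return list(zip(times, ends))


# Three acquisitions (first, second, third moment), then recording stops.
periods = target_time_periods([0.0, 42.5, 95.0], stop_time=130.0)
```

With three acquisition moments the helper yields three periods, so each target image has exactly one target audio, matching the first, second, and third target audios described above.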

S12. Synthesize the M target images with the M target audios respectively to generate M target documents.

It should be noted that, when the M target images are synthesized with the M target audios respectively in step S12, each target image must be synthesized with its corresponding target audio. Continuing the example above: three target images and three target audios are obtained from the audio-visual acquisition device; the target images are acquired in the order first target image, second target image, third target image, and the target audios are recorded in the order first target audio, second target audio, third target audio. The target audio corresponding to the first target image is therefore the first target audio, the one corresponding to the second target image is the second target audio, and the one corresponding to the third target image is the third target audio. Accordingly, the first target image and the first target audio are synthesized into one target document, the second target image and the second target audio into another, and the third target image and the third target audio into a third.

In addition, the embodiments of the present invention do not limit the manner in which a target image and a target audio are synthesized, nor the document type of the target document generated by the synthesis, provided that the target image and the target audio can be synthesized into a target document.
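The one-to-one pairing required in step S12 can be sketched as follows. Since the source leaves the synthesis manner and document type open, the `TargetDocument` container and the `synthesize` function below are hypothetical stand-ins that simply bundle each image with the audio at the same position.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class TargetDocument:
    """Hypothetical container: one target image plus its target audio."""
    index: int      # 1-based acquisition order
    image: bytes
    audio: bytes


def synthesize(images: List[bytes], audios: List[bytes]) -> List[TargetDocument]:
    """Pair the i-th target image with the i-th target audio."""
    if len(images) != len(audios):
        raise ValueError("each target image needs exactly one target audio")
    if len(images) < 2:
        raise ValueError("M must be an integer greater than 1")
    return [
        TargetDocument(index=i, image=img, audio=aud)
        for i, (img, aud) in enumerate(zip(images, audios), start=1)
    ]


docs = synthesize([b"img1", b"img2", b"img3"], [b"aud1", b"aud2", b"aud3"])
```

A real implementation might instead produce a slide with embedded audio or a video segment; the sketch only shows that the pairing follows acquisition order.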

S13. Generate a record document from the M target documents.

For example, generating a record document from the M target documents may specifically be: sorting the M target documents in the order in which they were generated to produce the record document, or sorting the M target documents in the order in which the target images used to synthesize them were acquired to produce the record document.

Continuing the example above: after the first target image and the first target audio are synthesized into a first target document, the second target image and the second target audio into a second target document, and the third target image and the third target audio into a third target document, the record document is generated by arranging the first, second, and third target documents in that order.
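The ordering in step S13 can be sketched as follows, assuming each target document carries the acquisition time of its target image. `TimedDocument` and `build_record_document` are hypothetical names, and the record document is modeled simply as an ordered list of document names.

```python
from typing import List, NamedTuple


class TimedDocument(NamedTuple):
    """Hypothetical target document tagged with its image acquisition time."""
    capture_time: float
    name: str


def build_record_document(docs: List[TimedDocument]) -> List[str]:
    """Order the target documents by the acquisition time of their images."""
    return [d.name for d in sorted(docs, key=lambda d: d.capture_time)]


record = build_record_document([
    TimedDocument(95.0, "third target document"),
    TimedDocument(0.0, "first target document"),
    TimedDocument(42.5, "second target document"),
])
```

Sorting by generation order instead would use the same shape with a generation timestamp as the key.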

Optionally, referring to Fig. 3, before step S12 (synthesizing the M target images with the M target audios respectively to generate M target documents), the method for generating a record document provided by the embodiments of the present invention further comprises:

S31. Display the M target images on M display interfaces respectively.

Optionally, after a target image is displayed on any display interface, the method may further comprise: adjusting the position and size of the target image in the display interface according to a user operation.

For example, referring to Fig. 4, the display interface may include a target image 41 and a note receiving area 42 for receiving notes.

S32. Receive notes input by the user on at least one of the M display interfaces.

S33. Generate M target interfaces from the M display interfaces and the notes input by the user.

For example, referring to Fig. 5, the first display interface includes the target image 41 and target notes 50 input by the user in the note receiving area 42.

In this case, step S12 (synthesizing the M target images with the M target audios respectively to generate M target documents) is specifically: synthesizing the M target interfaces with the M target audios respectively to generate M target documents.

In the above embodiment, the target image is further displayed in a display interface, notes input by the user in the display interface are received, a target interface is generated from the display interface and the notes, and finally the target interfaces are synthesized with the target audios to generate the target documents. The user's notes can thus be added to the record document, which further enriches the content of the record document and improves the user experience.
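Steps S31 to S33 can be sketched as follows. The `TargetInterface` type and the dictionary of notes keyed by display-interface index are assumptions made for illustration; the source only requires that notes may be entered on at least one of the M display interfaces.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class TargetInterface:
    """Hypothetical result of S33: a displayed target image plus any notes."""
    image: str
    notes: str = ""


def build_target_interfaces(images: List[str],
                            notes_by_index: Dict[int, str]) -> List[TargetInterface]:
    """Create one target interface per display interface.

    Interfaces without user notes keep an empty notes field.
    """
    return [
        TargetInterface(image=img, notes=notes_by_index.get(i, ""))
        for i, img in enumerate(images)
    ]


# The user writes notes only on the first display interface (index 0).
interfaces = build_target_interfaces(
    ["slide-1.png", "slide-2.png", "slide-3.png"],
    {0: "Key formula is on this slide"},
)
```

The resulting target interfaces then take the place of the bare target images when step S12 pairs them with the target audios.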

The implementation of step S11 (obtaining M target images and M target audios from the audio-visual acquisition device) in the above embodiment is described in detail below.

Step S11 (obtaining M target images and M target audios from the audio-visual acquisition device) may specifically be:

controlling the audio-visual acquisition device to perform M image acquisitions in a target manner to obtain the M target images, and to record ambient sound in M target time periods respectively to obtain the M target audios;

receiving the M target images and the M target audios sent by the audio-visual acquisition device.

It should be noted that, referring to Fig. 6, the device that acquires the target images and records the target audios (the audio-visual acquisition device) and the device that synthesizes the target images and target audios into target documents (the terminal device) are different devices. To save hardware resources, the audio-visual acquisition device 61 can therefore serve multiple terminal devices (621, 622, ..., 62n). Wired or wireless connections are established between the terminal devices and the audio-visual acquisition device 61, and the audio-visual acquisition device 61 sends the target images and target audios to each terminal device over the established connections.

Because the device that acquires the target images and records the target audios (the audio-visual acquisition device) and the device that synthesizes them into target documents (the terminal device) are different devices, the user does not need to frequently acquire target images and record target audios with the terminal device, which further simplifies the process of obtaining a record document.
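The one-to-many arrangement of Fig. 6 can be sketched as follows. The classes `AcquisitionDevice` and `Terminal` and their methods are hypothetical; a real system would send data over a wired or wireless link rather than through in-process method calls.

```python
from typing import List, Tuple


class Terminal:
    """Hypothetical terminal device that stores what it receives."""

    def __init__(self, name: str) -> None:
        self.name = name
        self.received: List[Tuple[str, str]] = []

    def receive(self, image: str, audio: str) -> None:
        self.received.append((image, audio))


class AcquisitionDevice:
    """Hypothetical audio-visual acquisition device serving many terminals."""

    def __init__(self) -> None:
        self._terminals: List[Terminal] = []

    def connect(self, terminal: Terminal) -> None:
        # Stands in for establishing a wired or wireless connection.
        self._terminals.append(terminal)

    def send(self, image: str, audio: str) -> None:
        # Every connected terminal gets the same target image and audio.
        for terminal in self._terminals:
            terminal.receive(image, audio)


device = AcquisitionDevice()
phone, tablet = Terminal("621"), Terminal("622")
device.connect(phone)
device.connect(tablet)
device.send("image-1", "audio-1")
```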

Optionally, controlling the audio-visual acquisition device to perform image acquisition in a target manner to obtain the M target images in the above embodiment specifically includes the following three implementations:

Implementation one

Controlling the audio-visual acquisition device to perform M image acquisitions in a target manner to obtain the M target images comprises:

controlling the audio-visual acquisition device to obtain a preview image of a preset region in real time and, when the amount of change between two adjacent preview frames is greater than a change threshold, to perform one image acquisition on the preset region to obtain one target image.

That is, the region on which image acquisition is to be performed is set, and the audio-visual acquisition device obtains a preview image of that region in real time. When the content of the region changes, the audio-visual acquisition device performs one image acquisition to obtain one target image.
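Implementation one can be sketched as follows. The source does not say how the amount of change is measured, so the mean absolute pixel difference used here is an assumption, as are the names `change_amount` and `capture_indices` and the modeling of a preview frame as a flat sequence of grayscale values.

```python
from typing import List, Sequence

Frame = Sequence[int]  # hypothetical: flat grayscale pixel values, 0-255


def change_amount(prev: Frame, cur: Frame) -> float:
    """Mean absolute pixel difference between two preview frames (assumed metric)."""
    if len(prev) != len(cur):
        raise ValueError("preview frames must have the same size")
    if not prev:
        return 0.0
    return sum(abs(a - b) for a, b in zip(prev, cur)) / len(prev)


def capture_indices(frames: Sequence[Frame], threshold: float) -> List[int]:
    """Return the indices of frames at which an image acquisition is triggered."""
    captured = []
    for i in range(1, len(frames)):
        # Trigger only when adjacent frames differ by more than the threshold.
        if change_amount(frames[i - 1], frames[i]) > threshold:
            captured.append(i)
    return captured


preview = [
    [0, 0, 0, 0],          # blank board
    [0, 0, 0, 1],          # sensor noise only
    [200, 200, 200, 200],  # new slide shown
    [200, 200, 200, 200],  # unchanged
]
triggered = capture_indices(preview, threshold=10.0)
```

The threshold keeps small fluctuations such as sensor noise from triggering an acquisition, while a slide change does.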

Implementation two

Controlling the audio-visual acquisition device to perform one image acquisition on a preset region at a preset moment to obtain one target image.

For example, the audio-visual acquisition device is controlled to perform image acquisition on the preset region periodically, at intervals of a preset length of time, to obtain the target images.
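The periodic variant of implementation two can be sketched as follows. The function name `periodic_capture_times` is hypothetical, and the choice to acquire an image at the start moment itself is an assumption the source does not state.

```python
from typing import List


def periodic_capture_times(start: float, stop: float, interval: float) -> List[float]:
    """List the preset acquisition moments in [start, stop), one per interval."""
    if interval <= 0:
        raise ValueError("interval must be positive")
    times = []
    k = 0
    # Multiply instead of accumulating to avoid drift from repeated addition.
    while start + k * interval < stop:
        times.append(start + k * interval)
        k += 1
    return times


moments = periodic_capture_times(start=0.0, stop=100.0, interval=30.0)
```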

Implementation three

Receiving a first operation input by the user;

in response to the first operation, controlling the audio-visual acquisition device to perform one image acquisition on a preset region to obtain one target image.

That is, the user manually triggers the capture of an image of the specified region.

Optionally, when the audio-visual acquisition device is started, prompt information may be sent to each terminal device connected to the audio-visual acquisition device to prompt the user to select a target image acquisition mode. A selection operation input by the user on the terminal device is then received, the selection operation being a selection of any one of implementation one, implementation two, and implementation three above. In response to the selection operation, the target image acquisition mode is determined, and target images are acquired according to the determined mode.
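Resolving the user's selection to one of the three implementations can be sketched as follows. The `CaptureMode` enumeration, its member names, and the integer encoding of the selection are assumptions introduced for illustration.

```python
from enum import Enum


class CaptureMode(Enum):
    """Hypothetical encoding of the three acquisition implementations."""
    ON_CHANGE = 1   # implementation one: preview change exceeds a threshold
    PERIODIC = 2    # implementation two: preset moments or intervals
    MANUAL = 3      # implementation three: first operation input by the user


def select_mode(choice: int) -> CaptureMode:
    """Map the user's selection operation to a target image acquisition mode."""
    try:
        return CaptureMode(choice)
    except ValueError:
        raise ValueError(f"unknown acquisition mode: {choice}") from None


mode = select_mode(2)
```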

It should be noted that, when multiple users each select a target image acquisition mode through a selection operation, the modes they select may differ. For example, user A selects implementation one, user B selects implementation two, the regions on which user A and user B want target images acquired are different, and the moments at which user A and user B need a target image acquired coincide. The audio-visual acquisition device is then required to perform image acquisition on user A's region and on user B's region at the same time, but in some cases the audio-visual acquisition device cannot perform image acquisition on two regions simultaneously. Because acquiring an image usually takes very little time, the audio-visual acquisition device can in this case perform image acquisition on user A's region and then on user B's region in sequence, send the target image acquired from user A's region to user A, and send the target image acquired from user B's region to user B. Alternatively, user A and user B can be prompted to modify their target image acquisition settings (including the region on which target image acquisition is performed and/or the moment at which it is performed), so that the audio-visual acquisition device does not need to perform image acquisition on two regions at the same time.
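The sequential handling of coinciding requests can be sketched as follows. The function `serve_simultaneous_requests` and the string that stands in for a captured image are hypothetical; the sketch only shows that regions are handled one after another and that each result is routed back to the user who requested it.

```python
from typing import Dict, List, Tuple

Request = Tuple[str, str]  # hypothetical: (user, region to capture)


def serve_simultaneous_requests(requests: List[Request]) -> Dict[str, str]:
    """Capture each requested region in turn and route the image to its user."""
    delivered: Dict[str, str] = {}
    for user, region in requests:
        # Placeholder for one image acquisition on this region.
        image = f"image-of-{region}"
        delivered[user] = image
    return delivered


# User A and user B need different regions captured at the same moment.
result = serve_simultaneous_requests([
    ("user A", "projector screen"),
    ("user B", "blackboard"),
])
```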

Embodiment two,

The embodiment of the present invention provides a kind of terminal device, specifically, referring to shown in Fig. 7, which includes:

Acquiring unit 71, it is described for obtaining M target image and M target audio from audio-visual acquisition equipment Target audio is the audio that the audio-visual acquisition equipment records ambient sound acquisition in the target time period, and the target time section is From the audio-visual acquisition equipment this execute Image Acquisition at the time of to next time execute Image Acquisition at the time of or stop record Period between at the time of ambient sound；

Generation unit 72 synthesizes the M target image with the M target audio respectively, generates M target Document generates recording documents according to the M destination document；

Wherein, M is the integer greater than 1.

Optionally, referring to shown in Fig. 7, the terminal device 700 further include: display unit 73 and receiving unit 74；

The display unit, for showing the M target image on M display interface respectively；

The receiving unit is inputted at least one display interface of the M display interface for receiving user Notes；

The generation unit, specifically for generating M target according to the notes of the M display interface and user's input Interface, and the M target interface is synthesized with the M target audio respectively, generate M destination document.

Optionally, the acquiring unit 71 carries out image specifically for controlling the audio-visual acquisition device in a targeted way Acquisition obtains the M target image, and records ambient sound in M target time section and obtain the M target audio, receives The M target image and the M target audio that the audio-visual acquisition device is sent.

Optionally, the acquiring unit 71 obtains predeterminable area specifically for controlling the audio-visual acquisition device in real time Preview image, and when the variable quantity of adjacent two frames preview image is greater than variable quantity threshold value, the predeterminable area is carried out primary Image Acquisition obtains a target image.

Optionally, the acquiring unit 71 is specifically used for controlling the audio-visual acquisition device in predetermined time to preset areas Domain carries out an Image Acquisition and obtains a target image.

Optionally, the acquiring unit 71 is specifically configured to receive a first operation input by a user and, in response to the first operation, to control the audio-visual acquisition device to perform one image acquisition on the preset area to obtain one target image.

The terminal device provided in the embodiment of the present invention first obtains M target images from the audio-visual acquisition equipment, together with M target audios, each obtained by recording ambient sound in the period from the moment at which the audio-visual acquisition equipment performs the current image acquisition to the moment at which it performs the next image acquisition or stops recording ambient sound. It then synthesizes the M target images with the M target audios respectively to generate M target documents, and finally generates a recording document according to the M target documents. Because the terminal device, when obtaining a target image, also obtains the target audio recorded between the moment that image is acquired and the moment of the next image acquisition or the moment at which recording of ambient sound stops, and synthesizes the target image with that target audio into a target document, the embodiment of the present invention only needs to integrate the target documents in the order in which they were obtained to get the recording document. In the prior art, by contrast, all target images and the entire recorded audio are obtained first, and then all target images and the entire recorded audio have to be integrated with each other. The embodiment of the present invention can therefore reduce the workload of obtaining a recording document, and in turn solves the problem that the process of obtaining a recording document is very cumbersome.

Fig. 8 is a schematic diagram of the hardware structure of a terminal device implementing the embodiments of the present invention. As shown in Fig. 8, the terminal device 800 includes, but is not limited to, a radio frequency unit 101, a network module 102, an audio output unit 103, an input unit 104, a sensor 105, a display unit 106, a user input unit 107, an interface unit 108, a memory 109, a processor 110, and a power supply 111. Those skilled in the art will understand that the terminal device structure shown in Fig. 8 does not limit the terminal device; the terminal device may include more or fewer components than shown, combine certain components, or arrange the components differently. In the embodiments of the present invention, terminal devices include, but are not limited to, mobile phones, tablet computers, laptops, palmtop computers, vehicle-mounted terminals, wearable devices, pedometers, and the like.

The radio frequency unit 101 or the interface unit 108 is used to obtain M target images and M target audios from an audio-visual acquisition equipment, where each target audio is the audio obtained by the audio-visual acquisition equipment recording ambient sound within a target time period, and the target time period is the period from the moment at which the audio-visual acquisition equipment performs the current image acquisition to the moment at which it performs the next image acquisition or stops recording ambient sound;

The processor 110 is used to synthesize the M target images with the M target audios respectively to generate M target documents, and to generate a recording document according to the M target documents;

where M is an integer greater than 1.

It should be understood that, in the embodiment of the present invention, the radio frequency unit 101 can be used to receive and send signals while sending and receiving information or during a call. Specifically, it receives downlink data from a base station and passes the data to the processor 110 for processing; it also sends uplink data to the base station. Generally, the radio frequency unit 101 includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a low-noise amplifier, a duplexer, and the like. In addition, the radio frequency unit 101 can also communicate with networks and other devices through a wireless communication system.

The terminal device provides the user with wireless broadband Internet access through the network module 102, for example helping the user send and receive e-mail, browse web pages, and access streaming media.

The audio output unit 103 can convert audio data received by the radio frequency unit 101 or the network module 102, or stored in the memory 109, into an audio signal and output it as sound. The audio output unit 103 can also provide audio output related to a specific function performed by the terminal device (for example, a call signal reception sound or a message reception sound). The audio output unit 103 includes a loudspeaker, a buzzer, a receiver, and the like.

The input unit 104 is used to receive audio or video signals. The input unit 104 may include a graphics processor (Graphics Processing Unit, GPU) 1041 and a microphone 1042. The graphics processor 1041 processes image data of still pictures or video obtained by an image capture apparatus (such as a camera) in video capture mode or image capture mode. The processed image frames may be displayed on the display unit 106. Image frames processed by the graphics processor 1041 may be stored in the memory 109 (or another storage medium) or sent via the radio frequency unit 101 or the network module 102. The microphone 1042 can receive sound and process it into audio data. In telephone call mode, the processed audio data can be converted into a format that can be sent to a mobile communication base station via the radio frequency unit 101 and then output.

The terminal device further includes at least one sensor 105, such as an optical sensor, a motion sensor, and other sensors. Specifically, the optical sensor includes an ambient light sensor and a proximity sensor, where the ambient light sensor can adjust the brightness of the display panel 1061 according to the brightness of the ambient light, and the proximity sensor can turn off the display panel 1061 and/or the backlight when the terminal device is moved to the ear. As one kind of motion sensor, an accelerometer can detect the magnitude of acceleration in all directions (generally three axes), can detect the magnitude and direction of gravity when stationary, and can be used for identifying the posture of the terminal device (such as switching between landscape and portrait, related games, and magnetometer attitude calibration) and for vibration-recognition functions (such as a pedometer or tap detection). The sensor 105 may also include a fingerprint sensor, a pressure sensor, an iris sensor, a molecular sensor, a gyroscope, a barometer, a hygrometer, a thermometer, an infrared sensor, and the like, which are not described in detail here.

The display unit 106 is used to display information input by the user or information provided to the user. The display unit 106 may include a display panel 1061, which may be configured in the form of a liquid crystal display (Liquid Crystal Display, LCD), an organic light-emitting diode (Organic Light-Emitting Diode, OLED) display, or the like.

The user input unit 107 can be used to receive input numeric or character information and to generate key signal inputs related to user settings and function control of the terminal device. Specifically, the user input unit 107 includes a touch panel 1071 and other input devices 1072. The touch panel 1071, also called a touch screen, collects touch operations performed by the user on or near it (for example, operations performed by the user on or near the touch panel 1071 with a finger, a stylus, or any other suitable object or accessory). The touch panel 1071 may include two parts: a touch detection apparatus and a touch controller. The touch detection apparatus detects the position of the user's touch, detects the signal produced by the touch operation, and transmits the signal to the touch controller. The touch controller receives the touch information from the touch detection apparatus, converts it into contact coordinates, sends them to the processor 110, and receives and executes commands sent by the processor 110. The touch panel 1071 may be implemented in various types, such as resistive, capacitive, infrared, and surface acoustic wave. In addition to the touch panel 1071, the user input unit 107 may also include other input devices 1072. Specifically, the other input devices 1072 may include, but are not limited to, a physical keyboard, function keys (such as volume control keys and a power key), a trackball, a mouse, and a joystick, which are not described in detail here.

Further, the touch panel 1071 may cover the display panel 1061. When the touch panel 1071 detects a touch operation on or near it, it passes the operation to the processor 110 to determine the type of the touch event, and the processor 110 then provides the corresponding visual output on the display panel 1061 according to the type of the touch event. Although in Fig. 8 the touch panel 1071 and the display panel 1061 are two independent components that implement the input and output functions of the terminal device, in some embodiments the touch panel 1071 and the display panel 1061 may be integrated to implement the input and output functions of the terminal device, which is not limited here.

The interface unit 108 is the interface through which an external apparatus is connected to the terminal device. For example, the external apparatus may include a wired or wireless headset port, an external power supply (or battery charger) port, a wired or wireless data port, a memory card port, a port for connecting an apparatus having an identification module, an audio input/output (I/O) port, a video I/O port, an earphone port, and the like. The interface unit 108 may be used to receive input (for example, data information or electric power) from an external apparatus and transfer the received input to one or more elements within the terminal device, or may be used to transmit data between the terminal device and an external apparatus.

The memory 109 can be used to store software programs and various data. The memory 109 may mainly include a program storage area and a data storage area, where the program storage area may store the operating system, application programs required by at least one function (such as a sound playback function or an image playback function), and the like, and the data storage area may store data created through use of the mobile phone (such as audio data or a phone book) and the like. In addition, the memory 109 may include a high-speed random access memory, and may also include a non-volatile memory, for example at least one magnetic disk storage device, a flash memory device, or other volatile solid-state storage devices.

The processor 110 is the control center of the terminal device. It connects all parts of the entire terminal device through various interfaces and lines, and performs the various functions of the terminal device and processes data by running or executing the software programs and/or modules stored in the memory 109 and calling the data stored in the memory 109, thereby monitoring the terminal device as a whole. The processor 110 may include one or more processing units. Optionally, the processor 110 may integrate an application processor and a modem processor, where the application processor mainly handles the operating system, user interface, application programs, and the like, and the modem processor mainly handles wireless communication. It can be understood that the modem processor may also not be integrated into the processor 110.

The terminal device may also include a power supply 111 (such as a battery) that supplies power to all components. Optionally, the power supply 111 may be logically connected to the processor 110 through a power management system, so that functions such as charge management, discharge management, and power consumption management are implemented through the power management system.

In addition, the terminal device includes some functional modules that are not shown, which are not described in detail here.

Embodiment three,

The embodiment of the present invention also provides a system for generating a recording document. Specifically, as shown in Fig. 6, the system for generating a recording document includes an audio-visual acquisition device 61 and at least one terminal device (621, 622 ... 62n) as described in Embodiment two above.

The system for generating a recording document provided in the embodiment of the present invention first obtains M target images, together with M target audios, each obtained by recording ambient sound in the period from the moment at which the audio-visual acquisition equipment performs the current image acquisition to the moment at which it performs the next image acquisition or stops recording ambient sound. It then synthesizes the M target images with the M target audios respectively to generate M target documents, and finally generates a recording document according to the M target documents. Because the terminal device provided in the embodiment of the present invention, when obtaining a target image, also obtains the target audio recorded between the moment that image is acquired and the moment of the next image acquisition or the moment at which recording of ambient sound stops, and synthesizes the target image with that target audio into a target document, the embodiment of the present invention only needs to integrate the target documents in the order in which they were obtained to get the recording document. In the prior art, by contrast, all target images and the entire recorded audio are obtained first, and then all target images and the entire recorded audio have to be integrated with each other. The embodiment of the present invention can therefore reduce the workload of obtaining a recording document, and in turn solves the problem that the process of obtaining a recording document is very cumbersome.

Embodiment four,

The embodiment of the present invention also provides a computer-readable storage medium. A computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, each process of the above method embodiment for generating a recording document is implemented and the same technical effect can be achieved; to avoid repetition, details are not described here again. The computer-readable storage medium is, for example, a read-only memory (Read-Only Memory, ROM for short), a random access memory (Random Access Memory, RAM for short), a magnetic disk, or an optical disc.

The terminal device and the computer storage medium provided in the embodiments of the present invention are used to execute the corresponding method provided above. For the beneficial effects they can achieve, reference may therefore be made to the beneficial effects of the corresponding method provided above, which are not repeated here.

It should be noted that, in this document, the terms "include", "comprise", and any variants thereof are intended to cover non-exclusive inclusion, so that a process, method, article, or apparatus that includes a series of elements includes not only those elements but also other elements that are not explicitly listed, or elements inherent to such a process, method, article, or apparatus. In the absence of further restrictions, an element limited by the phrase "including a ..." does not exclude the existence of other identical elements in the process, method, article, or apparatus that includes the element.

Through the above description of the embodiments, those skilled in the art can clearly understand that the methods of the above embodiments can be implemented by means of software plus a necessary general-purpose hardware platform, and of course can also be implemented by hardware, but in many cases the former is the better implementation. Based on this understanding, the technical solution of the present invention, in essence, or the part that contributes over the prior art, can be embodied in the form of a software product. The computer software product is stored in a storage medium (such as a ROM/RAM, a magnetic disk, or an optical disc) and includes several instructions that cause a terminal (which may be a mobile phone, a computer, a server, an air conditioner, a network device, or the like) to execute the methods described in the embodiments of the present invention.

The embodiments of the present invention have been described above with reference to the accompanying drawings, but the present invention is not limited to the specific embodiments described above. The above specific embodiments are merely illustrative rather than restrictive. Under the inspiration of the present invention, those of ordinary skill in the art can devise many other forms without departing from the purpose of the present invention and the scope protected by the claims, all of which fall within the protection of the present invention.

Claims

1. A method for generating a recording document, applied to a terminal device, characterized in that the method comprises:

obtaining M target images and M target audios from an audio-visual acquisition equipment, where each target audio is the audio obtained by the audio-visual acquisition equipment recording ambient sound within a target time period, and the target time period is the period from the moment at which the audio-visual acquisition equipment performs the current image acquisition to the moment at which the audio-visual acquisition equipment performs the next image acquisition or stops recording ambient sound;

synthesizing the M target images with the M target audios respectively to generate M target documents, and generating a recording document according to the M target documents;

where M is an integer greater than 1.

2. The method according to claim 1, characterized in that, before the M target images are synthesized with the M target audios respectively to generate the M target documents, the method further comprises:

displaying the M target images on M display interfaces respectively;

receiving notes input by a user on at least one of the M display interfaces;

generating M target interfaces according to the M display interfaces and the notes input by the user;

and the synthesizing of the M target images with the M target audios respectively to generate the M target documents comprises:

synthesizing the M target interfaces with the M target audios respectively to generate the M target documents.

3. The method according to claim 1 or 2, characterized in that the obtaining of the M target images and the M target audios from the audio-visual acquisition equipment comprises:

controlling the audio-visual acquisition device to perform M image acquisitions in a target manner to obtain the M target images, and to record ambient sound in M target time periods respectively to obtain the M target audios;

4. The method according to claim 3, characterized in that the controlling of the audio-visual acquisition device to perform M image acquisitions in a target manner to obtain the M target images comprises:

controlling the audio-visual acquisition device to obtain preview images of a preset area in real time and, when the amount of change between two adjacent preview frames is greater than a change threshold, performing one image acquisition on the preset area to obtain one target image.

5. The method according to claim 3, characterized in that the controlling of the audio-visual acquisition device to perform M image acquisitions in a target manner to obtain the M target images comprises:

controlling the audio-visual acquisition device to perform one image acquisition on a preset area at a preset moment to obtain one target image.

6. The method according to claim 3, characterized in that the controlling of the audio-visual acquisition device to perform M image acquisitions in a target manner to obtain the M target images comprises:

receiving a first operation input by a user;

in response to the first operation, controlling the audio-visual acquisition device to perform one image acquisition on a preset area to obtain one target image.

7. A terminal device, characterized by comprising:

an acquiring unit, configured to obtain M target images and M target audios from an audio-visual acquisition equipment, where each target audio is the audio obtained by the audio-visual acquisition equipment recording ambient sound within a target time period, and the target time period is the period from the moment at which the audio-visual acquisition equipment performs the current image acquisition to the moment at which it performs the next image acquisition or stops recording ambient sound;

a generation unit, configured to synthesize the M target images with the M target audios respectively to generate M target documents, and to generate a recording document according to the M target documents;

where M is an integer greater than 1.

8. A terminal device, characterized by comprising a processor, a memory, and a computer program stored in the memory and executable on the processor, where the computer program, when executed by the processor, implements the steps of the method for generating a recording document according to any one of claims 1 to 6.

9. A system for generating a recording document, characterized by comprising an audio-visual acquisition device and at least one terminal device according to claim 7.

10. A computer-readable storage medium, characterized in that a computer program is stored on the computer-readable storage medium, and the computer program, when executed by a processor, implements the steps of the method for generating a recording document according to any one of claims 1 to 6.