CN105677707A

CN105677707A - Method and terminal for achieving picture processing

Info

Publication number: CN105677707A
Application number: CN201511000743.XA
Authority: CN
Inventors: 夏文君
Original assignee: Nubia Technology Co Ltd
Current assignee: Nubia Technology Co Ltd
Priority date: 2015-12-28
Filing date: 2015-12-28
Publication date: 2016-06-15
Also published as: WO2017114390A1

Abstract

The present invention discloses a method and terminal for achieving picture processing. The method comprises the steps of adding corresponding audio data to one or more than one display area of a picture in original file data of the picture; and playing the audio data corresponding to the display areas according to a received trigger instruction corresponding to each display area. According to the method, the audio data corresponding to the different display areas are added to the picture, when the picture is checked, the audio data are played according to the received trigger instructions, and picture display quality and user experience can be improved.

Description

A kind of method realizing picture processing and terminal

Technical field

The present invention relates to multimedia application technology, espespecially a kind of method realizing picture processing and terminal.

Background technology

Along with the development of multimedia technology, the display that can be realized the multimedia file such as picture, audio frequency by the terminal of more and more kinds is play, and user can pass through terminal and receives multimedia file and check.

At present, when carrying out picture display by terminal, obtain pictorial information only by vision, the more valuable contents contained well are not represented, have impact on the display quality of picture and Consumer's Experience in picture.

Summary of the invention

In order to solve above-mentioned technical problem, the present invention provides a kind of method realizing picture processing and terminal, it is possible to increase the display quality of picture.

In order to reach the object of the invention, the invention provides a kind of terminal realizing picture processing, including: the first adding device and triggering broadcast unit; Wherein,

First adding device is used for, and in the original data of picture, one or more viewing areas of picture is added corresponding voice data;

Triggering broadcast unit is used for, and plays, according to the corresponding triggering command in each viewing area received, the voice data becoming corresponding relation with viewing area.

Optionally, the first adding device specifically for,

Afterbody in the described original data of picture adds the corresponding voice data of zone position information and each viewing area comprising one or more viewing areas.

Optionally, this terminal also includes the second adding device,

Second adding device is used for, after the first adding device adds described voice data,

Add the document size information of each voice data and/or the picture size information of described picture and/or added the identification information of described voice data.

On the other hand, the application also provides for a kind of method realizing picture processing, including:

Corresponding voice data is added in one or more viewing areas of picture by the original data of picture;

The voice data becoming corresponding relation with viewing area is play according to the corresponding triggering command in each viewing area received.

Optionally, original data include: open, by file stream mode, the data that described picture obtains.

Optionally, original data are image cache buffer data.

Optionally, add corresponding voice data for one or more viewing areas of picture to specifically include:

Optionally, after adding voice data, the method also includes:

Add the document size information of each voice data; And/or,

Add the picture size information of described picture; And/or,

Add the identification information having added described voice data.

Optionally, voice data includes: the voice data inputted by mike and/or the voice data prestored.

Optionally, viewing area is region set in advance or receives the region that external command selects.

Compared with prior art, technical scheme includes: in the original data of picture, one or more viewing areas of picture are added corresponding voice data; The voice data becoming corresponding relation with viewing area is play according to the corresponding triggering command in each viewing area received. The inventive method is by adding the corresponding voice data in different viewing areas for picture, and when carrying out picture and checking, the triggering command according to receiving carries out voice data broadcasting, improves quality and Consumer's Experience that picture shows.

Accompanying drawing explanation

Accompanying drawing described herein is used for providing a further understanding of the present invention, constitutes the part of the application, and the schematic description and description of the present invention is used for explaining the present invention, is not intended that inappropriate limitation of the present invention. In the accompanying drawings:

Fig. 1 realizes the hardware architecture diagram of an optional mobile terminal in each embodiment of the present invention;

Fig. 2 is the flow chart that the embodiment of the present invention realizes the method for picture processing;

Fig. 3 is the schematic diagram that the embodiment of the present invention adds a voice data in the original of picture;

Fig. 4 is the schematic diagram that the embodiment of the present invention adds multiple voice data in the original of picture;

Fig. 5 is the flow chart that another embodiment of the present invention realizes the method for picture processing;

Fig. 6 is the structured flowchart that the embodiment of the present invention realizes the terminal of picture processing.

Detailed description of the invention

For making the object, technical solutions and advantages of the present invention clearly understand, below in conjunction with accompanying drawing, embodiments of the invention are described in detail. It should be noted that when not conflicting, the embodiment in the application and the feature in embodiment can combination in any mutually.

The mobile terminal realizing each embodiment of the present invention is described referring now to accompanying drawing. In follow-up description, use the suffix being used for representing such as " module ", " parts " or " unit " of element only for being conducive to the explanation of the present invention, itself do not have specific meaning. Therefore, " module " and " parts " can mixedly use.

Mobile terminal can be implemented in a variety of manners. Such as, the terminal described in the present invention can include the mobile terminal of such as mobile phone, smart phone, notebook computer, PDA (personal digital assistant), PAD (panel computer), PMP (portable media player) etc. and the fixed terminal of such as numeral TV, desk computer etc. Hereinafter it is assumed that terminal is mobile terminal. However, it will be understood by those skilled in the art that, except being used in particular for the element of mobile purpose, structure according to the embodiment of the present invention can also apply to the terminal of fixed type.

Fig. 1 realizes the hardware architecture diagram of an optional mobile terminal in each embodiment of the present invention.

Mobile terminal 100 can include A/V (audio/video) input block 120, user input unit 130, output unit 150, memorizer 160, controller 180 and power subsystem 190 etc. Fig. 1 illustrates the mobile terminal with various assembly, it should be understood that be not required for implementing all assemblies illustrated. Can alternatively implement more or less of assembly. Will be discussed in more detail below the element of mobile terminal.

A/V input block 120 is used for receiving audio frequency. A/V input block 120 can include camera 121 and mike 1220, and the view data of the camera 121 static images to being obtained by image capture apparatus in Video Capture pattern or image capture mode or video processes. Picture frame after process may be displayed on display unit 151. Picture frame after camera 121 processes can be stored in memorizer 160 (or other storage medium) or be transmitted via wireless communication unit 110, it is possible to provide two or more cameras 1210 according to the structure of mobile terminal. Such acoustic processing can via microphones sound (voice data) in logging mode, speech recognition mode etc. operational mode, and can be voice data by mike 122. Mike 122 can implement various types of noise elimination (or suppression) algorithm to eliminate (or suppression) in the noise received and produce in the process of transmission audio signal or interference.

User input unit 130 can generate key input data to control the various operations of mobile terminal according to the order of user's input. User input unit 130 allows user to input various types of information, and can include keyboard, metal dome, touch pad (such as, detection due to touched and cause resistance, pressure, electric capacity etc. the sensitive component of change), roller, rocking bar etc. Especially, when touch pad is superimposed upon on display unit 151 as a layer, it is possible to form touch screen.

Output unit 150 is configured to provide output signal (such as, audio signal, video signal, alarm signal, vibration signal etc.) with vision, audio frequency and/or tactile manner. Output unit 150 can include display unit 151, dio Output Modules 152 etc.

Display unit 151 may be displayed on the information processed in mobile terminal 100.

Meanwhile, when display unit 151 and touch pad as a layer superposed on one another to form touch screen time, display unit 151 can serve as input equipment and output device. Display unit 151 can include at least one in liquid crystal display (LCD), thin film transistor (TFT) LCD (TFT-LCD), Organic Light Emitting Diode (OLED) display, flexible display, three-dimensional (3D) display etc. Some in these display may be constructed such that transparence is to allow user to watch from outside, and this is properly termed as transparent display, and typical transparent display can be such as TOLED (transparent organic light emitting diode) display etc. According to the specific embodiment wanted, mobile terminal 100 can include two or more display units (or other display device), such as, mobile terminal can include outernal display unit (not shown) and inner display unit (not shown). Touch screen can be used for detecting touch input pressure and touch input position and touch input area.

Dio Output Modules 152 can provide the audio frequency output (such as, call signal receive sound, message sink sound etc.) relevant to the specific function of mobile terminal 100 execution.Dio Output Modules 152 can include speaker, buzzer etc.

Memorizer 160 can store the process performed by controller 180 and the software program controlling operation etc., or can temporarily store the data (such as, telephone directory, message, still image, video etc.) that oneself maybe will export through output. And, memorizer 160 can store the vibration about the various modes exported when touching and being applied to touch screen and the data of audio signal.

Memorizer 160 can include the storage medium of at least one type, described storage medium includes flash memory, hard disk, multimedia card, card-type memorizer (such as, SD or DX memorizer etc.), random access storage device (RAM), static random-access memory (SRAM), read only memory (ROM), Electrically Erasable Read Only Memory (EEPROM), programmable read only memory (PROM), magnetic storage, disk, CD etc. And, mobile terminal 100 can be connected the network storage device cooperation of the storage function performing memorizer 160 with by network.

Controller 180 generally controls the overall operation of mobile terminal. Such as, controller 180 performs the control relevant to voice call, data communication, video calling etc. and process. It addition, controller 180 can include the multi-media module 1810 for reproducing (or playback) multi-medium data, multi-media module 1810 can construct in controller 180, or it is so structured that separates with controller 180. Controller 180 can perform pattern recognition process, so that the handwriting input performed on the touchscreen or picture drafting input are identified as character or image.

Power subsystem 190 receives external power or internal power under the control of controller 180 and provides the suitable electric power operated needed for each element and assembly.

Various embodiment described herein can to use such as computer software, hardware or its any combination of computer-readable medium to implement. Hardware is implemented, embodiment described herein can pass through to use application-specific IC (ASIC), digital signal processor (DSP), digital signal processing device (DSPD), programmable logic device (PLD), field programmable gate array (FPGA), processor, controller, microcontroller, microprocessor, at least one that is designed to perform in the electronic unit of function described herein to implement, in some cases, such embodiment can be implemented in controller 180. Implementing for software, the embodiment of such as process or function can be implemented with allowing the independent software module performing at least one function or operation. Software code can be implemented by the software application (or program) write with any suitable programming language, and software code can be stored in memorizer 160 and be performed by controller 180.

So far, oneself is through describing mobile terminal according to its function. Below, for the sake of brevity, by the slide type mobile terminal in the various types of mobile terminals describing such as folded form, board-type, oscillating-type, slide type mobile terminal etc. exemplarily. Therefore, the present invention can be applied to any kind of mobile terminal, and is not limited to slide type mobile terminal.

Based on above-mentioned mobile terminal hardware configuration and communication system, it is proposed to each embodiment of the inventive method.

Fig. 2 is the flow chart that the embodiment of the present invention realizes the method for picture processing, as in figure 2 it is shown, include:

Step 200, in the original data of picture, corresponding voice data is added in one or more viewing areas of picture;

Optionally,

Original data include: open, by file stream mode, the data that picture obtains.

It should be noted that open picture by file stream mode to obtain the alternative embodiment that original data method is the embodiment of the present invention of picture; Can be applied to realize the embodiment of the present invention for other any methods that can obtain original data.

Optionally,

Original data are image cache (buffer) data.

It should be noted that image cache data are the embodiment of the present invention optionally one data type, it will be appreciated by those skilled in the art that and can be applied to implement the present invention for the other kinds of original data that can carry out voice data interpolation.

It should be noted that the voice data inputted by mike is mainly used in: the picture processing carried out at shooting picture scene; Or, it is possible to meet and carry out the sound that the voice data of picture processing requires; The voice data of picture processing carried out at shooting picture scene generally comprises the sound of actual scene, for instance, the tweedle in forest, the hubbub at spacious place, sound of sea wave, sound of the wind, streams sound, steamer blast of whistle, train blow a whistle the more featured sound in the scenes such as sound; Can meet the sound that the voice data carrying out picture processing requires can be individual voice, vehicle whistle sound, telephone rings sound, footsteps, laughter etc. The voice data prestored can be the voice data recorded in advance to carry out picture processing, can also be the voice data downloaded from network according to picture processing demand, it is also possible to be the voice data adopting audio processing software synthesis storage according to demand. The voice data of storage is not intended to source and the kind of voice data.

In this step, one or more viewing areas corresponding voice data of interpolation for picture specifically includes:

Afterbody in the original data of picture adds the corresponding voice data of zone position information and each viewing area comprising one or more viewing areas.

It should be noted that, here zone position information refers generally to the relative position information of coordinate information or picture, if picture shows according to fixed dimension, then zone position information can adopt co-ordinate position information, and co-ordinate position information now can be identical with the relative position information carrying out zone position information description with picture display length. To each viewing area, corresponding voice data can pass through to set up each viewing area and associates with the mapping relations realization of each voice data respectively; Table 1 is the mapping relations signal adding corresponding voice data for each viewing area, and as shown in table 1, the embodiment of the present invention is that viewing area 1, viewing area 2 and viewing area 3 with the addition of different voice data 1, voice data 2 and voice data 3 respectively; Certainly, the embodiment of the present invention can also adopt identical voice data in configuration section viewing area, for instance, configuration viewing area 2 voice data 4 identical with viewing area 3. Concrete arranges and according to user, the understanding and grasping of picture can be modified.

Viewing area is numbered	The corresponding voice data numbering in viewing area
		Viewing area 1	Voice data 1
Viewing area 2	Voice data 2
		Viewing area 3	Voice data 3

Table 1

It should be noted that, picture region of the present invention is generally carried out selection by user and determines, namely determine the coordinate of picture region by user or carry out regional choice by drawing a circle to approve picture region, such as, a secondary picture is the scene on seashore at dusk, picture comprises sunset clouds and mirrors sea, cheerful and light-hearted the running forward in the positive sandy beach of child surged, and open emerging dehiscing backward and look at father and mother, speak as in joy loud and father and mother;Mutual hand in hand father and mother and child keep the distance of three small steps, as responding child, again as what being said at little sound each other; When above-mentioned image is processed, it is necessary to picture displays the division in region, the embodiment of the present invention can carry out the selection in region by receiving external command, for instance, select regional extent by mouse track, determine viewing area by the regional extent selected; For above-mentioned image content, it is possible to the region comprising sea in picture is set as viewing area 1, by child region for being set as viewing area 2, father and mother region is set as viewing area 3.

Step 201, the corresponding triggering command in each viewing area passing through to receive play the voice data becoming corresponding relation with viewing area.

It should be noted that the terminal that the present invention plays with audio frequency by picture can be carried out to show realizes, it is possible to be the equipment such as computer, mobile phone, flat board. Here triggering command can be touchscreen commands or the signal instruction of other typing terminals input, for instance the selection instruction of mouse. Still for the picture of the scene on above-mentioned dusk seashore, it is assumed that the region comprising sea in picture is set as viewing area 1, by child region for being set as viewing area 2, father and mother region is set as viewing area 3. Afterbody in the original data of picture adds the corresponding voice data of zone position information and each viewing area of each viewing area; Namely the voice data that mike input comprises sound of surging or the voice data comprising sound of surging prestored can be passed through in viewing area 1; Mike input can be passed through in viewing area 2 and comprise child's laughter and the voice data of shout sound; Mike input can be passed through in viewing area 3 and comprise the voice data of the talk sound of happiness between father and mother response and father and mother to child. Then in checking picture process, it is assumed that the selection instruction that triggering command is mouse selects viewing area 1, the then voice data comprising sound of surging or the voice data comprising sound of surging prestored to be played; When selecting viewing area 2 by mouse, the voice data comprising child's laughter and shout sound is played; When selecting viewing area 3 by mouse, comprise father and mother and the voice data of the talk sound of happiness between response and the father and mother of child is played. If above-mentioned image display area is adjusted, for instance, the scene in sea is very vast and have artistic conception in picture, then sea can be divided into two or more viewing areas, child and father and mother and can be divided into same viewing area; The multiple viewing areas dividing sea can add identical or different sound of surging respectively, by triggering command trigger big sea region carry out voice data play time, then the broadcasting of many groups of sound of surging can the scene of sea swells in reproduced picture greatly; Using child's laughter and the shout talk sound voice data complete as viewing area of happiness between sound, father and mother response and father and mother to child, it is possible to mutual between complete continuous print reduction personage, good Consumer's Experience can be obtained by picture processing.

Fig. 3 is the schematic diagram that the embodiment of the present invention adds a voice data in the original of picture, as shown in Figure 3, terminal display screen demonstrates the picture of sea swells, a voice data is only added due to embodiment, voice data according to picture display interpolation can be the sound of sea swells, the present embodiment zone line bottom Fig. 3 adds voice data, it is provided with reception triggering command simultaneously in this position and carries out the touch key-press of voice data broadcasting, when user is when consulting picture, triggered the broadcasting carrying out sea swells sound by touch key-press.

Fig. 4 is the schematic diagram that the embodiment of the present invention adds multiple voice data in the original of picture, as shown in Figure 4, the present embodiment with the addition of three voice datas in the original of picture, the viewing area of voice data added is respectively as follows: the Mr. and Mrs on sandy beach, run towards seabeach on wave two children and three viewing areas of child's spray after one's death, the corresponding reception triggering command adding audio icon three viewing areas carries out the touch key-press of audio frequency broadcasting, can the voice data of each viewing area be played out by the triggering of touch key-press. According to practical situation and picture, the position of embodiment of the present invention touch key-press can show that requirement etc. is adjusted, belong to the conventional techniques means of those skilled in the art.

After adding voice data, the embodiment of the present invention also includes:

Add the document size information of each voice data; And/or,

Add the picture size information of picture; And/or,

Add the identification information having added voice data.

Can so that when those skilled in the art carry out picture processing, the voice data added being held it should be noted that add voice data size information; Add picture size information and be determined for whether picture changes; The identification information having added voice data may be used for distinguishing whether carried out the picture processing of the embodiment of the present invention.

The inventive method is by adding the corresponding voice data in different viewing areas for picture, and when carrying out picture and checking, the triggering command according to receiving carries out voice data broadcasting, improves quality and Consumer's Experience that picture shows.

Fig. 5 is the flow chart that another embodiment of the present invention realizes the method for picture processing, as it is shown in figure 5, include:

Step 500, opened by file stream mode picture obtain data;

Optionally,

Original data are image cache (buffer) data.

Step 501, in the original data of picture, corresponding voice data is added in one or more viewing areas of picture;

It should be noted that the voice data inputted by mike is mainly used in: the picture processing carried out at shooting picture scene; Or, it is possible to meet and carry out the sound that the voice data of picture processing requires; The voice data of picture processing carried out at shooting picture scene generally comprises the sound of actual scene, for instance, the tweedle in forest, the hubbub at spacious place, sound of sea wave, sound of the wind, streams sound, steamer blast of whistle, train blow a whistle the more featured sound in the scenes such as sound; Can meet the sound that the voice data carrying out picture processing requires can be individual voice, vehicle whistle sound, telephone rings sound, footsteps, laughter etc. The voice data prestored can be the voice data recorded in advance to carry out picture processing, can also be the voice data downloaded from network according to picture processing demand, it is also possible to be the voice data adopting audio processing software synthesis storage according to demand.The voice data of storage is not intended to source and the kind of voice data.

In this step, one or more viewing areas corresponding voice data of interpolation for picture includes:

It should be noted that, here zone position information refers generally to the relative position information of coordinate information or picture, if picture shows according to fixed dimension, then zone position information can adopt co-ordinate position information, and co-ordinate position information now can be identical with the relative position information carrying out zone position information description with picture display length. To each viewing area, corresponding voice data can pass through to set up each viewing area and associates with the mapping relations realization of each voice data respectively.

Optionally, embodiment of the present invention viewing area is region set in advance or receives the region that external command selects.

It should be noted that, picture region of the present invention is generally carried out selection by user and determines, namely determine the coordinate of picture region by user or carry out regional choice by drawing a circle to approve picture region, such as, a secondary picture is the scene on seashore at dusk, picture comprises sunset clouds and mirrors sea, cheerful and light-hearted the running forward in the positive sandy beach of child surged, and open emerging dehiscing backward and look at father and mother, speak as in joy loud and father and mother; Mutual hand in hand father and mother and child keep the distance of three small steps, as responding child, again as what being said at little sound each other; When above-mentioned image is processed, it is necessary to picture displays the division in region, the embodiment of the present invention can carry out the selection in region by receiving external command, for instance, select regional extent by mouse track, determine viewing area by the regional extent selected; For above-mentioned image content, it is possible to the region comprising sea in picture is set as viewing area 1, by child region for being set as viewing area 2, father and mother region is set as viewing area 3.

Step 502, the original data of picture are added the document size information of each voice data, the picture size information of picture and has added the identification information of voice data.

Can so that when those skilled in the art carry out picture processing, the voice data added being held it should be noted that add voice data size information; Add picture size information and be determined for whether picture changes; The identification information having added voice data may be used for distinguishing whether carried out the picture processing of the embodiment of the present invention. The embodiment of the present invention is added after above-mentioned all the elements, and the content that the afterbody of original data adds includes: (viewing area 1, the file size of voice data 1, voice data 1), (viewing area 2, the file size of voice data 2, voice data 2) ... (viewing area n-1, the file size of voice data n-1, voice data n-1), (regional location n, the file size of voice data n, voice data n), picture size, add the identification information of voice data.

Step 503, the corresponding triggering command in each viewing area passing through to receive play the voice data becoming corresponding relation with viewing area.

It should be noted that the terminal that the present invention plays with audio frequency by picture can be carried out to show realizes, it is possible to be the equipment such as computer, mobile phone, flat board. Here triggering command can be touchscreen commands or the signal instruction of other typing terminals input, for instance the selection instruction of mouse.Appoint so for the picture of the scene on above-mentioned dusk seashore, it is assumed that the region comprising sea in picture is set as viewing area 1, by child region for being set as viewing area 2, father and mother region is set as viewing area 3. Afterbody in the original data of picture adds the corresponding voice data of zone position information and each viewing area of each viewing area; Namely the voice data that mike input comprises sound of surging or the voice data comprising sound of surging prestored can be passed through in viewing area 1; Mike input can be passed through in viewing area 2 and comprise child's laughter and the voice data of shout sound; Mike input can be passed through in viewing area 3 and comprise the voice data of the talk sound of happiness between father and mother response and father and mother to child. Then in checking picture process, it is assumed that the selection instruction that triggering command is mouse selects viewing area 1, the then voice data comprising sound of surging or the voice data comprising sound of surging prestored to be played; When selecting viewing area 2 by mouse, the voice data comprising child's laughter and shout sound is played; When selecting viewing area 3 by mouse, comprise father and mother and the voice data of the talk sound of happiness between response and the father and mother of child is played.

Fig. 6 is the structured flowchart that the embodiment of the present invention realizes the terminal of picture processing, as shown in Figure 6, including: the first adding device and triggering broadcast unit; Wherein,

It should be noted that original data include: open, by file stream mode, the data that picture obtains. Open picture by file stream mode and obtain the alternative embodiment that original data method is the embodiment of the present invention of picture; Can be applied to realize the embodiment of the present invention for other any methods that can obtain original data.

Here, original data can be image cache data. Image cache data are optionally a kind of data type of the embodiment of the present invention, it will be appreciated by those skilled in the art that and can be applied to implement the present invention for the other kinds of original data that can carry out voice data interpolation.

Here, voice data includes: the voice data inputted by mike and/or the voice data prestored. The voice data inputted by mike is mainly used in: the picture processing carried out at shooting picture scene; Or, it is possible to meet and carry out the sound that the voice data of picture processing requires; The voice data of picture processing carried out at shooting picture scene generally comprises the sound of actual scene, for instance, the tweedle in forest, the hubbub at spacious place, sound of sea wave, sound of the wind, streams sound, steamer blast of whistle, train blow a whistle the more featured sound in the scenes such as sound; Can meet the sound that the voice data carrying out picture processing requires can be individual voice, vehicle whistle sound, telephone rings sound, footsteps, laughter etc. The voice data prestored can be the voice data recorded in advance to carry out picture processing, can also be the voice data downloaded from network according to picture processing demand, it is also possible to be the voice data adopting audio processing software synthesis storage according to demand. The voice data of storage is not intended to source and the kind of voice data.

Viewing area is region set in advance or receives the region that external command selects.

First adding device specifically for,

Terminal of the present invention also includes the second adding device,

Second adding device is used for: after the first adding device adds voice data,

Add the document size information of each voice data and/or the picture size information of picture and/or added the identification information of voice data.

A kind of terminal realizing picture processing, including: the first adding device, the second adding device and triggering broadcast unit; Wherein,

It should be noted that original data include: open, by file stream mode, the data that picture obtains;

Original data can be image cache data;

Voice data includes: the voice data inputted by mike and/or the voice data prestored.

First adding device specifically for,

Second adding device is used for, after the first adding device adds voice data,

Although the embodiment that disclosed herein is as above, but described content is only the embodiment readily appreciating the present invention and adopt, and is not limited to the present invention. Technical staff in any art of the present invention; under the premise without departing from the spirit and scope that disclosed herein; any amendment and change can be carried out in the form implemented and details; but the scope of patent protection of the present invention, still must be as the criterion with the scope that appending claims defines.

Claims

1. the terminal realizing picture processing, it is characterised in that including: the first adding device and triggering broadcast unit; Wherein,

2. terminal according to claim 1, it is characterised in that described first adding device specifically for,

3. terminal according to claim 1 and 2, it is characterised in that this terminal also includes the second adding device,

4. the method realizing picture processing, it is characterised in that including:

5. method according to claim 4, it is characterised in that described original data include: open the data that described picture obtains by file stream mode.

6. method according to claim 4, it is characterised in that described original data are image cache buffer data.

7. the method according to claim 4,5 or 6, it is characterised in that described one or more viewing areas for picture are added corresponding voice data and specifically included:

8. the method according to claim 4,5 or 6, it is characterised in that after described interpolation voice data, the method also includes:

Add the document size information of each voice data; And/or,

Add the picture size information of described picture; And/or,

Add the identification information having added described voice data.

9. the method according to claim 4,5 or 6, it is characterised in that described voice data includes: the voice data inputted by mike and/or the voice data prestored.

10. the method according to claim 4,5 or 6, it is characterised in that described viewing area is region set in advance or receives the region that external command selects.