CN110322421A - One kind being based on multimedia information processing method - Google Patents

One kind being based on multimedia information processing method Download PDF

Info

Publication number
CN110322421A
CN110322421A CN201910647346.3A CN201910647346A CN110322421A CN 110322421 A CN110322421 A CN 110322421A CN 201910647346 A CN201910647346 A CN 201910647346A CN 110322421 A CN110322421 A CN 110322421A
Authority
CN
China
Prior art keywords
image
signal
information processing
carries out
multimedia
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910647346.3A
Other languages
Chinese (zh)
Inventor
刘恩希
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201910647346.3A priority Critical patent/CN110322421A/en
Publication of CN110322421A publication Critical patent/CN110322421A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • G06T5/20Image enhancement or restoration by the use of local operators
    • G06T5/30Erosion or dilatation, e.g. thinning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/11Region-based segmentation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/44Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/85Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10016Video; Image sequence

Abstract

The invention discloses one kind to be based on multimedia information processing method, comprising the following steps: S1: multimedia messages sampling;S2: image enhancement;S3: image procossing;S4: identification region carries out character representation and description;S5: image segmentation;S6: information coding;S7: Information Compression;S8: audio-frequency information processing;S9: video information process, the present invention successively carries out image enhancement by the multimedia messages to sampling, image procossing, the description of identification region feature and image Segmentation Technology are handled, multimedia messages are integrated, keep multimedia more science intuitive, pass through technology standardization specification, so that audio, video, image etc. carries out integrated treatment, the graphical information for ensuring to generate is vivid, characteristics of image identification carries out the identification of vibration signal using Fourier descriptor, contour feature is transformed from a spatial domain in frequency domain, profile is digitized, so as to better discriminate between different profiles, achieve the purpose that identify object.

Description

One kind being based on multimedia information processing method
Technical field
The present invention relates to technical field of information processing, specially a kind of to be based on multimedia information processing method.
Background technique
Multimedia itself there are two aspect as all modern technologies it be by hardware and software or machine and thought Mixing composition.Multimedia technology and function can conceptually be divided into control system and information.Multimedia why can Realization is by digital technology.Multimedia represents converging for digital control and Digital Media, and computer is numerical control system, and is counted Word media are the state-of-the-art storage of current audio and video and mode of propagation.Multimedia messages refer to text, image, image, Sound and animation etc. are the media information of the form of expression, and meaning is generally acknowledged that the phase referred to storing with taking technology to obtain again Close the digital information in information, especially computer.
In multimedia signal processing, when to information processings such as images, vision is presented existing information processing method Graphical information vividness it is inadequate, it is indefinite for the discrimination of information profile so that identification object the effect is unsatisfactory, be This, it is proposed that a kind of be based on multimedia information processing method.
Summary of the invention
The purpose of the present invention is to provide one kind to be based on multimedia information processing method, multimedia messages is carried out whole It closes, keeps multimedia more science intuitive, by technology standardization specification, so that audio, video, image etc. carry out General Office Reason, it is ensured that the graphical information of generation is vivid, and characteristics of image identification carries out the identification of vibration signal using Fourier descriptor, Contour feature is transformed from a spatial domain in frequency domain, extracts feature vector of the frequency domain information as image, i.e., with a vector generation One profile of table, profile is digitized, and so as to better discriminate between different profiles, achievees the purpose that identify object, to solve The problems mentioned above in the background art.
To achieve the above object, the invention provides the following technical scheme: a kind of be based on multimedia information processing method, packet Include following steps:
S1: multimedia messages sampling acquires multimedia messages data, quantizing noise and receiver noise factor by external equipment Influence, sampling exports interrupted burst pulse, and sampling is exported resulting instantaneous analog signal and protected by sampling output digitized signal Hold a period of time;
S2: unsharp image is apparent from and is emphasized feature, in enlarged image between different objects feature by image enhancement Difference inhibits inappropriate feature, and improving image quality, abundant information amount reinforce image interpretation and recognition effect, meets analysis Needs;
S3: image procossing carries out image reinforcement, including image expansion, holes filling, region point by Morphological scale-space method It cuts;
S4: identification region carries out character representation and description, by extraction image bone, extracts the Fourier descriptor of image, and The identification of vibration signal is carried out by Fourier descriptor;
S5: image segmentation carries out emphasis segmentation to the image that needs are divided, allows image precisely to be identified, be analyzed and understand, reach To the target of image zooming-out;
S6: information coding, by after extraction target image and audio signal carry out coding output;
S7: Information Compression, steps are as follows:
A: audio signal is divided into the voice of telephony quality, the audio signal of amplitude modulation broadcasting quality by the compressed encoding of audio signal With clear stereo signal, when information source generate signal have redundancy when, it is compressed, input signal by encoder into Row analysis synthesis, synthesizes binary coded signal, carries out signal output by decoder;
B: the compressed encoding of vision signal utilizes elimination image very strong correlation bring data redundancy on room and time Degree compresses it to meet application requirement, and input signal carries out analysis synthesis, synthesis binary coding letter by encoder Number, signal output is carried out by decoder;
S8: audio-frequency information processing is directly used after being modified using ready-made material or to ready-made material, and certainly by user Oneself creates;
S9: video information process is believed after editing by being acquired to video information and then to video information using video Breath.
Preferably, in the step S1 sampling rate formula are as follows:
FS=2.5fmax (1).
Preferably, image enhancement includes frequency domain method and space domain method in the step S2.
Preferably, quantization is that the sampled signal of continuous amplitude is converted into discrete time, discrete amplitudes in the step S1 Digital signal, the main problem of quantization is quantization error.
Preferably, image expansion is to obtain relatively with the image of own origin and by reflecting relatively in the step S3 As the expansion based on being shifted;Holes filling is to be filled out using the imfill in Matlab software for bianry image hole It fills, to be used to fill image-region and cavity;Region segmentation refers to that the data being analysed to carry out region division, will wherein feel emerging The data slot of interest, which extracts, to be further processed, and other data is abandoned, the main purpose of region segmentation, is to reduce The data volume of subsequent processing.
Preferably, in the step S4 Fourier descriptor complex function z(t) formula are as follows:
(2)
Wherein, t is time variable, seriesThe referred to as Fourier descriptor of curve C;
When curve distance s is useful in comparison with the time, L is length of curve, Fourier descriptorThen indicate:(3).
Preferably, the compression coding mode of a step sound intermediate frequency signal is divided into waveform coding, analysis synthesis in the step S7 Coding and mixed type coding, the frequency range of audio signal are 300Hz-3400Hz.
Preferably, video image compressing method includes lossy compression and lossless compression in b step in the step S7.
Compared with prior art, the beneficial effects of the present invention are: strict control of the present invention should be based at multimedia information Reason method successively carries out image enhancement, image procossing, the description of identification region feature and image by the multimedia messages to sampling Cutting techniques are handled, and ensure that the integrality of information processing, multimedia messages are integrated, and make multimedia more section It learns intuitively, by technology standardization specification, so that audio, video, image etc. carry out integrated treatment, it is ensured that the graphical information of generation Vivid, characteristics of image identification carries out the identification of vibration signal using Fourier descriptor, and contour feature is become from spatial domain It changes in frequency domain, extracts feature vector of the frequency domain information as image, i.e., represent a profile with a vector, by profile number Change, so as to better discriminate between different profiles, achievees the purpose that identify object.
Specific embodiment
In order to make the objectives, technical solutions, and advantages of the present invention clearer, below in conjunction with specific embodiment, to this Invention is further elaborated.It should be appreciated that the specific embodiments described herein are merely illustrative of the present invention, not For limiting the present invention.
One kind being based on multimedia information processing method, comprising the following steps:
S1: multimedia messages sampling acquires multimedia messages data, quantizing noise and receiver noise factor by external equipment Influence, sampling exports interrupted burst pulse, and sampling is exported resulting instantaneous analog signal and protected by sampling output digitized signal Hold a period of time;
S2: unsharp image is apparent from and is emphasized feature, in enlarged image between different objects feature by image enhancement Difference inhibits inappropriate feature, and improving image quality, abundant information amount reinforce image interpretation and recognition effect, meets analysis Needs;
S3: image procossing carries out image reinforcement, including image expansion, holes filling, region point by Morphological scale-space method It cuts;
S4: identification region carries out character representation and description, by extraction image bone, extracts the Fourier descriptor of image, and The identification of vibration signal is carried out by Fourier descriptor;
S5: image segmentation carries out emphasis segmentation to the image that needs are divided, allows image precisely to be identified, be analyzed and understand, reach To the target of image zooming-out;
S6: information coding, by after extraction target image and audio signal carry out coding output;
S7: Information Compression, steps are as follows:
A: audio signal is divided into the voice of telephony quality, the audio signal of amplitude modulation broadcasting quality by the compressed encoding of audio signal With clear stereo signal, when information source generate signal have redundancy when, it is compressed, input signal by encoder into Row analysis synthesis, synthesizes binary coded signal, carries out signal output by decoder;
B: the compressed encoding of vision signal utilizes elimination image very strong correlation bring data redundancy on room and time Degree compresses it to meet application requirement, and input signal carries out analysis synthesis, synthesis binary coding letter by encoder Number, signal output is carried out by decoder;
S8: audio-frequency information processing is directly used after being modified using ready-made material or to ready-made material, and certainly by user Oneself creates;
S9: video information process is believed after editing by being acquired to video information and then to video information using video Breath.
Specifically, in the step S1 sampling rate formula are as follows:
FS=2.5fmax (1).
Specifically, image enhancement includes frequency domain method and space domain method in the step S2.
Specifically, quantization is that the sampled signal of continuous amplitude is converted into discrete time, discrete amplitudes in the step S1 Digital signal, the main problem of quantization is quantization error.
Specifically, image expansion is to obtain relatively with the image of own origin and by reflecting relatively in the step S3 As the expansion based on being shifted;Holes filling is to be filled out using the imfill in Matlab software for bianry image hole It fills, to be used to fill image-region and cavity;Region segmentation refers to that the data being analysed to carry out region division, will wherein feel emerging The data slot of interest, which extracts, to be further processed, and other data is abandoned, the main purpose of region segmentation, is to reduce The data volume of subsequent processing.
Specifically, in the step S4 Fourier descriptor complex function z(t) formula are as follows:
(2)
Wherein, t is time variable, seriesThe referred to as Fourier descriptor of curve C;
When curve distance s is useful in comparison with the time, L is length of curve, Fourier descriptorThen indicate:(3).
Specifically, the compression coding mode of a step sound intermediate frequency signal is divided into waveform coding, analysis synthesis in the step S7 Coding and mixed type coding, the frequency range of audio signal are 300Hz-3400Hz.
Specifically, video image compressing method includes lossy compression and lossless compression in b step in the step S7.
In summary: strict control of the present invention should be based on multimedia information processing method, pass through the multimedia to sampling Information successively carries out image enhancement, image procossing, the description of identification region feature and image Segmentation Technology and is handled, and ensure that letter The integrality for ceasing processing, multimedia messages are integrated, and are kept multimedia more science intuitive, are advised by technology standardization Model, so that audio, video, image etc. carry out integrated treatment, it is ensured that the graphical information of generation is vivid, and characteristics of image identification is adopted The identification that vibration signal is carried out with Fourier descriptor, contour feature is transformed from a spatial domain in frequency domain, extracts frequency domain information As the feature vector of image, i.e., a profile is represented with a vector, profile is digitized, so as to better discriminate between difference Profile, achieve the purpose that identify object.
The foregoing is only a preferred embodiment of the present invention, but scope of protection of the present invention is not limited thereto, Anyone skilled in the art in the technical scope disclosed by the present invention, according to the technique and scheme of the present invention and its Inventive concept is subject to equivalent substitution or change, should be covered by the protection scope of the present invention.

Claims (8)

1. one kind is based on multimedia information processing method, it is characterised in that: the following steps are included:
S1: multimedia messages sampling acquires multimedia messages data, quantizing noise and receiver noise factor by external equipment Influence, sampling exports interrupted burst pulse, and sampling is exported resulting instantaneous analog signal and protected by sampling output digitized signal Hold a period of time;
S2: unsharp image is apparent from and is emphasized feature, in enlarged image between different objects feature by image enhancement Difference inhibits inappropriate feature, and improving image quality, abundant information amount reinforce image interpretation and recognition effect, meets analysis Needs;
S3: image procossing carries out image reinforcement, including image expansion, holes filling, region point by Morphological scale-space method It cuts;
S4: identification region carries out character representation and description, by extraction image bone, extracts the Fourier descriptor of image, and The identification of vibration signal is carried out by Fourier descriptor;
S5: image segmentation carries out emphasis segmentation to the image that needs are divided, allows image precisely to be identified, be analyzed and understand, reach To the target of image zooming-out;
S6: information coding, by after extraction target image and audio signal carry out coding output;
S7: Information Compression, steps are as follows:
A: audio signal is divided into the voice of telephony quality, the audio signal of amplitude modulation broadcasting quality by the compressed encoding of audio signal With clear stereo signal, when information source generate signal have redundancy when, it is compressed, input signal by encoder into Row analysis synthesis, synthesizes binary coded signal, carries out signal output by decoder;
B: the compressed encoding of vision signal utilizes elimination image very strong correlation bring data redundancy on room and time Degree compresses it to meet application requirement, and input signal carries out analysis synthesis, synthesis binary coding letter by encoder Number, signal output is carried out by decoder;
S8: audio-frequency information processing is directly used after being modified using ready-made material or to ready-made material, and certainly by user Oneself creates;
S9: video information process is believed after editing by being acquired to video information and then to video information using video Breath.
2. according to claim 1 a kind of based on multimedia information processing method, it is characterised in that: in the step S1 The formula of sampling rate are as follows:
FS=2.5fmax (1).
3. according to claim 1 a kind of based on multimedia information processing method, it is characterised in that: in the step S2 Image enhancement includes frequency domain method and space domain method.
4. according to claim 1 a kind of based on multimedia information processing method, it is characterised in that: in the step S1 Quantization is that the sampled signal of continuous amplitude is converted into discrete time, the digital signal of discrete amplitudes, and the main problem of quantization is Quantization error.
5. according to claim 1 a kind of based on multimedia information processing method, it is characterised in that: in the step S3 Image expansion is the expansion based on obtaining being shifted with respect to the image with own origin and by opposite image;Hole Filling is using the imfill in Matlab software for bianry image holes filling, to be used to fill image-region and cavity; Region segmentation refers to that the data that are analysed to carry out region division, wherein interested data slot will extract and do further Processing, and other data are abandoned, the main purpose of region segmentation, it is the data volume for reducing subsequent processing.
6. according to claim 1 a kind of based on multimedia information processing method, it is characterised in that: in the step S4 The complex function z(t of Fourier descriptor) formula are as follows:
(2)
Wherein, t is time variable, seriesThe referred to as Fourier descriptor of curve C;
When curve distance s is useful in comparison with the time, L is length of curve, Fourier descriptorThen indicate:(3).
7. according to claim 1 a kind of based on multimedia information processing method, it is characterised in that: in the step S7 The compression coding mode of a step sound intermediate frequency signal is divided into waveform coding, analysis composite coding and mixed type coding, audio signal Frequency range is 300Hz-3400Hz.
8. according to claim 1 a kind of based on multimedia information processing method, it is characterised in that: in the step S7 Video image compressing method includes lossy compression and lossless compression in b step.
CN201910647346.3A 2019-07-17 2019-07-17 One kind being based on multimedia information processing method Pending CN110322421A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910647346.3A CN110322421A (en) 2019-07-17 2019-07-17 One kind being based on multimedia information processing method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910647346.3A CN110322421A (en) 2019-07-17 2019-07-17 One kind being based on multimedia information processing method

Publications (1)

Publication Number Publication Date
CN110322421A true CN110322421A (en) 2019-10-11

Family

ID=68123755

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910647346.3A Pending CN110322421A (en) 2019-07-17 2019-07-17 One kind being based on multimedia information processing method

Country Status (1)

Country Link
CN (1) CN110322421A (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0616456A2 (en) * 1993-02-19 1994-09-21 Canon Kabushiki Kaisha Multimedia communication system, transmitter and receiver therefor
CN1149795A (en) * 1995-11-02 1997-05-14 邝冬英 Multi media digital transmission broadcasting system
CN1492632A (en) * 2002-10-23 2004-04-28 联想(北京)有限公司 Multimedia system based on digital household network
CN2922341Y (en) * 2006-07-13 2007-07-11 中兴通讯股份有限公司 Video meeting terminal capable of realizing high-definition rideo signal input and output
CN101621294A (en) * 2009-07-29 2010-01-06 北京中星微电子有限公司 Control logical circuit and successive approximation analog-to-digital converter
CN106874888A (en) * 2017-03-13 2017-06-20 无锡亚天光电科技有限公司 A kind of feature by distributed optical fiber vibration signal pattern strengthens and signal processing method
CN106991381A (en) * 2017-03-13 2017-07-28 无锡亚天光电科技有限公司 A kind of distributed optical fiber vibration signal Recognition Algorithm based on two-dimensional matrix feature recognition

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0616456A2 (en) * 1993-02-19 1994-09-21 Canon Kabushiki Kaisha Multimedia communication system, transmitter and receiver therefor
CN1149795A (en) * 1995-11-02 1997-05-14 邝冬英 Multi media digital transmission broadcasting system
CN1492632A (en) * 2002-10-23 2004-04-28 联想(北京)有限公司 Multimedia system based on digital household network
CN2922341Y (en) * 2006-07-13 2007-07-11 中兴通讯股份有限公司 Video meeting terminal capable of realizing high-definition rideo signal input and output
CN101621294A (en) * 2009-07-29 2010-01-06 北京中星微电子有限公司 Control logical circuit and successive approximation analog-to-digital converter
CN106874888A (en) * 2017-03-13 2017-06-20 无锡亚天光电科技有限公司 A kind of feature by distributed optical fiber vibration signal pattern strengthens and signal processing method
CN106991381A (en) * 2017-03-13 2017-07-28 无锡亚天光电科技有限公司 A kind of distributed optical fiber vibration signal Recognition Algorithm based on two-dimensional matrix feature recognition

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
作业帮用户: "简述多媒体信息数字化的主要步骤以及每步的主要功能(要快!)", 《HTTPS://WWW.ZYBANG.COM/QUESTION/8A11CF73B49703A8A426CB5346824752.HTML#TOP》 *
张俊: "浅议多媒体信息处理技术", 《民营科技》 *
百度百科: "图像增强", 《HTTPS://BAIKE.BAIDU.COM/HISTORY/%E5%9B%BE%E5%83%8F%E5%A2%9E%E5%BC%BA/5199407/130451697》 *
百度百科: "形状识别", 《HTTPS://BAIKE.BAIDU.COM/ITEM/%E5%BD%A2%E7%8A%B6%E8%AF%86%E5%88%AB/20723894》 *
陈明: "《多媒体技术基础》", 31 August 2000 *

Similar Documents

Publication Publication Date Title
US11270709B2 (en) Efficient coding of audio scenes comprising audio objects
JP4063670B2 (en) Wideband signal transmission system
EP2272062B1 (en) An audio signal classifier
CN100380975C (en) Method for generating hashes from a compressed multimedia content
EP3025330B1 (en) Apparatus and method for efficient object metadata coding
US9892737B2 (en) Efficient coding of audio scenes comprising audio objects
JP6911117B2 (en) Devices and methods for decomposing audio signals using variable thresholds
CN110838894B (en) Speech processing method, device, computer readable storage medium and computer equipment
US7418393B2 (en) Data reproduction device, method thereof and storage medium
AU2006233504A1 (en) Apparatus and method for generating multi-channel synthesizer control signal and apparatus and method for multi-channel synthesizing
EP1470550A1 (en) Audio encoding and decoding device and methods thereof
US11183199B2 (en) Apparatus and method for decomposing an audio signal using a ratio as a separation characteristic
TWI281657B (en) Method and system for speech coding
CN110322421A (en) One kind being based on multimedia information processing method
Oh et al. A new spectral enhancement algorithm in MP3 audio
CN113314130B (en) Audio object coding and decoding method based on frequency spectrum movement
JP2003522981A (en) Error correction method with pitch change detection
JP2002049383A (en) Digital signal processing method and learning method and their devices, and program storage medium
CN113206773A (en) Improved method and apparatus relating to speech quality estimation
JP2003508806A (en) Transmission system with improved encoder and decoder

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20191011

RJ01 Rejection of invention patent application after publication