Embodiment
Certain exemplary embodiments of the present invention will be described in detail in conjunction with the accompanying drawings.
Fig. 1 is a diagram illustrating a network that includes a selective encoding system, a server, and a second user terminal according to the present invention.
With reference to Fig. 1, a selective encoding system 100 according to the present invention receives predetermined audio data from a network server 110. The network server 110 according to an exemplary embodiment of the present invention provides a podcast service or a Rich Site Summary (RSS) service. Accordingly, the selective encoding system 100 can receive the audio data from the network server 110 at a predetermined interval. The audio data can include music data, speech data, or broadcast data.
The selective encoding system 100 that receives the audio data analyzes the audio data and determines whether speech data is included in it. Whether speech data is included in the audio data can be determined by analyzing the data format of the audio data, and conventional techniques can be used for this purpose. For example, to identify whether the audio data consists of a human voice, a method of determining whether the sound is clipped at more than a predetermined rate can be applied. Whether speech data is included in the audio data can also be detected by checking for a predetermined pitch in the audio data, or by checking whether the frequency components of the audio data are concentrated in a predetermined frequency band. Current mobile communication terminals control the transmission band in real time using a voice activity detector (VAD), discontinuous transmission (DTX), a variable rate codec (VRC), or the like. Unlike a mobile communication terminal, which must identify speech data in real time, the selective encoding system 100 according to the present invention has more time available, so it can analyze the audio data in greater detail and over a wider range to determine whether speech data is included in the audio data.
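The two measurements named above, clipping rate and concentration of energy in a frequency band, can be sketched as follows. This is only an illustrative sketch and not the method of the invention: the function names, the 300 to 3400 Hz telephone band, and the clipping margin are assumptions chosen for the example, and a naive discrete Fourier transform is used so that the code needs nothing beyond the standard library.

```python
import math


def clipping_ratio(samples, full_scale=1.0, margin=0.99):
    """Fraction of samples at or above margin * full_scale in magnitude."""
    if not samples:
        return 0.0
    limit = margin * full_scale
    clipped = sum(1 for s in samples if abs(s) >= limit)
    return clipped / len(samples)


def band_energy_ratio(samples, sample_rate, low_hz=300.0, high_hz=3400.0):
    """Fraction of spectral energy between low_hz and high_hz.

    Uses a naive DFT (quadratic time), so it is meant for short frames only.
    The band limits are an assumption, not values taken from the description.
    """
    n = len(samples)
    total = 0.0
    in_band = 0.0
    for k in range(1, n // 2):  # skip the DC bin and stop below Nyquist
        freq = k * sample_rate / n
        re = sum(s * math.cos(2 * math.pi * k * i / n) for i, s in enumerate(samples))
        im = sum(s * math.sin(2 * math.pi * k * i / n) for i, s in enumerate(samples))
        power = re * re + im * im
        total += power
        if low_hz <= freq <= high_hz:
            in_band += power
    return in_band / total if total > 0 else 0.0
```

A frame whose band energy ratio is high could then be treated as a speech candidate; the threshold for that decision would have to be tuned on real material.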
The selective encoding system 100, having received the audio data from the server 110, determines whether speech data is included in the audio data and, when it is, encodes the audio data with a predetermined vocoder. The selective encoding system 100 according to an exemplary embodiment of the present invention can use a vocoder such as Qualcomm code excited linear prediction (QCELP), enhanced variable rate codec (EVRC), or adaptive multi-rate (AMR) speech coding.
Second audio data is produced from the audio data by encoding with the vocoder. When EVRC is applied to audio data that includes speech data, the second audio data is encoded at a bit rate of approximately 8 kbps. When speech data is not included in the audio data, and the audio data instead contains music data or song data, the selective encoding system 100 does not re-encode the audio data.
A second user terminal 120 receives the second audio data from the selective encoding system 100.
The selective encoding system 100 according to an exemplary embodiment of the present invention is implemented on a computer terminal that is supplied with the audio data through a podcast service or a similar audio content service. In this case, the selective encoding system 100 receives the audio data from the server via a wired/wireless Internet communication network. The selective encoding system 100 then either selectively encodes the audio data into the second audio data or sends the audio data as received to the second user terminal 120. Here the second user terminal 120 is a portable device such as a mobile communication terminal, an MP3 player, a PlayStation Portable (PSP), a portable multimedia player (PMP), a personal digital assistant (PDA), or an electronic organizer, and the computer terminal transfers the second audio data through a connection with the second user terminal 120.
The selective encoding system 100 according to another exemplary embodiment of the present invention is a predetermined standalone server. In this case, the selective encoding system 100 receives the audio data from the server 110 via a wired/wireless communication network and either selectively generates the second audio data from the audio data or sends the original audio data to the second user terminal 120. Here the second user terminal 120 is a mobile communication terminal, and the selective encoding system 100 wirelessly sends the second audio data to the mobile communication terminal via a data channel.
Accordingly, the selective encoding system 100 according to the present invention can improve the memory efficiency of the second user terminal 120, reduce the load on the transmission channel, and so on. Specifically, when speech data is partially or wholly included in the audio data, the selective encoding system 100 can reduce the total size of the speech data by re-encoding only the speech data at a lower bit rate.
Fig. 2 is a flowchart illustrating the operations of a method of selectively encoding audio data according to the present invention.
In operation 201, a server according to an exemplary embodiment of the present invention sends predetermined audio data to the selective encoding system. The server is a system that provides a podcast service or an RSS service. Accordingly, the selective encoding system checks the server at a predetermined interval for an updated list of audio data and, when updated audio data exists, requests that the audio data be transmitted.
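The periodic check for updated audio data can be sketched as a comparison between the enclosures listed in a feed and the set of items already fetched. The sketch below assumes an RSS 2.0 style feed in which each item carries an enclosure element with a url attribute; the function name and the idea of tracking already-seen URLs in a set are assumptions for illustration, not details taken from the description.

```python
import xml.etree.ElementTree as ET


def new_enclosures(rss_xml, seen_urls):
    """Return enclosure URLs in the feed that are not in seen_urls.

    rss_xml is the feed document as a string; seen_urls is any container
    of URLs that have already been downloaded (hypothetical bookkeeping).
    """
    root = ET.fromstring(rss_xml)
    found = []
    for item in root.iter("item"):
        enclosure = item.find("enclosure")
        if enclosure is None:
            continue  # item carries no audio attachment
        url = enclosure.get("url")
        if url and url not in seen_urls:
            found.append(url)
    return found
```

A scheduler would call this function once per predetermined interval and request each returned URL from the server.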
In operation 202, the selective encoding system receives the audio data from the server and analyzes its data format. The audio data includes data such as broadcasts, music, songs, and speech. Based on the data format, a characteristic of the audio data, that is, whether it has a particular attribute, is determined by frequency band analysis, pitch detection, checking whether the sound is clipped, and the like. The characteristic of the audio data is determined using the conventional techniques described above.
In operation 203, the selective encoding system determines, based on the analysis of the data format, whether speech data is included in the audio data. The selective encoding system makes this determination by frequency band analysis, pitch detection, checking whether the sound is clipped, and the like. Here, each part of the audio data is given an index, and whether the part at each index contains speech data is recorded in a predetermined memory device.
When, as a result of the data format analysis in operation 203, speech data is not included in the audio data, the process branches to operation 206 and the selective encoding system sends the audio data to the second user terminal.
When speech data is partially or wholly included in the audio data, the selective encoding system, in operation 204, encodes only the part of the audio data that corresponds to the speech data with a predetermined vocoder and, in operation 205, generates the second audio data.
The selective encoding system according to an exemplary embodiment of the present invention encodes, with the vocoder, a predetermined part of the audio data that corresponds to the speech data. For example, when a middle section of the audio data corresponds to speech data, the selective encoding system encodes only that middle section with the vocoder, and generates the second audio data by inserting identifying information, such as a predetermined flag or index information, at the start position of the middle section, or by restructuring conversion information such as vocoder information. Specifically, when the audio data contains speech data in some parts and music data in others, the second audio data has a different bit rate for each section. For example, the part of the audio data corresponding to speech data can be encoded at a bit rate of 8 kbps, and the part corresponding to music data can be encoded at a bit rate of 128 kbps.
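The effect of assigning a bit rate per section can be illustrated with a small size estimate. This is a sketch under stated assumptions: the Segment type and both function names are hypothetical, 1 kbps is taken as 1000 bits per second, and container overhead is ignored. Only the 8 kbps and 128 kbps figures come from the description above.

```python
from dataclasses import dataclass

SPEECH_KBPS = 8    # speech sections, as in the example above
MUSIC_KBPS = 128   # music sections, as in the example above


@dataclass
class Segment:
    index: int        # position of the section within the audio data
    seconds: float    # duration of the section
    is_speech: bool   # result of the speech/music decision


def plan_bitrates(segments):
    """Return (index, kbps) pairs, one per section."""
    return [(s.index, SPEECH_KBPS if s.is_speech else MUSIC_KBPS) for s in segments]


def estimated_bytes(segments):
    """Estimate the encoded size in bytes, ignoring container overhead."""
    total_bits = sum(
        s.seconds * 1000 * (SPEECH_KBPS if s.is_speech else MUSIC_KBPS)
        for s in segments
    )
    return int(total_bits // 8)
```

Under these assumptions, one minute of music followed by one minute of speech comes to about 1.02 MB, compared with about 1.92 MB if both minutes were kept at 128 kbps.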
When speech data makes up more than a predetermined ratio of the audio data, the selective encoding system according to an exemplary embodiment of the present invention can encode the entire audio data at the bit rate corresponding to speech data. The predetermined ratio can be set by a developer or an operator of the selective encoding system.
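The three outcomes described so far (pass the audio data through, encode only the speech parts, or encode everything at the speech bit rate) can be sketched as a single decision function. The function name, the returned labels, and the default threshold of 0.7 are assumptions for illustration; the description leaves the ratio to the developer or operator.

```python
def choose_encoding(segments, speech_ratio_threshold=0.7):
    """Pick an encoding strategy from (seconds, is_speech) pairs.

    The threshold is a hypothetical operator setting, not a value
    given in the description.
    """
    total = sum(seconds for seconds, _ in segments)
    speech = sum(seconds for seconds, is_speech in segments if is_speech)
    if total <= 0 or speech == 0:
        return "pass_through"               # no speech: send audio data unchanged
    if speech / total > speech_ratio_threshold:
        return "encode_all_at_speech_rate"  # speech dominates the audio data
    return "encode_speech_parts_only"       # mixed content
```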
In operation 206, the selective encoding system sends the generated second audio data to the second user terminal.
The selective encoding system according to an exemplary embodiment of the present invention can be implemented on a user's computer terminal, and the second user terminal can be a portable device such as a mobile phone, a PDA, an electronic organizer, a PMP, a PSP, or an MP3 player. This exemplary embodiment is described in detail with reference to Fig. 3.
Fig. 3 is an exemplary diagram of a network that includes a selective encoding system, a server, and a second user terminal according to the present invention.
With reference to Fig. 3, a selective encoding system 300 can be implemented on a computer terminal 310. Specifically, the selective encoding system 300 is a predetermined application software program or hardware located in the computer terminal 310. A server 301 sends the audio data to the computer terminal 310 at a predetermined interval via a network 302, based on the podcast service or the RSS service. The network 302 can be regarded as a wired/wireless network that gives the computer terminal 310 network communication capability. The computer terminal 310, having received the audio data via the network 302, determines in the selective encoding system 300 whether speech data is included in the audio data. When speech data is included in the audio data, the selective encoding system 300 generates the second audio data by encoding the audio data with the vocoder. When the second user terminal is connected to the computer terminal 310, the computer terminal 310 sends the second audio data generated by the selective encoding system 300 to the second user terminal. The second user terminal is a portable device with a predetermined memory device, for example an MP3 player 304, a mobile communication terminal 305, or a game console 306.
The second user terminal is connected to the selective encoding system 300 through a short-range communication module such as a USB module, an RS-232C module, or a Bluetooth module, and the selective encoding system 300 sends the second audio data to the second user terminal upon recognizing the connection of the second user terminal.
The selective encoding system according to another exemplary embodiment of the present invention is a predetermined standalone server, and the second user terminal is the mobile communication terminal. This exemplary embodiment is described in detail with reference to Fig. 4.
Fig. 4 is an exemplary diagram of a network that includes a selective encoding system, a server, and a second user terminal according to the present invention.
With reference to Fig. 4, a selective encoding system 400 receives predetermined audio data from a server 401 via a network 402. In this case, the network 402 can be understood broadly as including all wired/wireless communication networks.
As in the exemplary embodiment of Fig. 3, the selective encoding system 400 that receives the audio data determines whether speech data is included in the audio data and, when it is, generates the second audio data by encoding the audio data with the predetermined vocoder. The generated second audio data is sent to the second user terminal through a network 403. The second user terminal is a mobile communication terminal 404, and the network 403 includes a wireless communication network containing a predetermined communication provider system.
Specifically, the selective encoding system 400 requests the communication provider system to set up a channel to the mobile communication terminal 404. The communication provider system then sets up a wireless channel between the selective encoding system 400 and the mobile communication terminal 404, and the selective encoding system 400 wirelessly sends the second audio data to the mobile communication terminal 404 over that wireless channel. In addition, the mobile communication terminal 404 according to an exemplary embodiment of the present invention queries the selective encoding system 400 at a predetermined interval as to whether there is second audio data that has not yet been transmitted and, when such second audio data exists, requests the selective encoding system 400 to transmit it. As a result, by effectively reducing the size of the audio data, the selective encoding system 400 according to the present invention can reduce the memory usage of the mobile communication terminal 404 and can also reduce the load on the transmission channel of the mobile communication network.
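The terminal-initiated variant, in which the terminal periodically asks whether untransmitted second audio data exists, can be sketched on the system side as a per-terminal queue. The class and method names are hypothetical, and the sketch says nothing about how the wireless channel itself is established.

```python
from collections import defaultdict, deque


class PendingStore:
    """Holds second audio data that has not yet been sent to each terminal."""

    def __init__(self):
        self._pending = defaultdict(deque)

    def add(self, terminal_id, second_audio):
        """Queue newly generated second audio data for a terminal."""
        self._pending[terminal_id].append(second_audio)

    def poll(self, terminal_id):
        """Return the oldest pending item for the terminal, or None."""
        queue = self._pending.get(terminal_id)
        if not queue:
            return None
        return queue.popleft()
```

The terminal would call poll once per predetermined interval and stop requesting transfers as soon as it receives None.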
Referring again to Fig. 2, in operation 207, the second user terminal decodes the second audio data based on the conversion information and provides the second audio data to the user through a predetermined speaker device.
The selective encoding system according to an exemplary embodiment of the present invention contains a user database that holds user information on at least one user. The user information includes identifying information of the second user terminal corresponding to the user; telephone number information is one example of such identifying information. Specifically, the selective encoding system reads the user information corresponding to the second user terminal by referring to the user database, and wirelessly sends the generated second audio data to the second user terminal based on the identifying information in that user information. In this case, the second user terminal is a mobile communication terminal such as a mobile phone.
Fig. 5 is a diagram illustrating an audio data format and second audio data according to an exemplary embodiment of the present invention.
With reference to the part denoted by numeral 501 in Fig. 5, the audio data according to an exemplary embodiment of the present invention is 'A.MP3'. 'A.MP3' includes a plurality of playlist items, and the selective encoding system identifies whether speech data is included in the audio data by analyzing each playlist item. For example, 'A.MP3' is an audio broadcast that includes an announcer's narration data and music data. As a result of analyzing the playlist, the selective encoding system determines that 'A1' and 'A3' are music data and that 'A2' and 'A4' are the announcer's narration data. The selective encoding system then encodes 'A2' and 'A4', judged to be narration data, with the predetermined vocoder, and handles 'A1' and 'A3', judged to be music data, separately. Specifically, the selective encoding system analyzes each piece of audio data classified by playlist item and, based on the result, performs different encoding for each playlist item. In this case, the second user terminal needs a function for playing back each item based on the playlist. Handling the audio data as in the part denoted by numeral 501 prevents audio data that includes speech data from being misjudged as music data or song data.
With reference to the part denoted by numeral 502 in Fig. 5, the selective encoding system removes the playlist from the part denoted by numeral 501, inserts conversion information related to the encoding into each playlist item, and recombines the playlist items to form a single piece of audio data. In the case of the part denoted by numeral 502, predetermined software capable of decoding audio data encoded by a plurality of encoders is required. Because such software is an open and commonly shared technology, a detailed description is omitted.
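One way to picture the recombined form is a single byte stream in which every section is preceded by its own conversion information. The layout below (a two-byte header length, a JSON header naming the codec and payload length, then the payload) is purely an assumption for illustration; the description does not specify how the conversion information is stored.

```python
import json
import struct


def pack_segments(segments):
    """Join (codec_name, payload_bytes) pairs into one byte string."""
    out = bytearray()
    for codec, payload in segments:
        header = json.dumps({"codec": codec, "length": len(payload)}).encode("utf-8")
        out += struct.pack(">H", len(header))  # header size, big-endian
        out += header                          # conversion information
        out += payload                         # encoded section
    return bytes(out)


def unpack_segments(blob):
    """Recover the (codec_name, payload_bytes) pairs from pack_segments output."""
    segments = []
    pos = 0
    while pos < len(blob):
        (header_len,) = struct.unpack_from(">H", blob, pos)
        pos += 2
        header = json.loads(blob[pos:pos + header_len].decode("utf-8"))
        pos += header_len
        payload = blob[pos:pos + header["length"]]
        pos += header["length"]
        segments.append((header["codec"], payload))
    return segments
```

A player on the second user terminal could read each header in turn and hand the payload to the matching decoder.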
Fig. 6 is a block diagram illustrating the internal configuration of a selective encoding system according to an exemplary embodiment of the present invention.
With reference to Fig. 6, a selective encoding system 600 according to an exemplary embodiment of the present invention includes a receiving unit 601, a converting unit 602, and a transmitting unit 603.
The receiving unit 601 receives audio data from a predetermined server. The server is a general server that provides audio data such as speech, music, songs, and broadcasts. The audio data includes both encoded data and unprocessed data.
The converting unit 602 determines whether speech data is included in the audio data by analyzing the data format of the audio data received by the receiving unit 601 and, when speech data is included, generates second audio data by encoding the audio data with a predetermined vocoder. The converting unit 602 according to an exemplary embodiment of the present invention divides the received audio data into a plurality of pieces of data based on a predetermined list and determines whether each piece is speech data. Differentiated encoding is then applied to the pieces, and each piece is individually generated as second audio data. In this case, the second audio data includes conversion information about the vocoder and the encoding.
The converting unit 602 according to an exemplary embodiment of the present invention generates the second audio data from the audio data with a particular encoder in accordance with a user instruction. The user can set, based on the user's preferences, which audio data is to be encoded or which particular encoder is to be used to encode it into the second audio data. For example, depending on the storage capacity of the second user terminal, the user can set music data or song data to be encoded with the vocoder as well.
The transmitting unit 603 sends the generated second audio data to the second user terminal.
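The division of work among the three units can be sketched as a small class that wires together a receiving step, a converting step, and a transmitting step. The class name, the constructor parameters, and the use of plain callables are assumptions made for the example; the real units could equally be hardware blocks.

```python
class SelectiveEncodingPipeline:
    """Minimal stand-in for units 601 (receive), 602 (convert), 603 (transmit)."""

    def __init__(self, receive, contains_speech, encode, transmit):
        self._receive = receive                  # returns audio data
        self._contains_speech = contains_speech  # audio data -> bool
        self._encode = encode                    # audio data -> second audio data
        self._transmit = transmit                # sends data to the terminal

    def run_once(self):
        """Process one piece of audio data and return what was transmitted."""
        audio = self._receive()
        if self._contains_speech(audio):
            outgoing = self._encode(audio)  # speech present: re-encode
        else:
            outgoing = audio                # no speech: send unchanged
        self._transmit(outgoing)
        return outgoing
```

Passing the detection and encoding steps in as callables keeps the sketch independent of any particular vocoder library.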
The selective encoding system 600 according to an exemplary embodiment of the present invention is included in a predetermined computer terminal in the form of an application software program or hardware. Specifically, the receiving unit 601 receives the audio data from a predetermined server via a wired/wireless Internet communication network, and the converting unit 602 determines whether speech data is included in the audio data and, when it is, generates the second audio data by encoding the audio data with the vocoder. Then, when the second user terminal is connected via a short-range communication module such as a USB module, an RS-232C module, an ultra-wideband (UWB) module, a Bluetooth module, or a wireless local area network (LAN), the transmitting unit 603 sends the second audio data to the second user terminal.
The selective encoding system 600 according to another exemplary embodiment of the present invention is a predetermined standalone server. In this case, the receiving unit 601 receives the audio data from the server via a wired/wireless communication network, and the converting unit 602 generates the second audio data depending on whether speech data is included in the audio data. The transmitting unit 603 then wirelessly sends the second audio data to the second user terminal. The second user terminal is a predetermined communication terminal, which includes a mobile communication terminal, a public switched telephone network (PSTN) terminal, a voice over Internet protocol (VoIP) terminal, a session initiation protocol (SIP) terminal, a media gateway control (Megaco) terminal, a personal digital assistant (PDA), a mobile phone, a personal communication service (PCS) phone, a hand-held PC, a code division multiple access (CDMA)-2000 (1X, 3X) phone, a wideband CDMA (WCDMA) phone, a dual band/dual mode phone, a global system for mobile communications (GSM) phone, a mobile broadband system (MBS) phone, a satellite/terrestrial digital multimedia broadcasting (DMB) phone, and the like.
The selective encoding system 600 according to an exemplary embodiment of the present invention further includes a user database 604 and a database management unit 605.
The user database 604 contains user information on at least one user. The user information includes identifying information of the second user terminal corresponding to the user. The database management unit 605 reads the user information corresponding to the second user terminal by referring to the user database 604, and controls the transmitting unit 603 so that the second audio data is wirelessly sent to the second user terminal based on the identifying information in that user information.
For example, before wirelessly sending the second audio data to the second user terminal, the transmitting unit 603 parses the user database 604 and reads the predetermined user information. The user information includes the identifying information, for example the telephone number information of the second user terminal, and the transmitting unit 603 sends the second audio data to the second user terminal based on that identifying information.
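The lookup of identifying information can be sketched with an in-memory SQLite table. The table name, column names, and function names are all hypothetical; the description only states that the user information includes identifying information such as a telephone number.

```python
import sqlite3


def open_user_db():
    """Create an in-memory user database with a hypothetical schema."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE users (user_id TEXT PRIMARY KEY, phone_number TEXT NOT NULL)"
    )
    return conn


def add_user(conn, user_id, phone_number):
    """Store the identifying information of a user's terminal."""
    conn.execute("INSERT INTO users VALUES (?, ?)", (user_id, phone_number))


def terminal_identifier(conn, user_id):
    """Return the phone number registered for user_id, or None if unknown."""
    row = conn.execute(
        "SELECT phone_number FROM users WHERE user_id = ?", (user_id,)
    ).fetchone()
    return row[0] if row else None
```

The transmitting step would call terminal_identifier before sending and skip the transfer when it returns None.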
Fig. 7 is an internal block diagram of a general-purpose computer that can be used to carry out the selective encoding method according to the present invention.
A computer device 700 includes at least one processor 710 connected to a main memory device that includes a random access memory (RAM) 720 and a read only memory (ROM) 730. The processor 710 is also referred to as a central processing unit (CPU). As is well known in the technical field, the ROM 730 transfers data and instructions to the CPU unidirectionally, and the RAM 720 is generally used to transfer data and instructions bidirectionally. The RAM 720 and the ROM 730 can include any suitable form of computer-readable recording medium. A mass storage device 740 is connected bidirectionally to the processor 710, provides additional data storage capacity, and can be any of a variety of computer-readable recording media. The mass storage device 740 is used to store programs, data, and the like, and is typically a secondary storage device, such as a hard disk, that is slower than the main memory device. A specific mass storage device such as a CD-ROM 760 can also be used. The processor 710 is connected to at least one input/output interface 750, such as a video monitor, a trackball, a mouse, a keyboard, a microphone, a touch-screen display, a card reader, a magnetic or paper tape reader, a voice or handwriting recognizer, a joystick, or another known computer input/output device. The processor 710 can be connected to a wired/wireless communication network through a network interface 770. The steps of the method described above can be carried out through this network connection. The devices and tools described above are well known to those skilled in the fields of computer hardware and software.
The hardware device described above can be configured to run one or more software components in order to carry out the operations of the present invention.
Although the present invention has been shown and described with reference to certain exemplary embodiments thereof, it should be understood by those skilled in the art that various changes in form and detail can be made without departing from the spirit and scope of the present invention as defined by the claims.
Industrial Applicability
One aspect of the present invention provides a method and system that improve the usage efficiency of the memory device of a portable device that stores audio data.
Another aspect of the present invention provides a method and system that reduce the load on a wireless communication network by selectively encoding audio data in an encoding system based on the characteristics of the audio data and sending the audio data to a second user terminal through the wireless communication network.