WO2014085868A1

WO2014085868A1 - System and method for insertion of media into a voice channel

Info

Publication number: WO2014085868A1
Application number: PCT/AU2013/001423
Authority: WO
Inventors: Robert Mark KING
Original assignee: King Robert Mark
Priority date: 2012-12-07
Filing date: 2013-12-06
Publication date: 2014-06-12

Abstract

A system for insertion of media into voice channel the system including a first input and output apparatus for receiving voice input from a first user and delivering voice output from a second or further user, at least one second input and output apparatus for receiving voice input from a second user and delivering voice output from a first or further user, a voice channel established between the first input and output apparatus and the at least one second input and output apparatus and over which two way communication between the first user and second or further users can take place, an access device to access a stored media file containing media for insertion into the voice channel and an insertion mechanism to insert at least a portion of the stored media file into the voice channel established for contemporaneous delivery whilst the voice channel is established.

Description

SYSTEM AND METHOD FOR INSERTION OF MEDIA INTO A VOICE CHANNEL TECHNICAL FIELD

[0001] The present invention relates generally to systems and methods of voice

communication over distance and particularly to systems and methods of voice communication with media inserted into the voice channel for contemporaneous delivery to a user.

BACKGROUND ART

[0002] Typically, a device such as a mobile telephone includes an antenna for receiving and transmitting radio frequency signals, a radio frequency signal processing part for converting incoming analogue signals into digital signals using an analogue to digital converter and for converting outgoing digital signals into analogue signals using a digital to analogue converter. The mobile telephone also typically includes a modem or similar device for controlling the mobile telephone, a coder-decoder for coding and decoding the digital signals, a speaker for outputting voice and a microphone for inputting user's voice.

[0003] Normally, once the user's voice is input into the microphone, it is processed by sampling process and digitalized in the coder-decoder using a pulse code modulation and then the modulated signal is sent to the mobile phone modem. Consequently, the mobile phone modem and codes the modulated signals using an algorithm at a particular data rate and then sends the data to the radio frequency processing part. The radiofrequency processing part receives the data from the mobile telephone modem on a carrier wave in order to transmit the data to a remote location using the appropriate mobile phone transmission technology.

[0004] Incoming signals are decoded in reverse order of the outgoing signal process so that they can be delivered through the speaker or an earphone by counterpart mobile telephone.

[0005] In order to make the communication experience more relaxed and enjoyable, or to provide the possibility of providing entertainment and/or advertising during the communication experience, insertion of media into a voice channel during the conversational communication but at a background level is proposed.

[0006] It will be clearly understood that, if a prior art publication is referred to herein, this reference does not constitute an admission that the publication forms part of the common general knowledge in the art in Australia or in any other country. SUMMARY OF INVENTION

[0007] The present invention is directed to a system and method for insertion of media into voice channel, which may at least partially overcome at least one of the abovementioned disadvantages or provide the consumer with a useful or commercial choice.

[0008] With the foregoing in view, the present invention in one form, resides broadly in a system for insertion of media into voice channel the system including

(a) a first input and output apparatus for receiving voice input from a first user and delivering voice output from a second or further user,

(b) at least one second input and output apparatus for receiving voice input from a second user and delivering voice output from a first or further user,

(c) a voice channel established between the first input and output apparatus and the at least one second input and output apparatus and over which two way communication between the first user and second or further users can take place;

(d) an access device to access a stored media file containing media for insertion into the voice channel; and

(e) an insertion mechanism to insert at least a portion of the stored media file into the voice channel established for contemporaneous delivery whilst the voice channel is established.

(0009) In an alternative form, the present invention resides in a method for insertion of media into voice channel, the method including the steps of

(a) receiving voice input from a first user and delivering voice output from a second or

further user using a first input and output apparatus,

(b) receiving voice input from a second user and delivering voice output from a first or

further user using at least one second input and output apparatus,

(c) establishing a voice channel between the first input and output apparatus and the at least one second input and output apparatus and over which two way communication between the first user and second or further users can take place;

(d) accessing an access device to access a stored media file containing media for insertion into the voice channel; and

(e) inserting at least a portion of the stored media file into the voice channel established for contemporaneous delivery whilst the voice channel is established using an insertion mechanism.

[0010] The voice channel used will typically be electronically implemented voice channel created or accessed between at least two electronic devices. The voice channel may be on a one- to-one basts, a one to many basts, many to one basis or many to many basis.

[0011 J Any type of device or apparatus may be used as a part of the system of the present invention. Typically however devices having voice channel capability are used, the most common of which are a telephone, whether mobile or fixed, or radio or similar device that allows access to a telephony or wireless network.

[0012J Further, the media can be of any type. Typically, the media will be or include music or similar to be played or otherwise delivered in the background of a voice communication.

[0013] Typically, the media will be delivered when a user activates or authorises the establishment of a voice channel. That is, the media will preferably begin before the voice channel is established if the user is activating an outgoing transmission, while the user is waiting for the establishment of the voice channel, and is deli vered to the recipient that soon as the voice channel is established.

[001 ] The present invention preferably operates on a number of different levels having elements of the invention specific to each of the levels. It is therefore convenient to deal with the invention in the context of those levels.

[0015] The preferred levels include:

• A physical level;

• A data link level; and

• A signal level.

[0016] The physical level of the present invention defines electrical and physical specifications for devices which are used in according to the invention. In particular, it defines the relationship between a device and a transmission medium, such as a copper or fibre optical cable or wireless network. This includes the voltages, line impedance, cable specifications, signal timing, repeaters, network adapters, and other parameters that relate to the physical transmission of a signal between two or more communications devices.

[0017] The major functions and services performed by the physical level, that is the hardware or devices that form a part of the system of the invention, are:

• Establishment and termination of a connection to a communications medium.

• Participation in the process whereby the communication resources are effectively shared among multiple users, for example, contention resolution and ilow control.

• Modulation or conversion between the representation of digital data in user equipment and the corresponding signals transmitted over a communications channel. These are signals operating over the physical cabling (such as copper and optical fibre) or over a wireless network.

|0018| Typically, the physical level will include devices or components upon which the system functions or operates or those that are required to implement the present. Normally, one or more communications devices are used. The communications devices may be a mobile or fixed device. The com unications devices typically have access to one or more communications pathways. One or more of the communications pathways can be wireless or hardwired or a combination of these. For example, a wireless communication network or channel may be established using one or more communications devices accessible by user, one or more base or repeater stations and one or more communications devices accessible by a recipient. In the hardwired configuration, normally a user will have access to one or more communications devices which will be hardwired to an exchange or similar and the exchange will be hardwired to one or more communications devices accessible by a recipient.

10019| As mentioned above, normally the communications devices used in the present invention will be a telephone or radio or another device allowing voice transmission and receipt of such as a computing device or similar.

[0020J The syste of the present invention may be implemented using devices with computer processors such as telephones, portable or desktop computers, personal data storage devices, MP3 and MP4 players, audio-visual storage and display devices, televisions and the like. Provided that a device that can access communications pathways for the transmission of a signal, that device may be used according to the system of the present invention.

[0021 ] Preferably, the system of the present invention will be implemented across any network and will typically be network independent, with no network prevented from access to the system but no network preferred either.

[00221 The preferred communications devices used to implement the present invention typically include a memory which stores one or more instructions and an associated computer processor in order to process and implement the instructions stored in memory. In addition to the instructions, the memory may also store the media to be inserted. Typically, where provided in the same memory as the instructions, the media is stored in a different portion of the memory. Alternatively, the media may be stored separately from the memory storing the instructions. Further, the memory storing the media may not be on board the communications device but instead may be accessed from a remote store of media. Typically however, speed will be normally be increased if the media is stored on the communications device of one or more of the users.

[0023] Preferably, a portion of the instructions stored in the memory of the communications device will be capable of producing an interface on the device which allows control and operation of the in vention. The interface will typically be an interface which is produced and displayed according to instructions stored in memory and which is updated when the information related to the interface is changed or updated.

1002 1 According to a particularly preferred embodiment of the present invention, the user of a particular communications device will set preferences according to which the processor operates in relation to implementation of the invention. In particular, the user of a particular communications device will normally set one or more generic preferences and/or one or more specific preferences. Generally, the user will have a set of generic preferences which are applied to all transmissions if no specific preferences are set. However if specific preferences are set in relation to one or more transmissions, then the specific references are applied.

[0025] The generic preferences will normally include a type of media to be transmitted for example, whether it is sound only transmitted or sound and vision, or vision only or any combination of different types of media. Another generic preference which may be set can be the genre of the media allowing the user to define that media of a particular genre should be transmitted.

[0026] A media database will preferably be provided either on a communications device or remotely and containing pieces of media with each piece of media having one or more media profile identifiers. The user may define particular pieces of media to be transmitted either as part of the generic preferences or specific preferences or the user can set parameters allowing a software application to select media from a larger pool of media either at random, or according to a particular design.

[0027] The pieces of media (which may be simply referred to as "media") may have any form including music, audio, video, books, still images, snippets, or portions of the above. Basically, the media can have any form provided that the content is electronic or digital in order that the media can be transmissible using the system between users. Normally, each piece of media will be stored in an electronic file and it is this electronic file which wiil typically be capable of transmission, dissemination or copying or the like onto a physical medium.

[0028] The media will typically be provided from any source. For example, an entertainment studio, TV network, Internet social networking systems or record labels may choose to provide media to the system. Alternatively, smaller organisations or individuals may choose to provide content directly to the system such as home movies, demo music tracks or the like.

[0029] The media may be provided directly to the system or indirectly.

[0030] Each piece of media will also have one or more media profile identifiers which will typically be referred to as "tags". Preferably, the tags will identify the media allowing the user to set parameters by which media can be identified.

[0031 ] Each piece of media will typically be tagged according to its "type". Each piece of media will preferably be tagged according to its Genre, Mood, Style and/or Theme. For example, the Genre tags will normally be those used in a particular industry such as music genre including pop, classical, rock, rhythm and blues, house/techno, folk and the like. Each of these broad genre categories may include at one or more sub-genre. There are also typically genre associated with video games, television, film, literature and the like. Therefore, each piece of media will typically be tagged according to its broad type and then further tagged according to genre. Normally, the genre tagging will also fit with the accepted genres in industry.

[0032] Mood tagging may be used to represent the relatively long-lasting, effective or emotional state that the content offers for example fun, cheerful, humorous, gentle, scary, thought provoking, reflective and the like.

[0033] Style tagging will preferably identify the pieces of media for composition or format. Genres have been used to identify "style", but the system of the present invention will typically tag the media with more complexity. For example, a song may be a rock genre song but maybe further classified using style tagging as modern rock or contemporary rock and may indicate the basic style of the medi .

[0034] Theme tagging will typically identify the broad idea, message or lesson conveyed in the piece of media. For example the piece of media may be comforting, relaxing, suggestive or the like. [0035] Importantly, tagging systems have been used conventionally but the tagging systems vary depending upon the provider of the media and the various entertainment types. The system of the present invention will typically provide or utilise a standard for media tagging.

[0036] Preferably, each piece of media will be initially tagged upon uploading or the first provision of that piece of media to the system. There may be an analysis process upon uploading in order to check to see whether a particular piece of media which has.been submitted is not already present on the system or in the memory accessible by the communications device.

[0037] Alternatively, the media may be chosen by contextual data. For example, the media may be chosen according to contextual data related to the brand, content, genre, theme, country, city, age group or gender as examples. The media may be chosen according to user contextual data which relates to their target demographic consumer such as country, city, postcode, page, favourite pastimes, or the like. Still further, the media may be chosen at according to media contextual data such as by label, publisher, artist, mood, or through one or more restrictions such as a age restrictions, or the like.

[0038] Alternatively or in addition thereto, the media may be chosen according to geographical and/or demographic popularity. According to the selection criteria, the user may review the popularity of media by geographical demographic segment and select media accordingly. Normally, a user will set specific preferences for one or more third parties.

[0039] Other preferences can be set as welt such as the delivery volume although this may be adjustable whilst the voice channel between users is active.

[0040] Preferably, the generic preferences applied to all transmissions utilising the system of the invention except those in relation to which specific preferences have been set. Alternatively, the generic preferences for a particular communications device may be set by the manufacturer or seller of the device and may be adjustable by user.

[0041] A user can also preferably set specific preferences. Normally, the specific preferences will be set using similar parameters to the generic preferences. However, additional preferences may be used for the specific preferences. Normally, specific preferences are set in relation to specific third parties or groups of third parties with which the user may contact more regularly or alternatively those to which a user may have a particular relationship. Normally, the specific preferences are stored for use when an outgoing transmission is made or an incoming transmission is received from one or more specified third parties. [0042] The preferences are typically set using the interface as described above. The interface may allow the setting of preferences using data entry fields allowing input of preference data into the memory used according to the present invention or alternatively, a number of selection options may be presented to a user allowing the user to select from one or more options to set the preference data.

[0043] The activation of outgoing transmissions will normally also be activated using the interface. Similarly, receipt of incoming transmissions will normally be governed by the preference information but typically, there will be a trigger to accept an incoming transmission which requires action by the user before the voice channel is established.

[0044| In practice, the preferred embodiment of the present invention will operate as follows: the user will use the communications device and specifically, the interface on the communications device to indicate that they wish to make an outgoing transmission to a particular recipient. The system will then check to see whether the recipient is a recognised the third-party or not and if a recognised third party, will check to see whether specific preferences have been set up for the recognised third party, if the recipient is a recognised third party without specific preferences or if the recipient is not a recognised third party, then the system of the present invention will locate the generic preferences and enable them for use in the transmission. As mentioned above, if the recipient is a recognised third party with specific preferences, those specific preferences will be located and enabled for use within the transmission. Once the preferences have been located and enabled, the voice channel will then typically be established between the communications device of the first user and a communications device of a second or further user with the applicable preferences for use during the transmission.

[0045] If the user is receiving an incoming transmission, normally, the same process will be followed in that the system will check to see whether the transmission sender ears a recognised third party or not and if a recognised third party, will check to see whether specific preferences have been set up for the recognised third party. If the transmission sender is a recognised third party without specific preferences or if the transmission sender is not a recognised third party, then the system of the present invention will locate the generic preferences and enable them for use in the transmission. As mentioned above, if the transmission sender is a recognised third party with specific preferences, those specific preferences will be located and enabled for use within the transmission. Once the preferences have been located and enabled, the voice channel will then typically be established between the communications device of the first user and a communications device of a second or further user with the applicable preferences for use during the transmission.

[0046] Normally, the preferences are enabled and the media is inserted into the voice channel for the duration of the call and the media insertion into the voice channel ceases when the voice channel is closed.

[0047J The data link level preferably provides the functional and procedural means to transfer signals and/or data between entities using the invention and to detect and possibly correct errors that may occur in the physical level.

[0048] Preferably, the data link level of the invention will govern and control the conditions of establishing and maintaining the voice channel and the rules for use of the voice channel between the communications devices and any other communications components.

[0049] Normally, the voice channel may be transmitted or established on any band or wavelength using any mechanism and the invention is not limited by the band, wavelength or mechanism of establishment or maintenance of the voice channel.

[0050] It is to be noted that many of the functional and procedural rules in relation to the data link level of the present invention are set and maintained by service providers providing the particular channel and normally, the invention will utilise those rules and operate accordingly. In other words, the data link level and the rules applicable to the data link level will normally be those piggybacked from existing services.

[0051] Typically, one of the communications devices used in the present invention initiates the establishment of a voice channel by communication of an invitation to a second or other communication device and a voice channel is only actually established after the one or more recipient users except the invitation through operation of their communication device.

[0052] Typically, the media will be deli vered to the initiator of the establishment of the voice channel at initialisation or issuance of the invitation and will begin to be inserted into the voice channel such that it is delivered to the recipient(s) as soon as the invitation is accepted.

[0053] Normally, a voice channel may be established and delivered through, on one or more waypoints or transmission steps or locations between the users.

[0054] As mentioned above, any channel, band or bandwidth range may be used. Typically, a voice channel established in a telephony sense is established as a fixed frequency or band. The frequency or band may differ if the voice channel is established over a landline or a mobile or wireless communications pathway. The media may be inserted into the voice channel in any way using any mechanism. For example, the media may be inserted into the voice channel by integration with the existing voice transmission or into or over a carrier transmission.

[0055] Although deal with in more detail at the signal level, different layers may be provided in a single transmission, one for voice transmission and one for media transmission with the layers capable of being separated or read separately by the communications device. Preferably, the voice transmission and the media transmission will be provided in a single transmission but maintained separately within that single transmission in order to be transmitted on the same channel.

[0056] The data link level can preferably keep track of the signals and retransmit those that fail. The data link level also typically ensures the acknowledgement of successful transmission and sends the next data if no errors occur.

[0057] The signal level is the level at which the features or properties of the actual signal transmitted and received are defined. Preferably, the properties of the signal that is transmitted across the voice channel are provided and defined. Any type or configuration of signal can be used. The signal can be analogue or digital. The signal can be a simple signal, a layered signal, a composite signal or a conjugated signal. Preferably, the signal will have hierarchical layering or modulation allowing users with more complex communications devices to access higher levels of the signal than those with simpler communications devices. This may give access to additional portions of the signal upon which other preferences or more complex media may be transmitted.

[0058] Typically, a signal in electrical form is made by a transducer that converts the signal from, whatever is, its original form to a waveform expressed as a current (1) or a voltage (V), or an electromagnetic waveform, for example, an optical signal or radio transmission. Once expressed as an electronic signal, the signal is available for further processing by electrical devices such as electronic amplifiers and electronic filters, and can be transmitted to a remote location by electronic transmitters and received using electronic receivers. Typically, the voice and the media to be transmitted will be combined to produce a signal which can then be transmitted.

[0059] Multiplexing technologies may be used according to the present invention.

Multiplexing technology is divided into several types, all of which have significant variations: space-division multiplexing (SDM), frequency-division multiplexing (FDM), time-division multiplexing (TDM), polarization-division multiplexing, orbital angular momentum multiplexing and code division multiplexing (CDM) and any of which may be used. Variable bit rate digital bit streams may be transferred efficiently over a fixed bandwidth channel by means of statistical multiplexing, for example packet mode communication. Packet mode communication is an asynchronous mode time-domain multiplexing which resembles time-division multiplexing.

[0060] Digital bit streams can be transferred over an analogue channel by means of code- division multiplexing (CDM) techniques such as frequency-hopping spread spectrum (FHSS) and direct-sequence spread spectrum (DSSS).

[0061 ] In wireless communications, multiplexing can also be accomplished through alternating polarization (horizontal/vertical or clockwise/counter clockwise) on each adjacent channel and satellite, or through phased multi-antenna array combined with a multiple-input multiple-output communications (MIMO) scheme.

[0062] There are some functions or services that are not tied to a given level, but they can affect more than one level. Examples include the following:

(a) security service; and

(b) management functions, i.e. functions that permit to configure, instantiate, monitor, terminate the communications of two or more devices.

[0063] Insertion of the media into the signal to be transmitted over the voice channel will typically be accomplished by suitable equipment. Normally, the media will be inserted into or combined with the signal and then the combined signal will preferably be transmitted according to more or less conventional transmission techniques.

[0064] Any of the features described herein can be combined in any combination with any^¬ one or more of the other features described herein within the scope of the invention.

[0065] The reference to any prior art in this specification is not, and should not be taken as an acknowledgement or any form of suggestion that the prior art forms part of the common general knowledge.

BRIEF DESCRIPTION OF DRAWINGS

[0066] Various embodiments of the invention will be described with reference to the following drawings, in which:

[0067] Figure 1 is a generic schematic illustration of a mobile phone system of communication according to a possible embodiment of the present invention.

[0068] Figure 2 is a schematic flowchart of the preference selection algorithm according to a preferred embodiment of the present invention.

[0069] Figure 3 is a schematic flowchart of the implementation of the method for insertion of media into a voice channel according to a preferred embodiment of the present invention.

DESCRIPTION OF EMBODIMENTS

[0070] According to a particularly preferred embodiment of the present invention, a sister meant method for insertion of media into a voice channel is provided.

[0071] The preferred embodiment of the present invention is a system for insertion of media into voice channel the system including a first input and output apparatus for receiving voice input from a first user and delivering voice output from a second or further user, at least one second input and output apparatus for receiving voice input from a second user and delivering voice output from a first or further user, a voice channel established between the first input and output apparatus and the at least one second i put and output apparatus and over which two way communication between the first user and second or further users can take place an access device to access a stored media file containing media for insertion into the voice channel and an insertion mechanism to insert at least a portion of the stored media file into the voice channel established for contemporaneous delivery whilst the voice channel is established.

10072] The voice channel used will typically be electronically implemented voice channel created or accessed between at least two electronic devices. The voice channel may be on a one- to-one basis, a one to man basis, many to one basis or many to many basis.

[0073] Any type of device or apparatus may be used as a part of the system of the present invention. Typically however devices having voice channel capability are used the most common of which are a telephone, with a mobile or fixed or radio or similar device that allows access to a telephony or wireless network.

[0074] Further, the media can be of any type. Typically, the media will be or include music or similar to be played in the background of a voice communication.

[0075] Typically, the media will be delivered when a user activates or authorises the establishment of a voice channel. That is, the media will preferably begin before the voice channel is established if the user is activating an outgoing transmission while the user is waiting for the establishment of the voice channel and is delivered to the recipient that soon as the voice channel is established.

[0076] The present invention preferably operates on a number of different levels having elements of the invention specific to each of the levels, it is therefore convenient to deal with the invention in the context of those levels.

10077] Normally, one or more communications devices are used. The communications devices may be a mobile or fixed device. The communications devices typically have access to one or more communications pathways. One or more of the communications pathways can be wireless or hardwired or a combination of these. For example, a wireless communication network or channel may be established using one or more communications devices accessible by user, one or more base or repeater stations and one or more communications devices accessible by a recipient. In the hardwired configuration, normally a user will have access to one or more communications devices which will be hardwired to an exchange or similar and the exchange will be hardwired to one or more communications devices accessi ble by a recipient.

[0078] As mentioned above, normally the communications devices used in the present invention will be a telephone or radio or another device allowing voice transmission and receipt of such as a computing device or similar.

|0079| Preferably, the system of the present invention will be implemented across any network and will typically be network independent, with no network prevented from access to the system but no network preferred either.

[0080] A schematic illustration of one manner of transmitting a signal is illustrated in Figure 1 . In this i llustration, a smart phone 10 of the first user transmits a signal 1 1 via an uplink or base station 12 the base station 12 transmits the signal to a second base station 12 prime which then retransmits the signal to one or more recipient smartphones 13.

[0081 ] The preferred smartphones used to implement the present invention include a memory which stores one or more instructions and an associated processor in order to process and implement the instructions stored in memory. In addition to the instructions, the memory may also store the media to be inserted. Typically, where provided in the same memory as the instructions, the media is stored in a different portion of the memory. Alternatively, the media may be stored separately from the memory storing the instructions. Further, the memory storing the media may not be on board the communications device but instead may be accessed from a remote store of media. Typically however, speed will be increased if the media is stored on the communications device of one or more of the users.

[0082] Preferably, a portion of the instructions stored in the memory of the communications device will be capable of producing an interface on the device which allows control and operation of the invention. The interface will typical ly be an interface which is produced and displayed according to instructions stored in memory and which is updated when the information related to the interface is changed or updated.

[0083] According to a particularly preferred embodiment of the present invention, the user of a particular communications device will set preferences according to which the processor operates in relation to implementation of the invention. In particular, the user of a particular communications device will normally set one or more generic preferences and/or one or more specific preferences. Generally, the user will have a set of generic preferences which are applied to all transmissions if no specific preferences are set. However if specific preferences are set in relation to one or more transmissions, then the specific references are applied.

[0084] The generic preferences will normally include a type of media to be transmitted for example, whether it is sound only transmitted or sound and vision, or vision only or any combination of different types of media. Another generic preference which may be set can be the genre of the media allowing the user to define that media of a particular genre be transmitted.

[0085] A media database will preferably be provided either on a communications device or remotely and containing pieces of media with each piece of media having one or more media profile identifiers. The user may define particular pieces of media to be transmitted either as part of the generic preferences or specific preferences or the user can set parameters allowing a software application to select media from a larger pool of media either at random, or according to a particular design.

[0086] The pieces of media (which may be simply referred to as "media") may have any form including music, audio, video, books, still images, snippets, or portions of the above. Basically, the media can have any form provided that the content is electronic or digital in order that the media can be transmissible using the system between users. Normally, each piece of media will be stored in an electronic file and it is this electronic file which will typically be capable of transmission, dissemination or copying or the like onto a physical medium.

[0087] The media will typically be provided from any source. For example, an entertainment studio, TV network, Internet social networking systems or record labels may choose to provide media to the system. Alternatively, smaller organisations or individuals may 1.5 choose to provide content directly to the system such as home movies, demo music tracks or the like.

[0088] The media may be provided directly to the system or indirectly.

[0089] Each piece of media will also have one or more media profile identifiers w hich will typically be referred to as "tags". Preferably, the tags will identify the media allowing the user to set parameters by which media can be identified.

[0090J Other preferences can be set as well such as the delivery volume although this may be adjustable whilst the voice channel between users is active.

[0091 J Preferably, the generic preferences applied to all transmissions utilising the system of the invention except those in relation to which specific preferences have been set. Alternatively, the generic preferences for a particular communications device may be set by the manufacturer or seller of the device and may be adjustable by user.

[0092] A user can also preferably set specific preferences. Normally, the specific preferences will be set using similar parameters to the generic preferences. However, additional preferences may be used for the speci fic preferences. Normally, specific preferences are set in relation to specific third parties or groups of third parties with which the user may contact more regularly or alternatively those to which a user may have a particular relationship. Normally, the specific preferences are stored for use when an outgoing transmission is made or an incoming transmission is received from one or more specified third parties.

[0093] The preferences are typically set using the interface as described above. The interface may allow the setting of preferences using data entry fields allowing input of preference data into the memory used according to the present invention or alternatively, a number of selection options may be presented to a user allowing the user to select from one or more options to set the preference data.

[0094] One preferred method of setting the preferences both generic and third-party specific as illustrated in Figure 2.

[0095] The activation of outgoing transmissions will normally also be activated using the interface. Similarly, receipt of incoming transmissions will normally be governed by the preference information but typically, there will be a trigger to accept an incoming transmission which requires action by the user before the voice channel is established. (0096) In practice, the preferred embodiment of the present invention will operate as follows: the user will use the communications device and specifically, the interface on the communications device to indicate that they wish to make an outgoing transmission to a particular recipient. The system will then check to see whether the recipient is a recognised the third-party or not and if a recognised third party, will check to see whether specific preferences have been set up for the recognised third party. If the recipient is a recognised third party without specific preferences or if the recipient is not a recognised third party, then the system of the present invention will locate the generic preferences and enable them for use in the transmission. As mentioned above, if the recipient is a recognised third party with specific preferences, those specific preferences will be located and enabled for use within the transmission. Once the preferences have been located and enabled, the voice channel will then typically be established between the comm unications device of the first user and a communications device of a second or further user with the applicable preferences for use during the transmission.

[0097] If the user is receiving an incoming transmission, normally, the same process will be followed in that the system will check to see whether the transmission sender ears a recognised third party or not and if a recognised third party, will check to see whether specific preferences have been set up for the recognised third party. If the transmission sender is a recognised third party without specific preferences or if the transmission sender is not a recognised third party, then the system of the present invention will locate the generic preferences and enable them for use in the transmission. As mentioned above, if the transmission sender is a recognised third party with specific preferences, those specific preferences will be located and enabled for use within the transmission. Once the preferences have been located and enabled, the voice channel will then typically be established between the communications device of the first user and a communications device of a second or further user with the applicable preferences for use during the transmission.

[0098] Normally, the preferences are enabled and the media is inserted into the voice channel for the duration of the call and the media insertion into the voice channel ceases when the voice channel is closed by ending the transmission.

[0099] A particularly preferred embodiment of implementation of the invention is illustrated in Figure 3.

[00100] Normally, the voice channel is transmitted or established on any band or wavelength using any mechanism and the invention is not limited by the band, wavelength or mechanism of establishment or maintenance of the voice channel. 1.7

[001011 . It is to be noted that many of the functional and procedural rules in relation to the data link level of the present invention are set and maintained by service providers providing the particular channel and normally, the invention will utilise those rules and operate accordingly. In other words, the data link level and the rales applicable to the data link level will normally be those piggybacked from existing services.

(00102) Typically, one of the communications devices used in the present invention initiates the establishment of a voice channel by communication of an invitation to a second or other communication device and a voice channel is only actually established after the one or more recipient users except the invitation through operation of their communication device.

[00103] Typically, the media will be delivered to the initiator of the establishment of the voice channel at initialisation or issuance of the invitation and will begin to be inserted into the voice channel such that it is delivered to the recipient(s) as soon as the invitation is accepted.

[00104] Normally, a voice channel may be established and delivered through, on one or more waypoints or transmission steps or locations between the users.

[00105] As mentioned above, any channel, band or bandwidth range may be used. Typically, a voice channel established in a telephony sense is established as a fixed frequency or band. The frequency or band may differ if the voice channel is established over a landline or a mobile or wireless communications pathway. The media may be inserted into the voice channel in any way using any mechanism. For example, the media may be inserted into the voice channel by integration with the existing voice transmission or into or over a carrier transmission.

[00106] Although dealt with in more detail at the signal level, different layers may be provided in a single transmission, one for voice transmission and one for media transmission with the layers capable of being separated or read separately by the communications device. Preferably, the voice transmission and the media transmission will be provided in a single transmission but maintained separately within that single transmission in order to be transmitted on the same channel.

[00107] The signal level is the level at which the features of the actual signal transmitted and received are defined. Preferabl y, the properties of the signal that is transmitted across the voice channel are provided and defined. Any type or configuration of signal can be used. The signal can be analog or digital. The signal can be a simple signal, a layered signal, a composite signal or a conjugated signal. Preferably, the signal will have hierarchical layering or modulation allowing users with more complex communications devices to access higher levels of the signal than those with simpler communications devices. This may give access to additional portions of the signal upon which other preferences or more complex media may be transmitted.

[00108] Typically, a signal in electrical form is made by a transducer that converts the signal from, whatever is, its original form to a waveform expressed as a current (I) or a voltage (V), or an electromagnetic waveform, for example, an optical signal or radio transmission. Once expressed as an electronic signal, the signal is available for further processing by electrical devices such as electronic amplifiers and electronic filters, and can be transmitted to a remote location by electronic transmitters and received using electronic receivers. Typically, the voice and the media to be transmitted will be combined to produce a signal which can then be transmitted.

[00109] Insertion of the media into the signal to be transmitted over the voice channel will typically be accomplished by suitable equipment. Normally, the media will be inserted into or combined with the signal and then the combined signal will preferably be transmitted according to more or less conventional transmission techniques.

[00110] In the present speci ication and claims (if any), the word 'comprising' and its derivatives including 'comprises' and "comprise" include each of the stated integers but does not exclude the inclusion of one or more further integers.

[001111 Reference throughout this specification to 'one embodiment' or 'an embodiment' means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, the appearance of the phrases 'in one embodiment' or 'in an embodiment' in various places throughout this specification are not necessarily all referring to the same embodiment.

Furthermore, the particular features, structures, or characteristics may be combined in any suitable manner in one or more combinations.

[00112] In compliance with the statute, the invention has been described in language more or less specific to structural or methodical features. It is to be understood that the invention is not limited to specific features shown or described since the means herein described comprises preferred forms of putting the invention into effect. The invention is, therefore, claimed in any of its forms or modifications within the proper scope of the appended claims (if any) appropriately interpreted by those skilled in the art.

Claims

1. A system for insertion of media into a voice channel the system including:

a. a first input and output apparatus for receiving voice input from a first user and delivering voice output from a second or further user;

b. at least one second input and output apparatus for receiving voice input from a second user and delivering voice output from a first or further user;

c. a voice channel established between the first input and output apparatus and the at least one second input and output apparatus and over which two way communication between the first user and second or further users can take place;

d. an access device to access a stored media file containing media for insertion into the voice channel; and

e. an insertion mechanism to insert at least a portion of the stored media file into the voice channel established for contemporaneous delivery whilst the voice channel is established.

2. A method for insertion of media into a voice channel, the method including the steps of: a. receiving voice input from a first user and delivering voice output from a second or further user using a first input and output apparatus,

b. receiving voice input from a second user and delivering voice output from a first or further user using at least one second input and output apparatus;

c. establishing a voice channel between the first input and output apparatus and the at least one second input and output apparatus and over which two way communication between the first user and second or further users can take place;

d. accessing an access device to access a stored media file containing media for insertion into the voice channel; and

e. inserting at least a portion of the stored media file into the voice channel established for contemporaneous delivery whilst the voice channel is established using an insertion mechanism.