WO2005046201A2 - Appareil de signature audio et procedes associes - Google Patents

Appareil de signature audio et procedes associes

Info

Publication number
WO2005046201A2
WO2005046201A2 PCT/US2004/010562 US2004010562W WO2005046201A2 WO 2005046201 A2 WO2005046201 A2 WO 2005046201A2 US 2004010562 W US2004010562 W US 2004010562W WO 2005046201 A2 WO2005046201 A2 WO 2005046201A2
Authority
WO
WIPO (PCT)
Prior art keywords
audio
signature
signal
information
peak signal
Prior art date
Application number
PCT/US2004/010562
Other languages
English (en)
Other versions
WO2005046201A3 (fr
Inventor
Morris Lee
Original Assignee
Nielsen Media Research, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nielsen Media Research, Inc. filed Critical Nielsen Media Research, Inc.
Priority to TW093131124A priority Critical patent/TW200522722A/zh
Publication of WO2005046201A2 publication Critical patent/WO2005046201A2/fr
Publication of WO2005046201A3 publication Critical patent/WO2005046201A3/fr

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04HBROADCAST COMMUNICATION
    • H04H60/00Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
    • H04H60/35Arrangements for identifying or recognising characteristics with a direct linkage to broadcast information or to broadcast space-time, e.g. for identifying broadcast stations or for identifying users
    • H04H60/37Arrangements for identifying or recognising characteristics with a direct linkage to broadcast information or to broadcast space-time, e.g. for identifying broadcast stations or for identifying users for identifying segments of broadcast information, e.g. scenes or extracting programme ID
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04HBROADCAST COMMUNICATION
    • H04H60/00Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
    • H04H60/35Arrangements for identifying or recognising characteristics with a direct linkage to broadcast information or to broadcast space-time, e.g. for identifying broadcast stations or for identifying users
    • H04H60/38Arrangements for identifying or recognising characteristics with a direct linkage to broadcast information or to broadcast space-time, e.g. for identifying broadcast stations or for identifying users for identifying broadcast time or space
    • H04H60/41Arrangements for identifying or recognising characteristics with a direct linkage to broadcast information or to broadcast space-time, e.g. for identifying broadcast stations or for identifying users for identifying broadcast time or space for identifying broadcast space, i.e. broadcast channels, broadcast stations or broadcast areas
    • H04H60/43Arrangements for identifying or recognising characteristics with a direct linkage to broadcast information or to broadcast space-time, e.g. for identifying broadcast stations or for identifying users for identifying broadcast time or space for identifying broadcast space, i.e. broadcast channels, broadcast stations or broadcast areas for identifying broadcast channels
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04HBROADCAST COMMUNICATION
    • H04H60/00Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
    • H04H60/56Arrangements characterised by components specially adapted for monitoring, identification or recognition covered by groups H04H60/29-H04H60/54
    • H04H60/58Arrangements characterised by components specially adapted for monitoring, identification or recognition covered by groups H04H60/29-H04H60/54 of audio
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04HBROADCAST COMMUNICATION
    • H04H2201/00Aspects of broadcast communication
    • H04H2201/90Aspects of broadcast communication characterised by the use of signatures

Definitions

  • the present disclosure relates generally to audio/video content delivery and distribution systems and, more specifically, to audio signature apparatus and methods for use with audio/video content delivery and distribution systems.
  • BACKGROUND [0003] Ratings information and/or audio/video content consumption information are typically generated by collecting consumption records (e.g., viewing records) or other information from a group of statistically selected households. Each ofthe statistically selected households typically has a data logging and processing unit commonly referred to as a "home unit" or "site unit.” The site unit may communicate with a variety of attachments that provide inputs to the site unit or that receive outputs from the site unit.
  • a source identification unit such as a frequency detector attachment, which is a well-known device, may be in communication with a television to sense a local oscillator frequency ofthe television tuner.
  • the frequency detector attachment may be used to determine if the television is operating (i.e., is turned on) and to determine to which broadcast channel the television is currently tuned based on a detected frequency.
  • a people counter which is also a well-known device, may be located in the consumption space ofthe television and in communication with the site unit, thereby enabling the site unit to detect the number of persons currently consuming programs displayed on the television.
  • the site unit usually processes the inputs (e.g., channel tuning information, number of viewers, etc.) from the attachments to produce audio/video content consumption records.
  • Consumption records may be generated on a periodic basis (i.e., at fixed time intervals) or may be generated in response to a change in an input such as, for example, a change in the number ofthe persons viewing the television, a change in the channel tuning information (i.e., a channel change), etc. and may contain channel information such as a station or channel number and a time (e.g., a date and time of day) at which the channel was consumed in the case where the content being consumed is associated with a broadcast television program.
  • the consumption records may include content identification (i.e., program identification) information as well as information relating to the time and manner in which the associated content was consumed.
  • content identification i.e., program identification
  • consumption records may contain other or additional information such as the number of viewers present at the viewing time.
  • the site unit collects a quantity of consumption records and typically transmits collected records, usually daily, to a central office or central data processing facility for further processing or analysis.
  • the central data processing facility receives consumption records from site units located in some or all ofthe statistically selected households and analyzes the records to ascertain the audio/video consumption behaviors of a particular household or a particular group of households selected from all participating households. Additionally, the central data processing facility may generate consumption behavior statistics and other parameters indicative of consumption behavior associated with some or all ofthe participating households.
  • digital broadcast systems e.g., satellite-based digital television systems, digital cable systems, etc.
  • digital broadcast systems typically include a digital receiver or set-top box at each subscriber site.
  • the digital receiver or set-top box demodulates a multi-program data stream, parses the multi-program data stream into individual audio and/or video data packets, and selectively processes the data packets to generate an audio/video signal for a desired program.
  • the audio and/or video output signals generated by the set-top box can typically be directly coupled to an audio/video input of an output device (e.g., a television, a video monitor, etc.)
  • an output device e.g., a television, a video monitor, etc.
  • the local oscillator frequency ofthe output device tuner if any, does not necessarily have any meaningful relationship to the channel or program currently being consumed.
  • obtaining consumption records or information is complicated in cases where the audio/video content being consumed is delivered via a local source such as, for example, a DVD player, a DVR, a VCR, or the like.
  • Such local sources enable the same audio/video content to be viewed at various times in different manners across households, thereby precluding the use of a program lineup or the like at the central facility to identify consumed audio/video content based on channel and consumption time information and significantly complicating audio/video content identification at the central facility.
  • Some techniques for identifying consumed audio/video content are based on the use of audio and/or video signatures.
  • signature-based channel or audio/video content identification techniques use one or more characteristics of currently consumed (but not yet identified) audio/video content to generate a substantially unique signature (e.g., a series of digital values, a waveform, etc.) for that content.
  • the signature information for the content being consumed is typically compared to signature information associated with known audio/video content. When a substantial match is found, the currently consumed audio/video content can, with a relatively high probability, be identified as the portion ofthe known audio/video content having substantially matching signature information.
  • the use of audio signatures has proven to be particularly efficient for identifying audio/video content such as, for example, television broadcasts.
  • audio signature-based audio/video content identification techniques are computationally efficient and require less memory than known video signature-based techniques.
  • Some known audio signature-based techniques use the root mean square (RMS) ofthe energy ofthe audio signal associated with a program as signature information.
  • RMS root mean square
  • Such RMS -based techniques do not consider the frequency of the audio signal being monitored and, as a result, may not provide a sufficiently high degree of content identification accuracy.
  • Still other known audio signature-based techniques use the number of zero crossings between fixed time intervals to generate signature information. However, such count-based zero crossing audio signature techniques do not consider the magnitude ofthe audio signals being monitored and, thus, may not provide a sufficiently high degree of content identification accuracy.
  • Fig. 1 is a block diagram of an example system that uses audio signatures to identify audio/video content being consumed.
  • Fig. 2 is a more detailed block diagram that depicts an example manner in which the audio signature processor shown in Fig. 1 may be implemented.
  • Fig. 3 is a more detailed block diagram that depicts an example manner in which the signature generators shown in Fig. 2 may be implemented.
  • Fig. 4 is an example graph that depicts an audio signal that may be processed by the example signature generator shown in Fig. 3.
  • Fig. 5 is an example graph that depicts zero crossing data that may be generated by the example audio signature processor of Fig. 1.
  • Fig. 6 is a block diagram that depicts another example manner in which the audio signature processor shown in Fig. 1 may be implemented.
  • Fig. 7 is a flow diagram that depicts an example manner in which the apparatus shown in Fig. 6 may be configured to perform the audio signature processing functions described herein.
  • Fig. 8 is a flow diagram that depicts an example manner in which the apparatus shown in Fig. 6 may be configured to perform the audio signature generation functions described herein.
  • Fig. 9 is another example manner in which the audio signature apparatus and methods described herein may be implemented.
  • DETAILED DESCRIPTION [0019]
  • Fig. 1 is a block diagram of an example system 10 that uses audio signatures to identify audio/video content.
  • the example system 10 corresponds to an audio/video content consumption site such as, for example, a residence, a business location, or any other site at which video and/or audio programs may be consumed (e.g., viewed, listened to, etc.)
  • Fig. 1 depicts a single system or consumption site 10. However, a plurality of consumption sites may be configured in manners similar or identical to that ofthe example system 10 to implement the audio signature apparatus and methods described herein.
  • the example system 10 includes an output device 12 such as, for example, a television or a video monitor that receives an audio/video content signal 14.
  • the signal 14 may be a broadcast signal transmitted by a broadcast station (not shown) such as, for example, a television station, radio station, etc. that transmits video programs, audio programs and/or other audio/video content or programs (e.g., television programs, advertisements, etc.) over a relatively wide geographic region to a plurality of viewers and/or subscribers.
  • the signal 14 may be provided by one or more local audio/video content delivery device (e.g., DVRs, VCRs, etc.).
  • the signal 14 may include one or more analog signals or information (e.g., conventional analog television transmissions), a digital data stream containing one or more video and/or audio programs in the form of data packets, or any desired combination thereof.
  • the audio/video content signal 14 may be conveyed to the system 10 using any desired combination of wireless and hardwired links, including, for example, satellite links, cellular links, telephone lines, cable, the Internet, etc.
  • a receiver, decoder or set-top box 16 may be serially interposed between the broadcast signal 14 and the output device 12.
  • the set-top box 16 demodulates the broadcast signal, demodulates multi- program data streams and selectively parses video and/or audio data packets associated with a desired channel and/or program.
  • the selected data packets are processed to form an output signal 18 that can be output (e.g., displayed, played, etc.) by the output device 12.
  • the output signal 18 may be a composite video signal, an S-video signal, an RGB signal, or any other displayable video signal applied to the appropriate input connections ofthe output device 12.
  • the set-top box 16 may not be required and the signal 14 can be directly coupled to the output device 12 (e.g., directly coupled to UHF/VHF inputs).
  • the set-top box 16 may also perform access control functions such as, for example, determining what programs are available for viewing by a user ofthe system 10 based on subscription status or subscription information associated with the system 10, generating displayable program guide information, etc.
  • the example system 10 also includes an audio signature processor 20 that may be configured to perform audio signature generation, comparison and/or signature matching detection functions to determine what channel, program and/or audio/video content is currently being consumed via the output device 12. More specifically, the audio signature processor 20 receives the audio/video content signal 14 and an audio signal 22 from the output device- 12. The audio signal 22 contains audio information associated with the audio/video content currently being consumed via the output device 12 (e.g., the television channel and program to which the system 10 is currently tuned). Alternatively, the audio signature processor 20 may be coupled to an acoustic transducer 24 such as, for example, a microphone that is proximate to an acoustic output device (e.g., a speaker) associated with the output device 12. In that case, the acoustic transducer 24 supplies an audio signal 25 containing information associated with the program currently being consumed via the output device 12 instead ofthe audio signal 22.
  • an acoustic transducer 24 such as, for example, a microphone that is proximate to an
  • the audio signature processor 20 may generate signatures or signature information (i.e., signature information that is associated with the audio/video content currently being consumed via the output device 12 and generated from the audio signal 22 or, alternatively, for the audio signal 25 provided by the acoustic transducer 24), and reference signatures or signature information from the audio/video content signal 14.
  • the audio signature processor 20 sequentially generates reference signatures or signature information for a series of audio/video content portions (e.g., audio/video programs) derived from the signal 14 and compares the reference signature information to the signature information ofthe audio/video content currently being consumed.
  • the audio signature processor 20 identifies the audio/video content currently being consumed via the output device 12 as the audio/video content associated with the reference signature information to which the currently viewed audio/video content signature information substantially matched.
  • the audio signature processor 20 is coupled to a site unit 26.
  • the audio signature processor 20 periodically or continuously sends consumption information (e.g., channel information, program information, audio/video content identification information, etc.) associated with audio/video content consumed via the output device 12 to the site unit 26.
  • the site unit 26 processes the consumption information it receives from the audio signature processor 20 and sends, for example, viewing or consumption records to a central facility (not shown) via a communication link 28.
  • the communication link 28 may include one or more wireless communication links (e.g., cellular, satellite, etc.), hardwired communication links (e.g., phone lines), or any other combination of communication hardware and technology platforms, communication protocols, etc.
  • Fig. 1 depicts the audio signature processor 20 as being located at a consumption site, some or all ofthe functions ofthe audio signature processor 20 can be distributed among a plurality of physical locations. For instance, as discussed in greater detail in connection with the example system shown in Fig. 9 below, the reference signature generation function, the signature comparison function and/or the signature match detection function may be performed by different physical systems, some or all of which may be located in different physical locations.
  • the example system 10 of Fig. 1 may be configured to receive and process audio/video content delivered via a broadcast signal (i.e., the signal 14), those having ordinary skill in the art will readily recognize that the example system 10 could be configured to alternatively or additionally receive and process audio/video content delivered via other manners.
  • local audio/video content delivery devices such as a DVD player, a DVR, a VCR, etc. may provide signals containing audio/video content consumed via the output device 12 to the audio signature processor 20.
  • audio/video content delivered via the signal 14 audio/video content delivered via one or more local sources could be processed by the audio signature processor 20 and/or the site unit 26 to generate audio/video consumption information.
  • Fig. 2 is a more detailed block diagram that depicts an example manner in which the audio signature processor 20 shown in Fig. 1 may be implemented.
  • the example audio signature processor 120 shown in Fig. 2 includes an audio/video content selector 122 that receives a reference signal 121 (which may, for example, correspond to the signal 14) that contains audio/video content (e.g., a plurality of broadcast channels and programs) available for consumption (e.g., viewing, listening, etc.) and selects a particular portion of that audio/video content (e.g., a particular broadcast channel and/or program) for further processing.
  • a reference signal 121 which may, for example, correspond to the signal 14
  • audio/video content e.g., a plurality of broadcast channels and programs
  • available for consumption e.g., viewing, listening, etc.
  • selects a particular portion of that audio/video content e.g., a particular broadcast channel and/or program
  • the audio/video content selector 122 may vary a demodulator mixing frequency to selectively tune to a particular channel.
  • the audio/video content selector 122 may include digital receiver functions that demodulate, demultiplex and selectively parse the data stream to extract audio data packets associated with a particular broadcast channel and/or audio/video program. In either case, the techniques for processing such multi-program analog signals and digital signals are well known and, thus, are not described in greater detail herein.
  • the audio/video content selection performed by the audio/video content selector 122 results in the sequential generation of signature information for a plurality of audio/video content portions (e.g., broadcast channels and programs). Also, generally, the audio/video content selection continues (i.e., selects different content portions such as, for example, different broadcast channels) until the audio signature processor 120 determines that a substantial match has been identified (i.e., that the audio/video currently being consumed via the output device 12 (Fig.
  • the selection process performed by the audio/video content selector 122 may be based on a predetermined numerical sequence. In another example, the selection process may be based on a probability of matching. For example, the audio/video content selector 122 may select those broadcast channels consumed most recently. Additionally or alternatively, the audio/video content selector 122 may select broadcast channels in a sequence based on their frequency of consumption.
  • the audio/video content selector 122 outputs a signal 124 to a signature generator 126.
  • the signal 124 contains audio information derived from the audio/video content currently selected by the audio/video content selector 122.
  • the signature generator 126 processes the audio information it receives to generate audio signature information therefrom.
  • the signature generator 126 measures the magnitudes ofthe peak values ofthe signal 124 between successive zero crossings ofthe signal 124.
  • a zero crossing is a point of zero signal magnitude between a positive signal magnitude and a negative signal magnitude.
  • the sum ofthe magnitudes is then calculated for each of a plurality of time intervals or sample intervals. As a result, a series of such sums is generated.
  • the resulting series of sums is substantially uniquely characteristic ofthe signal 124 and, thus, may be used as a reference audio signature for the audio/video content being provided to the signature generator 126 via the audio/video content selector 122.
  • Another signature generator 128 receives an audio output signal from the output device 12 (e.g., the audio signal 22 or, alternatively, the audio signal 25 from the acoustic transducer 24).
  • the signature generator 128 is substantially the same as or identical to the signature generator 126 and, thus, generates audio signature information in a substantially similar or identical manner to that ofthe signature generator 126.
  • the signature generators 126 and 128 may each include a smoothing function that serves to eliminate noise, spurious data, etc. to facilitate the matching of signature information derived from the same audio/video content delivered via different signal paths (e.g., via an output device audio output and the reference signal).
  • a time stamper 130 may optionally be coupled to one or both ofthe signature generators 126 and 128.
  • the time stamper 130 may provide time stamps that are used by the signature generators 126 and 128 to time stamp signature data generated thereby. For example, each sum of peak values may have a time stamp associated therewith to facilitate subsequent signature comparison operations, correlation operations, matching operations, etc.
  • the time stamper 130 may generate relatively coarse time intervals such as, for example, one second increments, each of which may correspond to an absolute time or a relative time based on some reference time.
  • time stamps and thus, the time stamper 130 are not necessary to perform signature comparisons, correlations and the like.
  • the signature generators 126 and 128 provide respective signature information 132 and 134 to a signature comparator 136.
  • the signature comparator 136 compares the audio signature information 132 associated with or generated from the audio/video content information (i.e., the signal 124) that is currently selected (i.e., selected by the audio/video content selector 122) from the reference signal 121 to the signature information 134, which is associated with or generated from the output device audio output signal (i.e., an audio signal or information associated with the audio/video content currently being consumed via the output device 12).
  • the signature comparator 136 may use time stamps associated with the signature data when performing signature matching activities, correlation activities, etc.
  • the time stamps may be used to approximately align or correlate data in different signature data records or files. Those having ordinary skill in the art will appreciate that such a rough or approximate alignment may significantly reduce the amount of time and/or processing needed to perform signature matching operations.
  • the comparison of audio signatures can be performed using any known or desired technique. One preferred manner of comparing audio signatures, particularly in examples where the signature comparisons are performed locally (e.g., at each consumption site), uses the well-known standard normalized correlation formula.
  • the signature comparator 136 compares signatures by calculating a sum of differences between the reference signature information 132 and the signature information associated with the audio/video content currently being consumed (i.e., the signature information 134) over a predetermined interval or time period. If the sum of differences is less than or equal to a predetermined threshold, the signature comparator 136 may provide an output signal or information 138 indicating that at least a substantial match has been detected (i.e., that the known audio/video content currently selected by the audio/video content selector 122 is substantially similar or identical to the audio/video content currently being consumed via the output device 12). Under this approach, threshold values may be selected to suit the requirements ofthe application. For example, relatively low threshold values may be used if a high degree of certainty in the audio/video content identification process is desired.
  • the signature comparator 136 may calculate a difference signal or error signal and then calculate an average error, a peak or maximum error, a standard deviation of error, or any other parameters characteristic ofthe differences, if any, between the signature information 132 and 134. One or more of those parameters or characteristics may be compared to one or more threshold values and a determination of whether a substantial' match or an identical match is indicated via the output 138 based on whether those parameters or characteristics are less than or greater than the one or more threshold values. [0036] The signature comparator 136 may also provide a feedback signal or information 140 to the audio/video content selector 122 to facilitate the sequential selection of audio/video content from the reference signal 121.
  • the feedback signal 140 may indicate a non- match to the audio/video content selector 122.
  • the audio/video content selector 122 may select or tune the next portion of audio/video content (e.g., a different broadcast channel and/or program) in its search or scan sequence.
  • An audio/video content identifier 142 is coupled to the signature comparator 136 and receives the match information output 138.
  • the audio/video content identifier 142 determines to what audio/video content portion (e.g., channel) the audio/video content selector 122 is tuned and, thus, can identify what audio/video content is currently being consumed via the output device 12.
  • the audio/video content identifier 142 may be configured to determine, based on a program lineup or schedule, which channel and/or program is being consumed via the output device 12 (Fig. 1).
  • the audio/video content identifier 142 may be coupled to the site unit 26 (Fig. 1) to provide the identification information (e.g., broadcast channel, program identity, etc.) to the site unit 26 (Fig. 1).
  • the site unit 26 can use the audio/video content identification information provided by the audio/video content identifier 142 to generate viewing records and the like.
  • Fig. 3 is a more detailed block diagram that depicts an example manner in which the signature generators 126 and 128 shown in Fig. 2 may be implemented.
  • the example signature generator 226 shown in Fig. 3 receives an input signal 228 (which may, for example, correspond to the signal 124 output by the audio/video selector 122) via signal conditioning circuitry 230.
  • the signal conditioning circuitry 230 may include analog and or digital circuitry for filtering (e.g., noise filtering, anti- aliasing filtering, transient filtering, etc.). One particularly useful filtering circuit may provide a bandpass filter characteristic from 300 hertz to 3000 hertz. Additionally or alternatively, the signal conditioning circuitry 230 may include protection circuitry (e.g., surge protection circuitry), level shifting circuitry, amplification circuitry, attenuation circuitry, or any other known or desired signal conditioning circuitry. Of course, the signal conditioning circuitry 230 may be eliminated from the signature generator 226 in the event that the signals provided to the signature generator 226 do not require conditioning.
  • filtering e.g., noise filtering, anti- aliasing filtering, transient filtering, etc.
  • One particularly useful filtering circuit may provide a bandpass filter characteristic from 300 hertz to 3000 hertz.
  • the signal conditioning circuitry 230 may include protection circuitry (e.g., surge protection circuitry), level shifting circuitry,
  • Conditioned signal(s) 232 output by the signal conditioning circuitry 230 are provided to a zero crossing detector 234 and a peak detector 236.
  • the zero crossing detector 234 may use a one-shot multi-vibrator or the like to output a pulse to the peak detector 236 each time a zero crossing occurs within the conditioned signal(s) 232.
  • the peak detector 236 may be implemented using any desired peak detection circuit to detect peak signal magnitude. For example, in the case where the signals 232 are analog signals, a diode, capacitor and bleed resistor combination may be used to detect peak value. On the other hand, in the case where the signals 232 are digital values, the peak detector 236 may simply retain the largest numerical value following a reset.
  • the peak detector 236 resets (e.g., to zero) in response to zero crossing pulses or other signals provided by the zero crossing detector 234. As a result, the peak detector 236 outputs a series of signal peak magnitudes, each of which occurs between successive zero crossings.
  • a summer 238 receives the series of peak signal magnitudes from the peak detector 236 and generates sums of these peak signal magnitudes for each ofthe predetermined time intervals or sample intervals. In one example, the summer 238 may sum a plurality of peak magnitudes (absolute values) occurring within a predetermined number of samples (e.g., 125 samples) collected at a predetermined rate (e.g., 8000 samples per second) from the conditioned signal 232.
  • An output 242 ofthe summer 238 provides a series of positive sum values at a rate equal to the sample rate divided by the sample size for each sum.
  • the output 242 provides sums at a rate of 64 per second.
  • the summer 238 may also receive a signal 240 (e.g., time stamp values) from the time stamper 130 (Fig. 2) that enables the summer 238 to associated time stamp values with one or more ofthe sums at the output 242.
  • the signature generator 226 may also include a smoother 244 that performs a smoothing function on the series of sums output by the summer 238.
  • the smoother 244 may perform a low pass filter function to eliminate noise and other spurious interference or signal components that may adversely affect signature match detection.
  • a and b may be different values if desired to suit the needs of a particular application.
  • a smoothing function such as the example function set forth above may be successively applied to the data multiple times (e.g., ten times) to achieve a desired smoothness.
  • the filtering performed by the smoother 244 may be implemented using any desired combination of passive components (i.e., resistors, capacitors, inductors, etc.), active components (i.e., transistors, operational amplifiers, etc.) and/or digital components (i.e., digital logic, processing units, memory, etc.).
  • passive components i.e., resistors, capacitors, inductors, etc.
  • active components i.e., transistors, operational amplifiers, etc.
  • digital components i.e., digital logic, processing units, memory, etc.
  • Fig. 4 is an example graph 300 that depicts an audio signal 302 that may be processed by the example signature generator 226 shown in Fig. 3.
  • the signal 302 may be provided as the input 228 (Fig. 3) to the signal conditioning circuitry 230 (Fig. 3) that, in turn, provides a conditioned version (e.g., the conditioned signal 232) ofthe signal 302 to the zero crossing detector 234 and the peak detector 236.
  • a conditioned version e.g., the conditioned signal 232
  • the zero crossing detector 234 Fig. 3 outputs a pulse to the peak detector 236 (Fig. 3) that resets the peak value retained by the peak detector 236 to zero.
  • the signal 302 increases in magnitude until it reaches a peak value "a.”
  • the peak detector 236 retains the peak value "a” while the signal 302 decreases between "a” and a second zero crossing 306.
  • the peak detector 236 outputs the magnitude ofthe peak value "a” to the summer 238 and then resets its retained peak value to zero.
  • the signal 302 decreases in value until it reaches a negative peak "b," the absolute value of which is retained by the peak detector 236 as the signal 302 increases to zero at a third zero crossing 308.
  • the peak detector 236 outputs the magnitude of "b" to the summer 238 and the zero crossing detector 234 outputs a pulse that resets the retained peak value ofthe peak detector 236 to zero.
  • This process continues for peaks "d,” through “k” up to time a t ! (which may correspond to a predetermined time interval or sample interval including, for example, 125 samples), at which point the summer 238 output its retained sum to the smoother 244 (Fig. 3) and resets its retained sum to zero.
  • the signature generator 126 repeats this process over the predetermined time interval, sample interval, or number of samples between times ti and t 2 for peaks "1" through “r,” which results in a second sum being passed to the smoother 244.
  • Fig. 5 is an example graph that depicts zero crossing data that may be generated by the example audio signature processor 120 of Fig. 2 from an audio signal such as, for example, the example signal 302 of Fig. 4. More specifically, the sequence of sums output by the summer 238 (e.g., the output 242), which may be generally referred to as a zero crossing energy curve, is depicted at reference numeral 320.
  • the zero crossing energy curve 320 has fewer discrete data values in any given time period or sample period because a plurality of samples or peak magnitude values from the signal 302 are summed to form each data value making up the zero crossing energy curve 320.
  • a data reduction ratio of 125:1 i.e., from the signal 302 to the zero crossing energy curve 320
  • the zero crossing energy curve 320 may then be processed by the smoother 244 to form a smoothed zero crossing energy curve 322.
  • the example audio signature processor 420 is a processor-based system that includes a processor 422 coupled to a memory 424 having software, firmware and/or any other machine readable instructions 425 stored therein. As described in greater detail in connection with Figs. 7 and 8 below, the processor 422 executes the software, firmware and/or machine readable instructions 425 to implement substantially the same signature generation, signature comparison and audio/content (e.g., channel, program, etc.) identification functions performed by the apparatus shown in Figs. 2 and 3. [0047]
  • the processor 422 may be any suitable microprocessor, microcontroller or processing unit capable of executing machine readable instructions.
  • the memory 424 may be an desired combination of volatile and non- volatile memory including, for example, read-only memory (ROM) devices, electrically erasable programmable read only memory (EEPROM) devices, random access memory (RAM) devices, mass storage devices (e.g., disk drives for magnetic or optical media), or any other suitable storage devices.
  • ROM read-only memory
  • EEPROM electrically erasable programmable read only memory
  • RAM random access memory
  • mass storage devices e.g., disk drives for magnetic or optical media
  • the example audio signature processor 420 may also include an audio/video content selector 426, which may be substantially similar or identical to the audio/video content selector 122 (Fig. 2) that receives a reference signal such as, for example, the signal 14 (Fig. 1).
  • the audio/video content selector 426 is controlled by the processor 422 to scan or search a series of audio/video content portions (e.g., broadcast channels or programs) in a predetermined sequence. As described above in connection with the audio/video content selector 122, the audio/video content selection process or sequence may be based one or more factors such as, for example, the frequency with which audio/video content portions (e.g., channels) are typically consumed, the most recently consumed channels, etc.
  • the audio/video content selector 426 outputs a signal 428 that contains audio information associated with a particular portion of audio/video content derived from the reference signal (e.g., a particular broadcast channel, program, etc.)
  • a signal conditioner 430 receives the signal 428 and the output device output signal 22 (Fig. 1).
  • the signal conditioner 430 may be substantially the same or identical to the signal conditioning circuitry 230 (Fig. 3).
  • An analog-to-digital (A D) converter 432 receives the audio signals output by the signal conditioner 430.
  • the A/D converter 432 includes at least two channels to enable the simultaneous analog-to-digital conversion ofthe conditioned versions of the reference signal 428 and the output device audio output.
  • the audio signature processor 420 also includes a communications interface 434 that enables the processor 422 to communicate with the site unit 26 (Fig.
  • the communications interface 434 may include level shifting circuitry, bus line or signal line driver circuitry and/or any other known or desired communications circuitry.
  • Fig. 7 is a flow diagram that depicts an example manner in which the audio signature processor 420 shown in Fig. 6 may be configured to perform the audio signature processing functions described herein.
  • the processor 422 controls the audio/video content selector 426 (Fig. 6) to select a known audio/video content (e.g., a broadcast channel or program) from the reference signal (block 500).
  • a known audio/video content e.g., a broadcast channel or program
  • the processor 422 may cause the audio/video content selector
  • the processor 422 controls the A/D converter 432 to acquire audio information or data associated with the selected audio/video content (via the signal 428 and the signal conditioner 430) and the output device audio output (block 502). Specifically, the processor 422 acquires digital values representing an audio reference signal (associated with known, selected audio/video content) and an audio signal associated with audio/video content currently being consumed via the output device 12 (Fig. 1).
  • the processor 422 generates audio signatures or signature information from the acquired digital values (block 504) by summing peak signal magnitudes between successive zero crossings over a plurality of time intervals or sample intervals as described in greater detail in connection with Fig. 8.
  • the processor 422 compares the audio signature information (block 506) and determines if the audio signature information associated with the selected audio/video content matches (i.e., is substantially similar or identical to) the audio signature information associated with the audio/video content (e.g., channel or program) currently being consumed via the output device 12 (Fig. 1) (block 508).
  • the processor 422 (Fig. 6) identifies the audio/video content currently being consumed via the output device 12 as the audio/video content currently selected by the audio/video content selector 426 (Fig. 6) (block 510).
  • the processor 422 may, if desired, also determine the program (e.g., a television program) that is associated with the identified audio/video content by using channel information (which may be known by the processor 422 as described below), local time information (i.e., the time at the consumption site 10) and program guide information (e.g., a television programming guide).
  • the processor 422 may be programmed to cause the audio/video content selector 426 to rune to one or more predetermined channels or programs and may associate channel or other tuning information (e.g., packet selection criteria in the case where the reference signal contains digital audio/video signals) with the signature information.
  • the processor 422 determines if there are more audio/video content portions (e.g., channels) to be scanned or searched within the signal 14 (Fig. 1) (block 512). If there are more audio/video content portions to scan or search, the processor 422 (Fig.
  • 6) selects another audio/video content portion (e.g., another channel) from the signal 14 (Fig. 1) (block 514).
  • the selection of a next audio/video content portion (block 514) may be based on a determination of what audio/video content portions are more likely to be consumed via the output device 12 (Fig. 1), what audio/video content portions (e.g., channels) were most recently consumed, a predetermined sequence of audio/video content portions, or in any other desired manner.
  • control returns to block 502.
  • the processor 422 may execute error handling routines such as, for example, generating an error message that can be transmitted to the site unit 26 (Fig. 1) via the communications interface 434 (Fig. 6).
  • Fig. 8 is a flow diagram that depicts an example manner by which the audio signature processor 420 shown in Fig. 6 may be configured to perform the audio signature generation function described in connection with block 504 of Fig. 7.
  • the processor 422 resets a sample interval sum to zero (block 600) and then waits for a zero crossing ofthe audio signal for which a signature is to be generated (e.g., the output device audio output signal and/or the signal 428) (block 602).
  • the processor 422 Fig. 6) continuously or periodically acquires the peak magnitude ofthe signal (block 604) until a subsequent zero crossing is detected (block 606).
  • the processor 422 After the subsequent zero crossing is detected (block 606), the processor 422 adds the peak magnitude acquired at block 604 to an interval sum (block 608). The processor 422 then determines if the sample interval has expired (e.g., a predetermined amount of time has elapsed, a predetermined number of samples have been acquired, etc.) (block 610). The sample interval may be a predetermined amount of time during which peak magnitudes are summed. If the sample interval has not expired (block 610), the processor 422 returns control to block 604. On the other hand, if the sample interval has expired, the processor 422 sends the current interval sum to a smoothing function (block 612) and then returns control to block 600.
  • a smoothing function block 612
  • Figs. 7 and 8 may be especially useful when applied in a system that performs local (e.g., at each consumption site) audio/video content matching or correlation on a substantially real time basis.
  • Figs. 7 and 8 may be readily adapted for use in applications in which audio/video content matching or correlation is performed at a central facility and/or using historical (e.g., previously gathered) consumption records or information.
  • Fig. 9 is another example manner in which the audio signature apparatus and methods described herein may be implemented.
  • the broadcast station 702 transmits one or more signals containing digital and/or analog audio/video content information to a reference site 708 and at least one consumption site 710 via communication paths or links 712 and 714.
  • the communication paths or links 712 and 714 may include any combination of hardwired or wireless links such as, for example, satellite links, wireless land-based links, cable links, etc.
  • the signals conveyed via the links 712 and 714 may contain multi-program analog signals and/or digital data streams, which are commonly employed with existing broadcast systems.
  • the functions performed by the audio signature processor 20 shown in Fig. 1 are distributed between the reference site 708 and the consumption site 710.
  • the reference site 708 includes a plurality of receivers (e.g., set-top boxes or the like) 716, 718 and 720 that simultaneously demodulate, demultiplex and/or decode audio, video and/or other information received from the broadcast station 702.
  • each ofthe receivers 716, 718 and 720 provides audio information associated with a different program that is currently being broadcast to a reference site processor 722.
  • the receiver 716 may provide audio information associated with a program A while the receivers 718 and 720 provide audio information associated with respective programs B and C.
  • the reference site processor 722 is configured to control and/or has information indicating to which program each ofthe receivers 716, 718 and 720 is currently tuned.
  • the reference site processor 722 includes the apparatus and methods described herein for generating reference signature information for a plurality of simultaneously broadcast audio/video content.
  • the reference site processor 722 may include one or more signature generators 226 shown and described in connection with Fig. 3.
  • the reference site processor 722 sends the reference signature information it generates to a central processing facility 724, which stores the reference signature information in a database 726.
  • the consumption site 710 could be, for example, a statistically selected home containing a television, a radio, a computer, etc.
  • the consumption site 710 includes an output device 728 (e.g., a video display, speaker, etc.)
  • the consumption site 710 also includes a receiver (e.g., a set-top box) 730, which may be similar or identical to the receivers 716-720. Such receivers are well known and, thus, are not described in greater detail herein.
  • the receiver 730 provides audio and/or video signals 732 to the output device 728 that are used to output the program currently selected for consumption.
  • the receiver 730 also provides an audio signal 734 containing audio information associated with the currently consumed program to a consumption site processor 736.
  • the consumption site processor 736 performs the signature generation functions described in connection with the signature generator 226 of Fig. 3 and sends the generated audio signature information to a site unit 738.
  • the site unit 738 stores and periodically conveys the generated audio signature information to the central processing facility 724.
  • the central processing facility 724 is configured to perform the functions ofthe audio signature comparator 136 and the audio/video content identifier 142 discussed in connection with Fig. 2.
  • the central processing facility 724 compares signature information generated at the consumption site 710 to the reference signature information stored in the database 726 to identify the channels and/or programs consumed at the consumption site 710.
  • the reference site processor 722 and the consumption site processor 736 may generate time stamp information and associate such time stamp information with the audio signature information. For example, each interval sum may have an associated time stamp. In this manner, the central processing facility 724 can temporally align (at least approximately) the audio signature information received from the sites 708 and 710 prior to making comparisons.
  • the functional blocks making up these examples can be implemented using any desired combination of analog and/or digital hardware. Further, some or all of these functional blocks may be implemented within a processor-based system that executes machine readable instructions or programs, which may be embodied in software stored on a tangible medium such as a CD-ROM, a floppy disk, a hard drive, a DVD, or a memory. Still further, some or all ofthe functional blocks may be combined or integrated together and/or implemented using multiple functional blocks.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Television Systems (AREA)

Abstract

Appareil d'identification de canaux ou de programmes sur la base de signatures et procédés d'utilisation de systèmes de fourniture et de distribution de contenu audio / vidéo. Les procédés selon la présente invention comportent l'acquisition d'amplitudes de signal maximales entre des passages par zéro d'un signal audio de référence et d'un signal audio associé au contenu audio / vidéo consommé sur le site de consommation. Des signatures pour les signaux sont établies à l'aide des sommes des amplitudes de signal maximales et les signatures sont comparées pour identifier le contenu audio / vidéo consommé sur le site de consommation.
PCT/US2004/010562 2003-10-16 2004-04-06 Appareil de signature audio et procedes associes WO2005046201A2 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW093131124A TW200522722A (en) 2003-10-16 2004-10-14 Audio signature apparatus and methods

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US51183803P 2003-10-16 2003-10-16
US60/511,838 2003-10-16

Publications (2)

Publication Number Publication Date
WO2005046201A2 true WO2005046201A2 (fr) 2005-05-19
WO2005046201A3 WO2005046201A3 (fr) 2005-08-11

Family

ID=34572750

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2004/010562 WO2005046201A2 (fr) 2003-10-16 2004-04-06 Appareil de signature audio et procedes associes

Country Status (2)

Country Link
TW (1) TW200522722A (fr)
WO (1) WO2005046201A2 (fr)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008008915A2 (fr) 2006-07-12 2008-01-17 Arbitron Inc. Procédés et systèmes pour attestation de conformité et mesures incitatives
WO2011063520A1 (fr) * 2009-11-30 2011-06-03 Miranda Technologies Inc. Procédé et appareil de délivrance de signatures de signaux audio/vidéo et d'utilisation de celles-ci
US8108887B2 (en) 2008-10-30 2012-01-31 The Nielsen Company (Us), Llc Methods and apparatus for identifying media content using temporal signal characteristics
US8860883B2 (en) 2009-11-30 2014-10-14 Miranda Technologies Partnership Method and apparatus for providing signatures of audio/video signals and for making use thereof
US9942584B2 (en) 2006-03-27 2018-04-10 The Nielsen Company (Us), Llc Methods and systems to meter media content presented on a wireless communication device

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8503991B2 (en) 2008-04-03 2013-08-06 The Nielsen Company (Us), Llc Methods and apparatus to monitor mobile devices

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5672999A (en) * 1996-01-16 1997-09-30 Motorola, Inc. Audio amplifier clipping avoidance method and apparatus
US5881360A (en) * 1995-10-11 1999-03-09 Adcom Information Services, Inc. Non-video path data collection device

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5881360A (en) * 1995-10-11 1999-03-09 Adcom Information Services, Inc. Non-video path data collection device
US5672999A (en) * 1996-01-16 1997-09-30 Motorola, Inc. Audio amplifier clipping avoidance method and apparatus

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
KIM D. ET AL.: 'Auditory processing of speech signals for robust speech recognition in real-world noisy environments' IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING vol. 7, no. 1, January 1999, pages 55 - 69, XP000890825 *

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9942584B2 (en) 2006-03-27 2018-04-10 The Nielsen Company (Us), Llc Methods and systems to meter media content presented on a wireless communication device
US11765411B2 (en) 2006-03-27 2023-09-19 The Nielsen Company (Us), Llc Methods and systems to meter media content presented on a wireless communication device
US11677997B2 (en) 2006-03-27 2023-06-13 The Nielsen Company (Us), Llc Methods and systems to meter media content presented on a wireless communication device
US11190816B2 (en) 2006-03-27 2021-11-30 The Nielsen Company (Us), Llc Methods and systems to meter media content presented on a wireless communication device
US10785519B2 (en) 2006-03-27 2020-09-22 The Nielsen Company (Us), Llc Methods and systems to meter media content presented on a wireless communication device
US10412427B2 (en) 2006-03-27 2019-09-10 The Nielsen Company (Us), Llc Methods and systems to meter media content presented on a wireless communication device
WO2008008911A2 (fr) 2006-07-12 2008-01-17 Arbitron Inc. procédés et systèmes pour une confirmation de conformité et des incitations
WO2008008915A2 (fr) 2006-07-12 2008-01-17 Arbitron Inc. Procédés et systèmes pour attestation de conformité et mesures incitatives
US10547877B2 (en) 2008-10-30 2020-01-28 The Nielsen Company (Us), Llc Methods and apparatus for identifying media content using temporal signal characteristics
US11025966B2 (en) 2008-10-30 2021-06-01 The Nielsen Company (Us), Llc Methods and apparatus for identifying media content using temporal signal characteristics
US9576197B2 (en) 2008-10-30 2017-02-21 The Nielsen Company (Us), Llc Methods and apparatus for identifying media content using temporal signal characteristics
US8108887B2 (en) 2008-10-30 2012-01-31 The Nielsen Company (Us), Llc Methods and apparatus for identifying media content using temporal signal characteristics
US10009635B2 (en) 2008-10-30 2018-06-26 The Nielsen Company (Us), Llc Methods and apparatus for generating signatures to identify media content using temporal signal characteristics
US11627346B2 (en) 2008-10-30 2023-04-11 The Nielsen Company (Us), Llc Methods and apparatus for identifying media content using temporal signal characteristics
US8887191B2 (en) 2008-10-30 2014-11-11 The Nielsen Company (Us), Llc Methods and apparatus for identifying media using temporal signal characteristics
EP2182468B1 (fr) * 2008-10-30 2020-04-01 The Nielsen Company (US), LLC Procédés et appareil pour identifier un contenu de média en utilisant les caractéristiques de signal temporelles
GB2508115A (en) * 2009-11-30 2014-05-21 Miranda Technologies Inc Generating audio signatures and providing an indication of lip sync delay
GB2489133B (en) * 2009-11-30 2014-05-07 Miranda Technologies Inc Method and apparatus for providing signatures of audio/video signals and for making use thereof
GB2508115B (en) * 2009-11-30 2014-08-27 Miranda Technologies Inc Method and apparatus for providing signatures of audio/video signals and for making use thereof
GB2489133A (en) * 2009-11-30 2012-09-19 Miranda Technologies Inc Method and apparatus for providing signatures of audio/video signals and for making use thereof
US10116838B2 (en) 2009-11-30 2018-10-30 Grass Valley Canada Method and apparatus for providing signatures of audio/video signals and for making use thereof
US8860883B2 (en) 2009-11-30 2014-10-14 Miranda Technologies Partnership Method and apparatus for providing signatures of audio/video signals and for making use thereof
WO2011063520A1 (fr) * 2009-11-30 2011-06-03 Miranda Technologies Inc. Procédé et appareil de délivrance de signatures de signaux audio/vidéo et d'utilisation de celles-ci

Also Published As

Publication number Publication date
WO2005046201A3 (fr) 2005-08-11
TW200522722A (en) 2005-07-01

Similar Documents

Publication Publication Date Title
US11627346B2 (en) Methods and apparatus for identifying media content using temporal signal characteristics
US8065700B2 (en) Methods and apparatus for identifying audio/video content using temporal signal characteristics
CA2556548C (fr) Procedes et appareil permettant de determiner les programmes enregistres regardes par les spectateurs
US8020179B2 (en) Methods and apparatus to adaptively select sensor(s) to gather audience measurement data based on a variable system factor and a quantity of data collectible by the sensors
US8006258B2 (en) Methods and apparatus for media source identification and time shifted media consumption measurements
US20040181799A1 (en) Apparatus and method for measuring tuning of a digital broadcast receiver
US20020100047A1 (en) Method of recording programs recommended by opinion leader selected by user, and apparatus for automatically recording broadcasts
CA2645793A1 (fr) Systeme et methode d'adaptation audio
US10547905B2 (en) Systems, methods, and devices for monitoring content viewership using short-range wireless communication
CN101969580A (zh) 基于调谐频点与台标匹配的收视率统计系统及其统计方法
WO2005046201A2 (fr) Appareil de signature audio et procedes associes
CN201893881U (zh) 基于调谐频点与台标匹配的收视率统计系统
AU2001281320A1 (en) Apparatus and method for determining the programme to which a digital broadcast receiver is tuned

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): BW GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
122 Ep: pct application non-entry in european phase