WO2002021507A2 - Voice activity detector for integrated telecommunications processing - Google Patents
- Publication number
- WO2002021507A2 (application PCT/US2001/027596)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- flag
- energy
- determine whether
- voice
- noise
- Prior art date
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
Definitions
- This invention relates generally to signal processors. More particularly, the invention relates to telephone signal processors and to voice activity detectors for integrated telecommunications processing.
- Single-chip digital signal processing devices (DSPs)
- DSPs generally are distinguished from general purpose microprocessors in that DSPs typically support accelerated arithmetic operations by including a dedicated multiplier and accumulator (MAC) for performing multiplication of digital numbers.
- the instruction set for a typical DSP device usually includes a MAC instruction for performing multiplication of new operands and addition with a prior accumulated value stored within an accumulator register.
- a MAC instruction is typically the only instruction provided in prior art digital signal processors where two DSP operations, multiply followed by add, are performed by the execution of one instruction.
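The multiply-accumulate pattern described above can be sketched in Python as follows. This is an illustration only: the names `mac` and `dot_product` are hypothetical and do not correspond to any particular DSP's instruction set.

```python
def mac(acc, x, y):
    """One multiply-accumulate step: multiply two new operands and
    add the product to the previously accumulated value."""
    return acc + x * y


def dot_product(xs, ys):
    """Accumulate a sum of products by issuing one MAC step per pair."""
    acc = 0
    for x, y in zip(xs, ys):
        acc = mac(acc, x, y)
    return acc
```

On a DSP the body of `mac` would be a single instruction; here it is an ordinary function so that the data flow is visible.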
- DSPs digital filtering.
- a DSP is typically programmed with instructions to implement some filter function in the digital or time domain.
- the equation Y_n may be evaluated by using a software program. However, in some applications it is necessary that the equation be evaluated as fast as possible.
- One way to do this is to perform the computations using hardware components such as a DSP device programmed to compute the equation Y_n.
- the multiple DSPs operate in parallel to speed the computation process. In this case, the multiplication of terms is spread across the multipliers of the DSPs equally for simultaneous computations of terms.
- the adding of terms is similarly spread equally across the adders of the DSPs for simultaneous computations. In vectorized processing, the order of processing terms is unimportant since the combination is associative. If the processing order of the terms is altered, it has no effect on the final result expected in a vectorized processing of a function.
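A minimal sketch of spreading a sum of products across several units, assuming four units and an interleaved assignment of terms (both are illustrative choices, not taken from the text). The function names are hypothetical.

```python
def partial_sum_of_products(xs, ys):
    """Work done by one unit: multiply its share of terms and add them."""
    return sum(x * y for x, y in zip(xs, ys))


def parallel_dot(xs, ys, units=4):
    """Split the terms across `units` workers, then combine the partial sums.

    Term i is assigned to unit i % units (an assumed interleaving).
    Because addition is associative, the regrouping does not change
    the result for exact (integer or fixed-point, non-saturating) arithmetic.
    """
    partials = [
        partial_sum_of_products(xs[u::units], ys[u::units])
        for u in range(units)
    ]
    return sum(partials)
```

The result matches a sequential evaluation for integer inputs. With floating-point operands the regrouping can change rounding in the last bits, so the equality holds exactly only for exact arithmetic.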
- Echo cancellation is used to cancel echoes over full duplex telephone communication channels.
- the echo-cancellation process isolates and filters the unwanted signals caused by echoes from the main transmitted signal in a two-way transmission.
- Single or multiple DSP chips can be used to implement an echo canceller having finite impulse response filter to provide echo cancellation.
- echo cancellation is only one part of telecommunication processing.
- telephone processing functions are spread over multiple devices, components or boards in a telephone communication system.
- a telephone, fax, or data modem couples to a local subscriber loop 802 at one end and another local subscriber loop 802' at an opposite end.
- The local subscriber loops 802 and 802' couple to 2-wire/4-wire hybrid circuits 804 and 804', respectively.
- Hybrid circuits 804 are composed of resistor networks, capacitors, and ferrite-core transformers.
- Hybrid circuits 804 convert 4-wire telephone trunk lines 806 (a pair in each direction) running between telephone exchanges of the PSTN 812 to each of the 2-wire local subscriber loops 802 and 802'.
- the hybrid circuits 804 are intended to direct all the energy from a talker on the 4-wire trunk 806 at a far end to a listener on a 2-wire local subscriber loop 802 at a near end.
- Echoes 810' are often formed when a speech signal from a far end talker leaves a far end hybrid 804' on a pair of the four wires 806', and arrives at the near end after traversing the PSTN 812, and may be heard by the listener at the near side. In traditional telephone networks, an echo canceller is placed at each end of the PSTN in order to reduce and attempt to eliminate this echo.
- the prior art digital echo canceller 900 couples between the hybrid circuit 804 and the public switched telephone network (PSTN) 902 on the telephone trunk lines.
- the governing specification for digital echo cancellers is the ITU-T recommendation G.168, Digital network echo cancellers.
- the following terms from ITU-T document G.168 are used herein and are illustrated in Figure 9.
- the end or side of the connection towards the local handset is referred to as the near end, near side or send side 910.
- the end or side of the connection towards the distant handset is referred to as the far end, far side or receive side 920.
- the part of the circuit from the near end 910 to the far end 920 is the send path 930.
- the part of the circuit from the far end to the near end is the receive path 935.
- the part of the circuit (i.e. copper wire, hybrid) in the local loop 802, between the end system (subscriber or telephone system 108) and the central-office termination of the hybrid 804, is the end path.
- Speech signals entering the echo canceller 900 from the near end 910 are the send input S_in.
- Speech signals entering the echo canceller from the far end 920 are the received input R_in.
- Speech signals output from the echo canceller 900 to the far end 920 are the send output S_out.
- Speech signals exiting the echo canceller to the near end 910 are the received output R_out.
- the typical prior art digital echo canceller 900 includes the basic components of an echo estimator 902, a digital subtractor 904, and a non-linear processor 906.
- the echo-cancellation process in the typical prior art digital echo canceller 900 begins by eliminating impedance mismatches. In order to do so, the typical digital echo canceller 900 taps the receive-side input signal (R_in). R_in is processed to generate an estimate of S_in in the echo estimator 902. S_in serves as the reference signal for the echo cancellation process. R_in is also passed through to the near end 910 without change as the R_out signal.
- the echo estimator 902 is a linear finite impulse response (FIR) convolution filter implemented in a DSP.
- the estimator 902 accepts successive samples of voice on R_in (typically a 16 bit sample every 125 microseconds).
- the voice samples are multiplied with a set of filter coefficients approximating the impulse response of circuitry in the end path to generate an echo estimation. Over time, the set of filter coefficients is changed (i.e. adapted) until it accurately represents the desired impulse response to form an accurate echo estimation.
- the echo estimation is coupled into the subtractor 904. If the echo estimation is accurate, it is substantially equivalent to the actual echo on S_in, and the output from the subtractor 904 into the non-linear processor 906 has linear echoes substantially removed.
- the non-linear processor 906 is used to remove non-linear echo sources.
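The estimator and subtractor stages described above can be sketched as follows. The text does not say how the filter coefficients are adapted, so this sketch assumes a normalized LMS (NLMS) update as a common stand-in. The function name `cancel_echo` and the parameters `taps`, `mu`, and `eps` are hypothetical, and the non-linear processor stage is omitted.

```python
def cancel_echo(r_in, s_in, taps=8, mu=0.5, eps=1e-6):
    """Return the subtractor output for each sample.

    r_in: far-end samples tapped from the receive path (R_in).
    s_in: near-end samples that contain the echo (S_in).
    The adaptation rule is NLMS, which is an assumption of this sketch.
    """
    coeffs = [0.0] * taps     # adaptive FIR coefficients (end-path model)
    history = [0.0] * taps    # most recent R_in samples, newest first
    residual = []
    for r, s in zip(r_in, s_in):
        history = [r] + history[:-1]
        # Echo estimator: FIR convolution of recent R_in with the coefficients.
        estimate = sum(c * x for c, x in zip(coeffs, history))
        # Subtractor: remove the estimated echo from the send-side input.
        error = s - estimate
        # Assumed NLMS update: step scaled by the recent input power.
        power = sum(x * x for x in history) + eps
        coeffs = [c + mu * error * x / power for c, x in zip(coeffs, history)]
        residual.append(error)
    return residual
```

When the end path is a short linear filter and there is no near-end speech, the residual shrinks toward zero as the coefficients converge. A practical canceller would also need double-talk handling, which this sketch leaves out.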
- Figure 1A is a block diagram of a system utilizing the present invention.
- Figure 1B is a block diagram of a printed circuit board utilizing the present invention within the gateways of the system in Figure 1A.
- FIG. 2 is a block diagram of the Application Specific Signal Processor (ASSP) of the present invention.
- Figure 3 is a block diagram of an instance of the core processors within the ASSP of the present invention.
- Figure 4 is a block diagram of the RISC processing unit within the core processors of Figure 3.
- Figure 5A is a block diagram of an instance of the signal processing units within the core processors of Figure 3.
- Figure 5B is a more detailed block diagram of Figure 5A illustrating the bus structure of the signal processing unit.
- Figure 6A is an exemplary instruction sequence illustrating a program model for
- Figure 6B is a chart illustrating the permutations of the dyadic DSP instructions.
- Figure 6C is an exemplary bitmap for a control extended dyadic DSP instruction.
- Figure 6D is an exemplary bitmap for a non-extended dyadic DSP instruction.
- Figures 6E and 6F list the set of 20-bit instructions for the ISA of the present invention.
- Figure 6G lists the set of extended control instructions for the ISA of the present invention.
- Figure 6H lists the set of 40-bit DSP instructions for the ISA of the present invention.
- Figure 6I lists the set of addressing instructions for the ISA of the present invention.
- Figure 7 is a block diagram illustrating the instruction decoding and configuration of the functional blocks of the signal processing units.
- Figure 8 is a prior art block diagram illustrating a PSTN telephone network and echoes therein.
- Figure 9 is a prior art block diagram illustrating a typical prior art echo canceller for a PSTN telephone network.
- FIG. 10 is a block diagram of a packet network system incorporating the integrated telecommunications processor of the present invention.
- Figure 11 is a block diagram of the firmware telecommunication processing modules of the integrated telecommunications processor for one of multiple full duplex channels.
- Figure 12 is a flow chart of telecommunication processing from the near end to the packet network.
- Figure 13 is a flow chart of the telecommunication processing of a packet from the network into the integrated telecommunications processor into TDM signals at the near end.
- Figure 14A is a block diagram of the data flows and interaction between exemplary functional blocks of the integrated telecommunications processor 150 for telephony processing.
- Figure 14B is a flow chart of an algorithm for performing voice activity detection.
- Figure 14C is a flow chart of an algorithm for fast Fourier transform (FFT) processing of input speech for voice activity detection.
- Figure 14D is a flow chart for zero crossing detection for voice activity detection.
- Figure 14E is a flow chart of a process for noise detection for voice activity detection.
- Figure 14F is a flow chart of a process for energy discrimination for voice activity detection.
- Figure 14G is a flow chart of a process for instantaneous energy discrimination for voice activity detection.
- Figure 15 is a block diagram of exemplary memory maps into the memories of the integrated telecommunications processor 150.
- Figure 16 is a block diagram of an exemplary memory map for the global buffer memory of the integrated telecommunications processor 150.
- Figure 17 is an exemplary time line diagram of reception and processing time for frames of data.
- Figure 18 is an exemplary time line diagram of how core processors of the integrated telecommunications processor 150 process frames of data for multiple communication channels.
- Each ASSP includes a serial interface, a host interface, a buffer memory and four core processors in order to simultaneously process multiple channels of voice or data.
- Each core processor preferably includes a reduced instruction set computer (RISC) processor and four signal processing units (SPs).
- Each SP includes multiple arithmetic blocks to simultaneously process multiple voice and data communication signal samples for communication over IP, ATM, Frame Relay, or other packetized network.
- the four signal processing units can execute digital signal processing algorithms in parallel.
- Each ASSP is flexible and can be programmed to perform many network functions or data/voice processing functions, including voice and data compression/decompression in telecommunication systems (such as CODECs), particularly packetized telecommunication networks, simply by altering the software program controlling the commands executed by the ASSP.
- An instruction set architecture for the ASSP is tailored to digital signal processing applications including audio and speech processing such as compression/decompression and echo cancellation.
- the instruction set architecture implemented with the ASSP is adapted to DSP algorithmic structures. This adaptation of the ISA of the present invention to DSP algorithmic structures balances the ease of implementation, processing efficiency, and programmability of DSP algorithms.
- the instruction set architecture may be viewed as being two component parts, one (RISC ISA) corresponding to the RISC control unit and another (DSP ISA) to the DSP datapaths of the signal processing units 300.
- the RISC ISA is a register based architecture including 16-registers within the register file 413, while the DSP ISA is a memory based architecture with efficient digital signal processing instructions.
- the instruction word for the ASSP is typically 20 bits but can be expanded to 40 bits to control two instructions to be executed in series or parallel, such as two RISC control instructions and extended DSP instructions.
- the instruction set architecture of the ASSP has four distinct types of instructions to optimize the DSP operational mix. These are (1) a 20-bit DSP instruction that uses mode bits in control registers (i.e. mode registers), (2) a 40-bit DSP instruction having control extensions that can override mode registers, (3) a 20-bit dyadic DSP instruction, and (4) a 40 bit dyadic DSP instruction.
- All DSP instructions of the instruction set architecture of the ASSP are dyadic DSP instructions to execute two operations in one instruction with one cycle throughput.
- a dyadic DSP instruction is a combination of two DSP instructions or operations in one instruction and includes a main DSP operation (MAIN OP) and a sub DSP operation (SUB OP).
- the instruction set architecture of the present invention can be generalized to combining any pair of basic DSP operations to provide very powerful dyadic instruction combinations.
- the DSP arithmetic operations in the preferred embodiment include a multiply instruction (MULT), an addition instruction (ADD), a minimize/maximize instruction (MIN/MAX) also referred to as an extrema instruction, and a no operation instruction (NOP) each having an associated operation code ("opcode").
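A rough sketch of how a main operation and a sub operation might combine in one dyadic instruction. The text names the operations but does not spell out how operands flow between them, so this sketch assumes the sub operation consumes the main operation's result together with a third operand. The function `dyadic` and the table `OPS` are hypothetical.

```python
import operator

# Basic DSP operations named in the text, mapped to Python callables.
OPS = {
    "MULT": operator.mul,
    "ADD": operator.add,
    "MIN": min,
    "MAX": max,
}


def dyadic(main_op, sub_op, a, b, c):
    """Apply MAIN OP to (a, b), then SUB OP to (main result, c).

    Chaining the two operations this way is an assumption of this sketch.
    "NOP" passes its first input through unchanged.
    """
    main_result = a if main_op == "NOP" else OPS[main_op](a, b)
    return main_result if sub_op == "NOP" else OPS[sub_op](main_result, c)
```

Under this assumption, `dyadic("MULT", "ADD", x, y, acc)` behaves like a multiply-accumulate, and pairing MULT with NOP reduces to a plain multiply.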
- the present invention efficiently executes these dyadic DSP instructions by means of the instruction set architecture and the hardware architecture of the application specific signal processor.
- an integrated voice activation detector detects whether voice is present.
- the integrated voice activation detector includes a semiconductor integrated circuit having at least one signal processing unit to perform voice detection and a storage device to store signal processing instructions for execution by the at least one signal processing unit to: detect whether noise is present to determine whether a noise flag should be set, detect a predetermined number of zero crossings to determine whether a zero crossing flag should be set, detect whether a threshold amount of energy is present to determine whether an energy flag should be set, and detect whether instantaneous energy is present to determine whether an instantaneous energy flag should be set. Utilizing a combination of the noise, zero crossing, energy, and instantaneous energy flags the integrated voice activation detector determines whether voice is present.
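The four flags described above can be sketched as follows. This excerpt does not give the thresholds or the rule for combining the flags, so every numeric default and the final boolean combination are assumptions made for illustration. The names `zero_crossings` and `detect_voice` are hypothetical.

```python
def zero_crossings(frame):
    """Count sign changes between consecutive samples."""
    return sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))


def detect_voice(frame, noise_floor, zc_count=4,
                 energy_threshold=1000.0, inst_threshold=10000.0):
    """Return (voice_present, flags) for one frame of samples.

    All thresholds are placeholder values, not taken from the text.
    """
    energy = sum(x * x for x in frame) / len(frame)  # mean frame energy
    peak = max(x * x for x in frame)                 # largest single-sample energy
    flags = {
        # Frame energy is close to the tracked background level.
        "noise": energy < 2.0 * noise_floor,
        # At least a predetermined number of zero crossings was seen.
        "zero_crossing": zero_crossings(frame) >= zc_count,
        # A threshold amount of energy is present in the frame.
        "energy": energy >= energy_threshold,
        # A single sample carries a large instantaneous energy.
        "instantaneous_energy": peak >= inst_threshold,
    }
    # Assumed combination rule: not noise, enough energy, and one more cue.
    voice = (not flags["noise"]) and flags["energy"] and (
        flags["zero_crossing"] or flags["instantaneous_energy"]
    )
    return voice, flags
```

Returning the individual flags alongside the decision keeps each detector easy to inspect on its own, which mirrors the separate flow charts of Figures 14D through 14G.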
- the system 100 includes a network 101 which is a packetized or packet-switched network, such as IP, ATM, or frame relay.
- the network 101 allows the communication of voice/speech and data between endpoints in the system 100, using packets. Data may be of any type including audio, video, email, and other generic forms of data.
- the voice or data requires packetization when transceived across the network 101.
- the system 100 includes gateways 104A and 104B in order to packetize the information received for transmission across the network 101.
- a gateway is a device for connecting multiple networks and devices that use different protocols.
- Voice and data information may be provided to a gateway 104 from a number of different sources in a variety of digital formats.
- analog voice signals are transceived by a telephone 108.
- digital voice signals are transceived at public branch exchanges (PBX) 112A and 112B which are coupled to multiple telephones, fax machines, or data modems.
- Digital voice signals are transceived between PBX 112A and PBX 112B with gateways 104A and 104B, respectively over the packet network 101.
- Digital data signals may also be transceived directly between a digital modem 114 and a gateway 104A.
- Digital modem 114 may be a Digital Subscriber Line (DSL) modem or a cable modem.
- Data signals may also be coupled into system 100 by a wireless communication system by means of a mobile unit 118 transceiving digital signals or analog signals wirelessly to a base station 116.
- Base station 116 converts analog signals into digital signals or directly passes the digital signals to gateway 104B.
- Data may be transceived by means of modem signals over the plain old telephone system (POTS) 107B using a modem 110.
- Modem signals communicated over POTS 107B are traditionally analog in nature and are coupled into a switch 106B of the public switched telephone network (PSTN).
- analog signals from the POTS 107B are digitized and transceived to the gateway 104B by time division multiplexing (TDM) with each time slot representing a channel and one DS0 input to gateway 104B.
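A small sketch of the time-slot-to-channel mapping described above, assuming the multiplexed stream is a flat list of samples in which slot i of every frame belongs to channel i. The function name `demux_tdm` is hypothetical.

```python
def demux_tdm(stream, channels):
    """Split an interleaved TDM sample stream into per-channel lists.

    Each frame holds one sample per channel, in slot order, so
    channel k receives samples k, k + channels, k + 2 * channels, ...
    """
    if len(stream) % channels:
        raise ValueError("stream must contain a whole number of frames")
    return [stream[slot::channels] for slot in range(channels)]
```

For example, a stream of two frames with three slots each yields three channels of two samples each.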
- incoming signals are packetized for transmission across the network 101.
- Signals received by the gateways 104A and 104B from the network 101 are depacketized and transcoded for distribution to the appropriate destination.
- NIC 130 of a gateway 104 includes one or more application-specific signal processors (ASSPs) 150A-150N.
- Line interface devices 131 of NIC 130 provide interfaces to various devices connected to the gateway, including the network 101. In interfacing to the network 101, the line interface devices packetize data for transmission out on the network 101 and depacketize data which is to be received by the ASSP devices. Line interface devices 131 process information received by the gateway on the receive bus 134 and provides it to the ASSP devices. Information from the ASSP devices 150 is communicated on the transmit bus 132 for transmission out of the gateway.
- a traditional line interface device is a multichannel serial interface or a UTOPIA device.
- the NIC 130 couples to a gateway backplane/network interface bus 136 within the gateway 104.
- Bridge logic 138 transceives information between bus 136 and NIC 130.
- Bridge logic 138 transceives signals between the NIC 130 and the backplane/network interface bus 136 onto the host bus 139 for communication to either one or more of the ASSP devices 150A-150N, a host processor
- ASSP 150 optional local memory 145A through 145N (generally referred to as optional local memory 145), respectively.
- Digital data on the receive bus 134 and transmit bus 132 is preferably communicated in bit wide fashion. While internal memory within each ASSP may be sufficiently large to be used as a scratchpad memory, optional local memory 145 may be used by each of the ASSPs 150 if additional memory space is necessary.
- Each of the ASSPs 150 provides signal processing capability for the gateway.
- the type of signal processing provided is flexible because each ASSP may execute differing signal processing programs.
- Typical signal processing and related voice packetization functions for an ASSP include (a) echo cancellation; (b) video, audio, and voice/speech compression/decompression (voice/speech coding and decoding); (c) delay handling (packets, frames); (d) loss handling; (e) connectivity (LAN and WAN); (f) security (encryption/decryption); (g) telephone connectivity; (h) protocol processing (reservation and transport protocols, RSVP, TCP/IP, RTP, UDP for IP, and AAL2, AAL1, AAL5 for ATM); (i) filtering; (j) silence suppression; (k) length handling (frames, packets); and other digital signal processing functions associated with the communication of voice and data over a communication system.
- Each ASSP 150 can perform other functions in order to transmit voice and data to the various endpoints of the system 100 within a packet data stream over a packetized network.
- Referring now to FIG. 2, a block diagram of the ASSP 150 is illustrated.
- Each of the core processors 200A-200D is respectively coupled to a data memory 202A-202D and a program memory 204A-204D.
- Each of the core processors 200A-200D communicates with outside channels through the multi-channel serial interface 206, the multi-channel memory movement engine 208, buffer memory 210, and data memory 202A-202D.
- the ASSP 150 further includes an external memory interface 212 to couple to the external optional local memory 145.
- the ASSP 150 includes an external host interface 214 for interfacing to the external host processor 140 of Figure 1B.
- Further included within the ASSP 150 are timers 216, clock generators and a phase-lock loop 218, miscellaneous control logic 220, and a Joint Test Action Group (JTAG) test access port 222 for boundary scan testing.
- the multichannel serial interface 206 may be replaced with a UTOPIA parallel interface for some applications such as ATM.
- the ASSP 150 further includes a microcontroller 223 to perform process scheduling for the core processors 200A-200D and the coordination of the data movement within the ASSP as well as an interrupt controller 224 to assist in interrupt handling and the control of the ASSP 150.
- Core processor 200 is the block diagram for each of the core processors 200A-200D.
- Data memory 202 and program memory 204 refer to a respective instance of data memory 202A-202D and program memory 204A-204D, respectively.
- the core processor 200 includes four signal processing units SP0 300A, SP1 300B, SP2 300C and SP3 300D.
- the core processor 200 further includes a reduced instruction set computer (RISC) control unit 302 and a pipeline control unit 304.
- the signal processing units 300A-300D perform the signal processing tasks on data while the RISC control unit 302 and the pipeline control unit 304 perform control tasks related to the signal processing function performed by the SPs 300A-300D.
- the control provided by the RISC control unit 302 is coupled with the SPs 300A-300D at the pipeline level to yield a tightly integrated core processor 200 that keeps the utilization of the signal processing units 300 at a very high level.
- the signal processing tasks are performed on the datapaths within the signal processing units 300A-300D.
- the nature of the DSP algorithms is such that they are inherently vector operations on streams of data that have minimal temporal locality (data reuse). Hence, a data cache with demand paging is not used because it would not function well and would degrade operational performance. Therefore, the signal processing units 300A-300D are allowed to access vector elements (the operands) directly from data memory 202 without the overhead of issuing a number of load and store instructions into memory, resulting in very efficient data processing.
- the instruction set architecture of the present invention, having a 20 bit instruction word which can be expanded to a 40 bit instruction word, achieves better efficiencies than VLIW architectures using 256-bit or wider instruction widths by adapting the ISA to DSP algorithmic structures.
- the adapted ISA leads to very compact and low-power hardware that can scale to higher computational requirements.
- the operands that the ASSP can accommodate are varied in data type and data size.
- the data type may be real or complex, an integer value or a fractional value, with vectors having multiple elements of different sizes.
- the data size in the preferred embodiment is 64 bits but larger data sizes can be accommodated with proper instruction coding. Referring now to Figure 4, a detailed block diagram of the RISC control unit 302 is illustrated.
- RISC control unit 302 includes a data aligner and formatter 402, a memory address generator 404, three adders 406A-406C, an arithmetic logic unit (ALU) 408, a multiplier 410, a barrel shifter 412, and a register file 413.
- the register file 413 points to a starting memory location from which memory address generator 404 can generate addresses into data memory 202.
- the RISC control unit 302 is responsible for supplying addresses to data memory so that the proper data stream is fed to the signal processing units 300A-300D.
- the RISC control unit 302 is a register to register organization with load and store instructions to move data to and from data memory 202.
- Data memory addressing is performed by RISC control unit using a 32-bit register as a pointer that specifies the address, post-modification offset, and type and permute fields.
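A sketch of packing the named fields into a 32-bit pointer word. The text lists the fields (address, post-modification offset, type, permute) but this excerpt gives neither their widths nor their bit positions, so the layout in `FIELDS` is entirely hypothetical, as are the function names.

```python
# Hypothetical layout: (field name, width in bits), least significant first.
# The widths sum to 32; the real register layout is not given in the text.
FIELDS = (("address", 16), ("offset", 8), ("type", 4), ("permute", 4))


def pack_pointer(**values):
    """Pack field values into one 32-bit word using the assumed layout."""
    word, shift = 0, 0
    for name, width in FIELDS:
        value = values.get(name, 0)
        if not 0 <= value < (1 << width):
            raise ValueError(f"{name} does not fit in {width} bits")
        word |= value << shift
        shift += width
    return word


def unpack_pointer(word):
    """Recover the individual fields from a packed 32-bit word."""
    fields, shift = {}, 0
    for name, width in FIELDS:
        fields[name] = (word >> shift) & ((1 << width) - 1)
        shift += width
    return fields
```

Keeping the address, the post-modification offset, and the data type in one register means a single pointer carries everything the address generator needs for the next access.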
- the type field allows a variety of natural DSP data to be supported as a "first class citizen" in the architecture. For instance, the complex type allows direct operations on complex data stored in memory removing a number of bookkeeping instructions. This is useful in supporting QAM demodulators in data modems very efficiently.
- Referring now to FIG. 5A, a block diagram of a signal processing unit 300 is illustrated which represents an instance of the SPs 300A-300D.
- Each of the signal processing units 300 includes a data typer and aligner 502, a first multiplier M1 504A, a compressor 506, a first adder A1 510A, a second adder A2 510B, an accumulator register 512, a third adder A3 510C, and a second multiplier M2 504B.
- Adders 510A-510C are similar in structure and are generally referred to as adder 510.
- Multipliers 504A and 504B are similar in structure and generally referred to as multiplier 504.
- Each of the multipliers 504A and 504B has a multiplexer, 514A and 514B respectively, at its input stage to multiplex different inputs from different busses into the multipliers.
- Each of the adders 510A, 510B, and 510C also has a multiplexer, 520A, 520B, and 520C respectively, at its input stage to multiplex different inputs from different busses into the adders.
- These multiplexers and other control logic allow the adders, multipliers and other components within the signal processing units 300A-300D to be flexibly interconnected by proper selection of multiplexers.
- multiplier M1 504A, compressor 506, adder A1 510A, adder A2 510B and accumulator 512 can receive inputs directly from external data buses through the data typer and aligner 502.
- adder A3 510C and multiplier M2 504B receive inputs from the accumulator 512 or the outputs from the execution units multiplier M1 504A, compressor 506, adder A1 510A, and adder A2 510B.
- Program memory 204 couples to the pipe control 304 which includes an instruction buffer that acts as a local loop cache.
- the instruction buffer in the preferred embodiment has the capability of holding four instructions.
- the instruction buffer of the pipe control 304 reduces the power consumed in accessing the main memories to fetch instructions during the execution of program loops.
- Output signals are coupled out of the signal processor 300 on the Z output bus 532 through the data typer and aligner 502.
- Input signals are coupled into the signal processor 300 on the X input bus 531 and Y input bus 533 through the data typer and aligner 502.
- the data typer and aligner 502 has a different data bus to couple to each of multiplier M1 504A, compressor 506, adder A1 510A, adder A2 510B, and accumulator register AR 512.
- Output data is coupled from the accumulator register AR 512 into the data typer and aligner 502.
- Multiplier M1 504A has buses to couple its output into the inputs of the compressor 506, adder A1 510A, adder A2 510B, and the accumulator registers AR 512.
- Compressor 506 has buses to couple its output into the inputs of adder A1 510A and adder A2 510B.
- Adder A1 510A has a bus to couple its output into the accumulator registers 512.
- Adder A2 510B has buses to couple its output into the accumulator registers 512.
- Accumulator registers 512 have buses to couple their output into multiplier M2 504B, adder A3 510C, and data typer and aligner 502.
- Adder A3 510C has buses to couple its output into the multiplier M2 504B and the accumulator registers 512.
- Multiplier M2 504B has buses to couple its output into the inputs of the adder A3 510C and the accumulator registers AR 512.
- the instruction set architecture of the ASSP 150 is tailored to digital signal processing applications including audio and speech processing such as compression/decompression and echo cancellation.
- the instruction set architecture implemented with the ASSP 150 is adapted to DSP algorithmic structures.
- the adaptation of the ISA of the present invention to DSP algorithmic structures is a balance between ease of implementation, processing efficiency, and programmability of DSP algorithms.
- the ISA of the present invention provides for data movement operations, DSP/arithmetic/logical operations, program control operations (such as function calls/returns, unconditional/conditional jumps and branches), and system operations (such as privilege, interrupt/trap/hazard handling and memory management control).
- an exemplary instruction sequence 600 is illustrated for a DSP algorithm program model employing the instruction set architecture of the present invention.
- the instruction sequence 600 has an outer loop 601 and an inner loop 602. Because DSP algorithms tend to perform repetitive computations, instructions 605 within the inner loop 602 are executed more often than others.
- Instructions 603 are typically parameter setup code to set the memory pointers, provide for the setup of the outer loop 601, and other 2x20 control instructions.
- Instructions 607 are typically context save and function return instructions or other 2x20 control instructions. Instructions 603 and 607 are often considered overhead instructions which are typically infrequently executed.
- Instructions 604 are typically to provide the setup for the inner loop 602, other control through 2x20 control instructions, or offset extensions for pointer backup.
- Instructions 606 typically provide tear down of the inner loop 602, other control through 2x20 control instructions, and combining of datapath results within the signal processing units.
- Instructions 605 within the inner loop 602 typically provide inner loop execution of DSP operations, control of the four signal processing units 300 in a single instruction multiple data execution mode, memory access for operands, dyadic DSP operations, and other DSP functionality through the 20/40 bit DSP instructions of the ISA of the present invention. Because instructions 605 are so often repeated, significant improvement in operational efficiency may be had by providing the DSP instructions, including general dyadic instructions and dyadic DSP instructions, within the ISA of the present invention.
- the instruction set architecture of the ASSP 150 can be viewed as being two component parts, one (RISC ISA) corresponding to the RISC control unit and another (DSP ISA) to the DSP datapaths of the signal processing units 300.
- the RISC ISA is a register based architecture including sixteen registers within the register file 413, while the DSP ISA is a memory based architecture with efficient digital signal processing instructions.
- the instruction word for the ASSP is typically 20 bits but can be expanded to 40-bits to control two RISC or DSP instructions to be executed in series or parallel, such as a RISC control instruction executed in parallel with a DSP instruction, or a 40 bit extended RISC or DSP instruction.
- the ISA of the ASSP 150 which accelerates these calculations allows efficient chaining of different combinations of operations. Because these type of operations require three operands, they must be available to the processor. However, because the device size places limits on the bus structure, bandwidth is limited to two vector reads and one vector write each cycle into and out of data memory 202. Thus one of the operands, such as B or C, needs to come from another source within the core processor 200. The third operand can be placed into one of the registers of the accumulator 512 or the RISC register file 413. In order to accomplish this within the core processor 200 there are two subclasses of the 20-bit DSP instructions which are (1) A and
- B specified by a 4-bit specifier
- C and D by a 1-bit specifier
- the two 20-bit sections are control instructions that are executed serially
- the two 20-bit sections are DSP instructions that are executed serially
- the two 20-bit sections form one extended DSP instruction, the two halves of which are executed simultaneously.
- the ISA of the ASSP 150 is fully predicated providing for execution prediction.
- a 6-bit specifier is used in the DSP extended instructions to access operands in memory and registers.
- the MSB (Bit 5) indicates whether the access is a memory access or register access. In the preferred embodiment, if Bit 5 is set to logical one, it denotes a memory access for an operand. If Bit 5 is set to a logical zero, it denotes a register access for an operand.
- If Bit 5 is set to 1, the contents of a specified register (rX where X: 0-7) are used to obtain the effective memory address and post-modify the pointer field by one of two possible offsets specified in one of the specified rX registers. If Bit 5 is set to 0, Bit 4 determines which register set has the contents of the desired operand. If Bit 4 is set to 0, then the remaining specified bits 3:0 control access to the registers within the register file 413 or to registers within the signal processing units 300.
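The decoding of the 6-bit operand specifier described above can be sketched as follows. This is a minimal illustration, not the patent's decoder: the `Operand` type and `decode_specifier` name are hypothetical, and the choice of the low three bits to select the pointer register r0-r7 for a memory access is an assumption, since the text does not state which bits carry the register number.

```python
from typing import NamedTuple


class Operand(NamedTuple):
    kind: str   # "memory" or "register"
    bank: int   # value of Bit 4 (register-set select for register accesses)
    index: int  # register number taken from the low bits


def decode_specifier(spec: int) -> Operand:
    """Split a 6-bit operand specifier into its fields."""
    if not 0 <= spec < 64:
        raise ValueError("specifier must fit in 6 bits")
    is_memory = (spec >> 5) & 1  # Bit 5: 1 = memory access, 0 = register access
    bank = (spec >> 4) & 1       # Bit 4: selects the register set
    if is_memory:
        # Assumption: the low three bits name the pointer register r0-r7.
        return Operand("memory", bank, spec & 0b111)
    # Register access: bits 3:0 select the register within the chosen set.
    return Operand("register", bank, spec & 0b1111)
```

For example, `decode_specifier(0b100011)` describes a memory access through pointer register r3 under the stated assumption.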
- Multiply Controls the execution of the main multiplier connected to data buses from memory. Controls: Rounding, sign of multiply
- Controls absolute value control of the inputs, Global or running max/min with T register, TR register recording control Second operation: add, sub, mult, mac, min, max
- the ASSP 150 can execute these DSP arithmetic operations in vector or scalar fashion. In scalar execution, a reduction or combining operation is performed on the
- the 20-bit DSP instruction words have 4-bit operand specifiers that can directly access data memory using 8 address registers (r0-r7) within the register file 413 of the RISC control unit 302.
- the method of addressing by the 20 bit DSP instruction word is regular indirect with the address register specifying the pointer into memory, post- modification value, type of data accessed and permutation of the data needed to execute the algorithm efficiently. All of the DSP instructions control the multipliers 504A-504B, adders 510A-510C, compressor 506 and the accumulator 512, the functional units of each signal processing unit 300A-300D.
- DSP extensions that control the lower rows of functional units within a signal processing unit 300 to accelerate block processing.
- the 40-bit control instructions with the 20-bit extensions further allow a large immediate value (16 to 20 bits) to be specified in the instruction and powerful bit manipulation instructions.
- Efficient DSP execution is provided with 2x20-bit DSP instructions with the first 20 bits controlling the top functional units (adders 510A and 510B, multiplier 504A, compressor 506) that interface to data buses from memory and the second 20 bits controlling the bottom functional units (adder 510C and multiplier 504B) that use internal or local data as operands.
- the top functional units also referred to as main units, reduce the inner loop cycles in the inner loop 602 by parallelizing across consecutive taps or sections.
- the bottom functional units cut the outer loop cycles in the outer loop 601 in half by parallelizing block DSP algorithms across consecutive samples.
- Efficient DSP execution is also improved by the hardware architecture of the present invention.
- efficiency is improved in the manner that data is supplied to and from data memory 202 to feed the four signal processing units 300 and the DSP functional units therein.
- the data highway is comprised of two buses, X bus 531 and Y bus 533, for X and Y source operands, and one Z bus 532 for a result write. All buses, including X bus 531, Y bus 533, and Z bus 532, are preferably 64 bits wide. The buses are uni-directional to simplify the physical design and reduce transit times of data.
- the parallel load field can only access registers within the register file 413 of the RISC control unit 302.
- the four signal processing units 300A-300D in parallel provide four parallel MAC units (multiplier 504A, adder 510A, and accumulator 512) that can make simultaneous computations. This reduces the cycle count from 4 cycles ordinarily required to perform four MACs to only one cycle.
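The cycle saving from four parallel MAC units can be illustrated with a small simulation. This is only a sketch: `mac_cycle` and `dot_product` are hypothetical names, each call to `mac_cycle` stands in for one hardware cycle, and the final `sum` plays the role of a combining step across the four accumulators.

```python
def mac_cycle(acc, x, y):
    """One simulated cycle: every lane performs acc += x * y at once."""
    return [a + xi * yi for a, xi, yi in zip(acc, x, y)]


def dot_product(xs, ys, lanes=4):
    """Return (result, cycles) for a dot product spread over `lanes` MAC units."""
    if len(xs) != len(ys):
        raise ValueError("operand vectors must have equal length")
    acc = [0] * lanes
    cycles = 0
    for start in range(0, len(xs), lanes):
        x = list(xs[start:start + lanes])
        y = list(ys[start:start + lanes])
        pad = lanes - len(x)  # zero-fill a short final group
        x += [0] * pad
        y += [0] * pad
        acc = mac_cycle(acc, x, y)
        cycles += 1
    # Combine the per-lane accumulators into one scalar result.
    return sum(acc), cycles
```

With eight taps, `dot_product(xs, ys)` takes two simulated cycles, while `dot_product(xs, ys, lanes=1)` takes eight.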
- DYADIC DSP INSTRUCTIONS All DSP instructions of the instruction set architecture of the ASSP 150 are dyadic DSP instructions within the 20 bit or 40 bit instruction word.
- a dyadic DSP instruction informs the ASSP in one instruction and one cycle to perform two operations.
- FIG 6B is a chart illustrating the permutations of the dyadic DSP instructions.
- the dyadic DSP instruction 610 includes a main DSP operation 611 (MAIN OP) and a sub DSP operation 612 (SUB OP), a combination of two DSP instructions or operations in one dyadic instruction.
- the instruction set architecture of the present invention can be generalized to combining any pair of basic DSP operations to provide very powerful dyadic instruction combinations.
- Compound DSP operational instructions can provide uniform acceleration for a wide variety of DSP algorithms not just multiply-accumulate intensive filters.
- the DSP instructions or operations in the preferred embodiment include a multiply instruction (MULT), an addition instruction (ADD), a minimize/maximize instruction (MIN/MAX) also referred to as an extrema instruction, and a no operation instruction (NOP) each having an associated operation code ("opcode"). Any two DSP instructions can be combined together to form a dyadic DSP instruction.
- the NOP instruction is used for the MAIN OP or SUB OP when a single DSP operation is desired to be executed by the dyadic DSP instruction.
- the general DSP instructions such as vector and scalar operations of multiplication or addition, positive or negative multiplication, and positive or negative addition (i.e. subtraction).
- FIG. 6C illustrates bitmap syntax for a control extended dyadic DSP instruction
- Figure 6D illustrates bitmap syntax for a non-extended dyadic DSP instruction.
- the instruction word is the twenty most significant bits of a forty bit word while the extended bitmap syntax has an instruction word of forty bits.
- the three most significant bits (MSBs), bits numbered 37 through 39, in each indicate the MAIN OP instruction type while the SUB OP is located near the middle or end of the instruction bits at bits numbered 20 through 22.
- the MAIN OP instruction codes are 000 for NOP, 101 for ADD, 110 for MIN/MAX, and 100 for MULT.
- the SUB OP code for the given DSP instruction varies according to what MAIN OP code is selected.
- the SUB OPs are 000 for NOP, 001 or 010 for ADD, 100 or 011 for a negative ADD or subtraction, 101 or 110 for MIN, and 111 for MAX.
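The opcode fields listed above can be extracted from a 40-bit instruction word as sketched below. The tables copy the codes given in the text; the function name `decode_dyadic` is hypothetical, and because the text notes that SUB OP codes vary with the selected MAIN OP, the single `SUB_OPS` table here should be read as one illustrative case rather than the full encoding.

```python
MAIN_OPS = {0b000: "NOP", 0b101: "ADD", 0b110: "MIN/MAX", 0b100: "MULT"}

SUB_OPS = {
    0b000: "NOP",
    0b001: "ADD", 0b010: "ADD",
    0b100: "SUB", 0b011: "SUB",  # negative ADD, i.e. subtraction
    0b101: "MIN", 0b110: "MIN",
    0b111: "MAX",
}


def decode_dyadic(word40: int):
    """Return (MAIN OP, SUB OP) names for a 40-bit dyadic instruction word."""
    if not 0 <= word40 < (1 << 40):
        raise ValueError("instruction word must fit in 40 bits")
    main = (word40 >> 37) & 0b111  # bits 39..37
    sub = (word40 >> 20) & 0b111   # bits 22..20
    return MAIN_OPS.get(main, "reserved"), SUB_OPS.get(sub, "reserved")
```

A word with `100` in bits 39..37 and `001` in bits 22..20 decodes to a MULT main operation paired with an ADD sub operation.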
- the MAIN OP and the SUB OP are not the same DSP instruction although alterations to the hardware functional blocks could accommodate it.
- bitmap syntax of the dyadic DSP instruction can be converted into text syntax for program coding.
- its text syntax for multiplication or MULT is
- vmuln refers to either positive vector multiplication or negative vector multiplication being selected as the MAIN OP.
- smax refers to either vector add, vector subtract, vector maximum, scalar add, scalar subtraction, or scalar maximum being selected as the SUB
- the next field, "da", refers to selecting one of the registers within the accumulator for storage of results.
- the field "sx" refers to selecting a register within the RISC register file 413 which points to a memory location in memory as one of the sources of operands.
- the field "sa" refers to selecting the contents of a register within the accumulator as one of the sources of operands.
- the field "sy" refers to selecting a register within the RISC register file 413 which points to a memory location in memory as another one of the sources of operands.
- ps1)]" refers to pair selection of keyword PS0 or PS1 specifying which are the source-destination pairs of a parallel-store control register.
- Figures 6E and 6F list the set of 20-bit DSP and control instructions for the ISA of the present invention.
- Figure 6G lists the set of extended control instructions for the ISA of the present invention.
- Figure 6H lists the set of 40-bit DSP instructions for the ISA of the present invention.
- Figure 6I lists the set of addressing instructions for the ISA of the present invention.
- the signal processor 300 includes the final decoders 704A through 704N, and multiplexers 720A through 720N.
- the multiplexers 720A through 720N are representative of the multiplexers 514, 516, 520, and 522 in Figure 5B.
- the predecoding 702 is provided by the RISC control unit 302 and the pipe control 304.
- An instruction is provided to the predecoding 702 such as a dyadic DSP instruction 600.
- the predecoding 702 provides preliminary signals to the appropriate final decoders 704A through 704N on how the multiplexers 720A through 720N are to be selected for the given instruction.
- the MAIN OP generally, if not a NOP, is performed by the blocks of the multiplier M1 504A, compressor 506, adder A1 510A, and adder A2 510B. The result is stored in one of the registers within the accumulator register AR 512.
- the SUB OP generally, if not a NOP, is performed by the blocks of the adder A3 510C and the multiplier M2 504B.
- as an example, the dyadic DSP instruction to be performed is an ADD and a MULT
- the ADD operation of the MAIN OP is performed by the adder A1 510A
- the MULT operation of the SUB OP is performed by the multiplier M2 504B.
- the predecoding 702 and the final decoders 704A through 704N appropriately select the respective multiplexers 720A through 720N to select the MAIN OP to be performed by the adder A1 510A and the SUB OP to be performed by the multiplier M2 504B.
- multiplexer 520A selects inputs from the data typer and aligner 502 in order for adder A1 510A to perform the ADD operation
- multiplexer 522 selects the output from adder 510A for accumulation in the accumulator 512
- multiplexer 514B selects outputs from the accumulator 512 as its inputs to perform the MULT SUB OP.
- the MAIN OP and SUB OP can be either executed sequentially (i.e. serial execution on parallel words) or in parallel (i.e. parallel execution on parallel words). If implemented sequentially, the result of the MAIN OP may be an operand of the SUB OP.
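The sequential case, where the MAIN OP result feeds the SUB OP, can be sketched for the ADD and MULT example. This is an illustrative model only: `dyadic_add_then_mult` is a hypothetical name, the lists stand in for vector operands, and the source of the second multiplier operand `m` (local data) is an assumption.

```python
def dyadic_add_then_mult(x, y, m):
    """Model one dyadic instruction executed sequentially.

    MAIN OP: ADD of the two memory operands (the role of adder A1),
    with the sum held in the accumulator.
    SUB OP: MULT of the accumulator contents by a local operand
    (the role of multiplier M2).
    """
    if not (len(x) == len(y) == len(m)):
        raise ValueError("all operand vectors must have equal length")
    accumulator = [a + b for a, b in zip(x, y)]       # MAIN OP result
    return [a * c for a, c in zip(accumulator, m)]    # SUB OP consumes it
```

For instance, `dyadic_add_then_mult([1, 2], [3, 4], [2, 2])` first forms the sums 4 and 6 and then scales them to 8 and 12.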
- the final decoders 704A through 704N have their own control logic to properly time the sequence of multiplexer selection for each element of the signal processor 300 to match the pipeline execution of how the MAIN OP and SUB OP are executed, including sequential or parallel execution.
- the RISC control unit 302 and the pipe control 304, in conjunction with the final decoders 704A through 704N, pipeline instruction execution by pipelining the instruction itself and by providing pipelined control signals. This allows for the data path to be reconfigured by the software instructions each cycle.
- FIG. 10 illustrates a detailed system block diagram of the packetized telecommunication network 100'.
- an end system 108A is at a near end while an end system 108B is at a far end.
- the end systems 108A and/or 108B can be a telephone, a fax machine, a modem, wireless pager, wireless cellular telephone or other electronic device that operates over a telephone communication system.
- the end system 108A couples to switch 106A which couples into gateway 104A.
- the end system 108B couples to switch 106B which couples into gateway 104B.
- Gateway 104A and gateway 104B couple to the packet network 101 to communicate voice and other telecommunication data between each other using packets.
- Each of the gateways 104A and 104B includes network interface cards (NIC) 130A-130N, a system controller board 1010, a framer card 1012, and an Ethernet interface card 1014.
- the network interface cards (NIC) 130A-130N in the gateways provide telecommunication processing for multiple communication channels over the packet network 101.
- the NICs 130 couple packet data into and out of the system controller board 1010.
- the packet data is packetized and depacketized by the system controller board 1010.
- the system controller board 1010 couples the packets of packet data into and out of the Ethernet interface card 1014.
- the Ethernet interface card 1014 of the gateways transmits and receives the packets of telecommunication data over the packet network 101.
- the NICs 130 couple time division multiplexed (TDM) data into and out of the framer card 1012.
- the framer card 1012 frames the data from multiple switches 106 as time division multiplexed data for coupling into the network interface cards 130.
- the framer card 1012 pulls data out of the framed TDM data from the network interface cards 130 for coupling into the switches 106.
- Each of the network interface cards 130 includes a micro controller (cPCI controller) 140 and one or more of integrated telecommunications processors 150A-150N.
- each of the integrated telecommunications processors 150A-150N includes one or more RISC/DSP core processor 200, one or more data memory (DRAM) 202, one or more program memory (PRAM) 204, one or more serial TDM interface ports 206 to support multiple TDM channels, a bus controller or memory movement engine 208, a global or buffer memory 210, a host or host bus interface 214, and a microcontroller (MIPS) 223.
- Firmware flexibly controls the functionality of the blocks in the integrated telecommunications processor 150 which can vary for each individual channel of communication.
- One full duplex channel consists of two time-division multiplexed (TDM) time slots on the TDM or near side and two packet data channels on the packet network or far side, one for each direction of communication.
- the telecommunication processing provided by the firmware can provide telephony processing for each given channel including one or more of network echo cancellation 1103, dial tone detection 1104, voice activity detection 1105, dual-tone multi-frequency (DTMF) signal detection 1106, dual-tone multi-frequency (DTMF) signal generation 1107, dial tone generation 1108, G.7xx voice encoding (i.e. compression) 1109, G.7xx voice decoding (i.e. decompression) 1110, and comfort noise generation (CNG) 1111.
- the firmware for each channel is flexible and can also provide GSM decoding/encoding, CDMA decoding/encoding, digital subscriber line (DSL), modem services including modulation/demodulation, fax services including modulation/demodulation and/or other functions associated with telecommunications services for one or more communication channels. While μ-Law / A-Law decoding 1101 and μ-Law / A-Law encoding 1102 can be performed using firmware, in one embodiment it is implemented in hardware circuitry in order to speed the encoding and decoding of multiple communication channels.
- the integrated telecommunications processor 150 couples to the host processor 140 and a packet processor 1120.
- the host processor 140 loads the firmware into the integrated telecommunications processor to perform the processing in a voice over packet (VoP) network system or packetized network system.
- the μ-Law / A-Law decoding 1101 decodes encoded speech into linear speech data.
- the μ-Law / A-Law encoding 1102 encodes linear speech data into μ-Law / A-Law encoded speech.
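The idea behind the μ-law encode and decode steps can be illustrated with the continuous μ-law companding curve. This sketch is not the bit-exact segmented 8-bit codec and not the patent's hardware implementation; it only shows the logarithmic compression and its inverse for samples normalized to the range -1 to 1, with μ = 255. The function names are hypothetical.

```python
import math

MU = 255.0  # companding constant used by mu-law telephony


def mu_law_compress(x: float) -> float:
    """Map a linear sample in [-1, 1] onto the logarithmic mu-law scale."""
    if not -1.0 <= x <= 1.0:
        raise ValueError("sample must be normalized to [-1, 1]")
    return math.copysign(math.log1p(MU * abs(x)) / math.log1p(MU), x)


def mu_law_expand(y: float) -> float:
    """Invert mu_law_compress, recovering the linear sample."""
    if not -1.0 <= y <= 1.0:
        raise ValueError("companded value must lie in [-1, 1]")
    return math.copysign(math.expm1(abs(y) * math.log1p(MU)) / MU, y)
```

Small amplitudes receive proportionally more of the output range than large ones, which is why quiet speech survives quantization to few bits after companding.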
- the integrated telecommunications processor 150 includes hardware G.711 μ-Law / A-Law decoders and μ-Law / A-Law encoders.
- the hardware conversion of A-law/μ-law encoded signals into linear PCM samples and vice versa is optional depending upon the type of signals received.
- the TDM signals at the near end are encoded speech signals.
- the integrated telecommunications processor 150 receives TDM signals from the near end and decodes them into pulse-code modulated (PCM) linear data samples Sin. These PCM linear data samples Sin are coupled into the network echo-cancellation module 1103.
- the network echo-cancellation module 1103 removes an echo estimated signal from the PCM linear data samples Sin to generate PCM linear data samples Sout.
- the PCM linear data samples Sout are provided to the DTMF detection module 1106 and the voice-activity detection and comfort-noise generator module 1105.
- the output of the Network Echo Canceller is coupled into the Tone Detection module 1104, the DTMF Detection module 1106, and the Voice Activity Detection module 1105.
- Control signals from the Tone Detection module 1104 are coupled back into the Network Echo Cancellation module 1103.
- the decoded speech samples from the far end are PCM linear data samples Rin and are coupled into the network echo cancellation module 1103.
- the network echo cancellation module 1103 copies Rin for echo cancellation purposes and passes it out as PCM linear data samples Rout.
- the PCM linear data samples Rout are coupled into the mu-law and A-law encoding module 1102.
- the PCM linear data samples Rout are encoded into mu-law and A-law encoded speech and interleaved into the TDM output signals of the TDM channel Output to the near end.
- the interleaving for framing of the data is performed after the linear to A-law/mu-law conversion by a Framer (not shown in Figure 11) which puts the individual channel data into different time slots. For example, for T1 signaling there are 24 such time slots for each T1 frame.
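The time-slot interleaving performed by the framer can be sketched as below, using the 24 time slots per T1 frame mentioned in the text. The function names are hypothetical, each channel is modeled as a list of already-encoded sample values, and framing bits and signaling are deliberately left out.

```python
SLOTS_PER_T1_FRAME = 24


def interleave(channels):
    """Build frames: frame k holds sample k of every channel, in slot order."""
    if len(channels) != SLOTS_PER_T1_FRAME:
        raise ValueError("a T1 frame carries exactly 24 channels")
    if len({len(ch) for ch in channels}) > 1:
        raise ValueError("all channels must supply the same number of samples")
    return [list(frame) for frame in zip(*channels)]


def deinterleave(frames):
    """Recover the per-channel sample streams from a list of frames."""
    return [list(channel) for channel in zip(*frames)]
```

Each frame therefore carries one sample per channel, and applying `deinterleave` to the frames returns the original per-channel streams.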
- the Network Echo Cancellation module 1103 has two inputs and two outputs because it has full duplex interfaces with both the TDM channels and the packet network via the VX-Bus.
- the network echo cancellation module 1103 cancels echoes from linear as well as non-linear sources in the communication channel.
- the network echo cancellation module 1103 is specifically tailored to cancel non-linear echoes associated with the packet delays/latency generated in the packetized network.
- the tone detection module 1104 receives both tone and voice signals from the network echo cancellation module 1103.
- the tone detection module 1104 discriminates the tones from the voice signals in order to determine what the tones are signaling.
- the tone detection module determines whether or not the tones from the near end are call progress tones (dial tone, busy tone, fast busy tone, etc.) signaling on-hook, ringing, off-hook or busy, or a fax/modem call. If a far end is dialing the near end, the call progress tones of on-hook, ringing, or off-hook or busy signal are translated into packet signals by the tone detection module for transmission over the packet network to the far end. If the tone detection module determines that fax/modem tones are present indicating that the near end is initiating a fax/modem call, further voice processing is bypassed and the echo cancellation by the network echo cancellation module 1103 is disabled.
- the tone detection module 1104 uses infinite impulse-response (IIR) filters and accompanying logic. When a fax or modem signaling tone is detected, the signaling tones help control the respective signaling event.
- the tone detection module 1104 detects the presence of several in-band tones at specific frequencies, checks their cadences, signals their presence to the echo cancellation module 1103, and prompts other modules to take appropriate actions.
- the tone detection module 1104 and the DTMF detection module operate in parallel with the network echo canceller 1103.
- the tone detection module can detect true tones with signal amplitude levels from 0 dB to -40 dB in the presence of a reasonable amount of noise.
- the tone detection module can detect tones within a reasonable neighborhood of center frequency with detection delays within a prescribed limit.
- the tone detection module matches the tone cadences, as required by the tone-cadence rules defined by the ITU/TIA standards. To achieve the above properties, certain trade-offs are necessary in that the tone detection module must adjust several energy thresholds, the filter roll-off rate, and the filter stopband attenuation. Furthermore, the tone detection module is easily upgradeable to allow detection of additional tones simply by updating the firmware.
- the current telephony-related tones that the tone-detection module 1104 can detect are listed in the following table: Tones the Tone-Detection Module Detects
- When a 2100-Hz tone with phase reversal is detected, indicating V-series modem operation, the echo canceller is shut off temporarily. When the tone detection module detects facsimile tones, the echo canceller is also shut off temporarily.
- the tone detection module can also detect the presence of narrowband signals, which can be control signals to control the actions of the echo cancellation module 1103.
- the tone detection modules function both during call setup and while the call progresses through termination of the communication channel for the call. Any tone which is sent, generated, or detected before the actual call or communication channel is established is referred to as an out-of-band tone. Tones which are detected during a call, after the call has been set up, are referred to as in-band tones.
- the Tone Detector, in its most general form, is capable of detecting many signaling tones.
- the tones that are detected include the call progress tones such as a Ringing Tone, a Busy Tone, a Fast Busy Tone, a Caller ID Tone, a Dial Tone, and other signaling tones which vary from country to country.
- The call progress tones control the handshaking required to set up a call.
- Once a call is established, all the tones which are generated and detected are referred to as in-band tones.
- the same Tone Detector and Generator blocks are used for both in-band and out-of-band tone detection and generation. In most conversations, speakers only voice speech about 35% of the time.
- silence at one end which is not transmitted to an opposite end, needs to be simulated and inserted into the call at the opposite end.
- the background or Comfort-Noise Generation (CNG) module 1105 simulates silence or quiet time at an end by adding background noise such as a comforting 'hiss'.
- the CNG module 1105 can simulate ambient background noise of varying levels.
- An echo-cancellation setup message can be used to control the CNG module as an external parameter.
- the comfort noise generation module alleviates the effects of switching in and out as heard by far-end talkers when they stop talking.
- the near-end noise level is used to determine an appropriate level of background noise to be simulated and inserted at the Sout (Send Out) Port. However, before silence can be simulated by the CNG module 1105, it first must be detected.
- the Voice-Activity Detection (VAD) module 1105 is used to detect the presence or absence of silence in a speech segment.
- background noise energy is estimated and an encoder therein generates a Silence-Insertion Description (SID) frame.
- the SID frame is transmitted to an opposite end to indicate that silence is to be simulated at the estimated background noise energy level.
- In response to receiving an SID frame at the opposite end (i.e., the Far End), the CNG module 1111 generates a corresponding comfort noise or simulated silence for a period of time.
- the CNG uses the received level of the ambient background noise from the SID frame to produce a level of comfort noise (also called 'white noise' or 'pink noise' or simulated silence) that replaces the typical background noises that have been removed, thereby assuring the far-end person that the connection has not been broken.
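Producing comfort noise at a received energy level can be sketched as follows. This is a minimal illustration under stated assumptions: the noise is plain Gaussian white noise, the SID frame is reduced to a single mean-square energy value, and the name `comfort_noise` and its parameters are hypothetical rather than taken from the patent.

```python
import math
import random


def comfort_noise(target_energy: float, n_samples: int, seed: int = 0):
    """Return n_samples of white noise whose mean-square energy is target_energy."""
    if target_energy < 0 or n_samples <= 0:
        raise ValueError("need a non-negative energy and a positive sample count")
    rng = random.Random(seed)
    raw = [rng.gauss(0.0, 1.0) for _ in range(n_samples)]
    raw_energy = sum(s * s for s in raw) / n_samples
    # Scale so the generated frame matches the level reported by the far side.
    gain = math.sqrt(target_energy / raw_energy)
    return [gain * s for s in raw]
```

A receiver could call this once per frame while no speech packets arrive, reusing the most recently received energy level.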
- the VAD module 1105 determines when the comfort noise is to be turned on (i.e. a quiet period is detected) and when comfort noise is to be turned off (i.e. the end user is talking again).
- the VAD 1105 (in the Send Path) and CNG module 1111 (in the Receive Path) work effectively together at two different ends so that speech is not clipped during the quiet period and comfort noise is appropriately generated.
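A simple way to decide when comfort noise should be on or off, without clipping the tail of speech, is an energy threshold combined with a hangover counter. The sketch below is an assumption-laden illustration, not the detector described in this patent: the fixed threshold, the hangover length, and the function names are all hypothetical.

```python
def frame_energy(frame):
    """Mean-square energy of one frame of PCM samples."""
    return sum(s * s for s in frame) / len(frame)


def vad_decisions(frames, threshold, hangover_frames=2):
    """Return one boolean per frame: True means voice (comfort noise off)."""
    decisions = []
    hang = 0
    for frame in frames:
        if frame_energy(frame) > threshold:
            hang = hangover_frames  # reload the hangover on every active frame
            decisions.append(True)
        elif hang > 0:
            hang -= 1               # keep reporting voice during the hangover
            decisions.append(True)
        else:
            decisions.append(False)
    return decisions
```

The hangover keeps the voice flag raised for a few frames after the energy drops, so trailing low-energy speech is still transmitted before silence is declared.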
- the VAD module 1105 includes an Adaptive Level Controller (ALC) that ensures a constant output level for varying levels of near-end inputs.
- the adaptive level controller includes a variable gain amplifier to maintain the constant output level.
- the adaptive level controller includes a near-end energy detector to detect noise in the near-end signal. When the near end energy detector detects noise in the near-end signal the ALC is disabled so that undesirable noise is not amplified.
- the DTMF detection module 1106 performs dual-tone multiple frequency detection necessary to detect DTMF tones as telephone signals.
- the DTMF detection module receives signals on Sout from the echo cancellation module 1103.
- the DTMF detection module 1106 is always active, even during normal conversation in case DTMF signals are transmitted during a conversation.
- the DTMF detection module does not disable echo cancellation when DTMF tones are detected.
- the DTMF detection module includes narrow-band filters to detect special tones and DTMF dialing tones.
- Because the G.7xx speech encoding module 1109 and decoding module 1110 are used to compress and decompress speech signals and are not used for control signaling or dialing tones, the DTMF detection module may be used as appropriate to control sequencing, loading, and the execution of CODEC firmware.
- the DTMF detection module 1106 detects the DTMF tones and includes a decoder to decode the tones to determine which telephone keypad button was pressed.
- the DTMF detection module 1106 is based on a Goertzel algorithm and meets all conditions of the Bellcore DTMF decoder tests as well as Mitel decoder tests.
- the DTMF detection module 1106 indicates which dialpad key a sender has pressed after processing a few frames of data.
- the DTMF detection module can be adapted to receive user-defined parameters.
- the user defined parameters can be varied to optimize the DTMF detector for specific receiving conditions such as the thresholds for both of the frequencies made up by the 'rows' and 'columns' of the DTMF keypad, thresholds for acceptable twist ratios (the ratio of powers between the higher and lower frequencies), silence level, signal-to-noise ratios, and harmonic ratios.
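The Goertzel-based detection described above can be sketched as follows. This is a minimal illustration, not the patented detector: the 8 kHz sample rate, the 205-sample block length, and the `min_power` and `max_twist_db` values are assumptions chosen for the example, and the function names are hypothetical. Only the power threshold and twist check are shown; the signal-to-noise and harmonic checks mentioned in the text are omitted.

```python
import math

SAMPLE_RATE = 8000  # Hz; assumed telephony rate (one sample every 125 microseconds)
ROW_FREQS = (697, 770, 852, 941)       # lower-frequency ("row") group
COL_FREQS = (1209, 1336, 1477, 1633)   # upper-frequency ("column") group
KEYPAD = ("123A", "456B", "789C", "*0#D")


def goertzel_power(samples, freq, sample_rate=SAMPLE_RATE):
    """Squared magnitude of `samples` at `freq`, via the Goertzel recurrence."""
    coeff = 2.0 * math.cos(2.0 * math.pi * freq / sample_rate)
    s_prev, s_prev2 = 0.0, 0.0
    for x in samples:
        s = x + coeff * s_prev - s_prev2
        s_prev2, s_prev = s_prev, s
    return s_prev * s_prev + s_prev2 * s_prev2 - coeff * s_prev * s_prev2


def detect_key(samples, min_power=1e3, max_twist_db=8.0):
    """Return the keypad character for one block, or None if no valid digit."""
    row_powers = [goertzel_power(samples, f) for f in ROW_FREQS]
    col_powers = [goertzel_power(samples, f) for f in COL_FREQS]
    row = max(range(4), key=lambda i: row_powers[i])
    col = max(range(4), key=lambda i: col_powers[i])
    low, high = row_powers[row], col_powers[col]
    # Hypothetical per-group power threshold (user-defined in the text).
    if low < min_power or high < min_power:
        return None
    # Twist: ratio of powers between the higher and lower frequencies.
    twist_db = 10.0 * math.log10(high / low)
    if abs(twist_db) > max_twist_db:
        return None
    return KEYPAD[row][col]


def make_tone(f_low, f_high, n=205):
    """Test helper: unit-amplitude dual tone of n samples."""
    return [math.sin(2 * math.pi * f_low * i / SAMPLE_RATE)
            + math.sin(2 * math.pi * f_high * i / SAMPLE_RATE) for i in range(n)]
```

For example, `detect_key(make_tone(770, 1336))` evaluates the eight Goertzel filters over one block and maps the strongest row and column to the key '5'.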
- the DTMF generation module 1107 provides the dual-tone multi-frequency (DTMF) generation necessary to generate DTMF tones for telephone signals.
- the encoding process in the DTMF generation module 1107 generates one of the various pairs of DTMF tones.
- the DTMF generation module 1107 generates digitized dual-tone multi-frequency samples for a dialpad key depression at the far end.
- the DTMF generation module 1107 is also always active, even during normal conversation.
- the DTMF generation module 1107 includes narrow-band filters to generate special tones and DTMF dialing tones.
- the DTMF generation module 1107 receives a DTMF packet from the far end over the packet network.
- the DTMF generation module 1107 includes a DTMF decoder to decode the DTMF packet and properly generate tones.
- the DTMF packet payload includes such information as the key or digit that was pressed and is to be played (i.e. dialpad key coordinates), the duration to be played (the number of successive 125 microsecond samples during which the tone is enabled and the number of successive 125 microsecond samples during which the tone is shut off), the amplitude level (lower-frequency amplitude level in dB and upper-frequency amplitude level in dB), and other information.
- the DTMF generation module 1107 can generate DTMF signaling tones having the required signal amplitude levels and timing for the appropriate digit/tone.
- the DTMF tones generated by the DTMF generation module 1107 are coupled into the echo canceller on Rin.
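A minimal sketch of tone generation from the payload fields listed above (key, on and off durations in 125 microsecond samples, and per-frequency amplitude levels). The function names are hypothetical, and the amplitudes are assumed here to be in dB relative to 16-bit full scale, which the text does not specify.

```python
import math

SAMPLE_RATE = 8000  # Hz; one sample every 125 microseconds
ROW_FREQS = (697, 770, 852, 941)
COL_FREQS = (1209, 1336, 1477, 1633)
KEYPAD = ("123A", "456B", "789C", "*0#D")
FULL_SCALE = 32767  # 16-bit linear PCM


def key_frequencies(key):
    """Map a keypad character to its (lower, upper) frequency pair."""
    for r, row in enumerate(KEYPAD):
        c = row.find(key)
        if c >= 0:
            return ROW_FREQS[r], COL_FREQS[c]
    raise ValueError("not a DTMF key: %r" % (key,))


def generate_dtmf(key, on_samples, off_samples, low_db=-9.0, high_db=-7.0):
    """Return 16-bit samples: the dual tone for on_samples, then silence."""
    f_low, f_high = key_frequencies(key)
    # Assumption: levels are dB relative to full scale.
    a_low = FULL_SCALE * 10.0 ** (low_db / 20.0)
    a_high = FULL_SCALE * 10.0 ** (high_db / 20.0)
    out = []
    for n in range(on_samples):
        t = n / SAMPLE_RATE
        v = (a_low * math.sin(2 * math.pi * f_low * t)
             + a_high * math.sin(2 * math.pi * f_high * t))
        out.append(max(-FULL_SCALE, min(FULL_SCALE, int(round(v)))))
    out.extend([0] * off_samples)  # tone shut off
    return out
```

With `on_samples=400` and `off_samples=400`, the result is 50 ms of tone followed by 50 ms of silence at the assumed 8 kHz rate.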
- the tone generation module 1108 operates similar to the DTMF generation module 1107 but generates the specific tones that provide telephony signals.
- the tones generated by the tone generation module include tones to signal On-hook/off-hook, Ringing, Busy, and special tones to signal FAX/modem calls.
- a tone packet is received from the far end over the packet network and is decoded and the parameters of the tone are determined.
- the tone generation module 1108 generates tones similar to the DTMF generation module 1107 previously described, using narrowband filters.
- the G.7xx encoding module 1109 compresses speech before it is packetized.
- the G.7xx encoding module 1109 receives speech in a linear 64-Kbps pulse-code modulation (PCM) format from the network echo cancellation module 1103.
- the speech is compressed by the G.7xx encoding module 1109 using one of the compression standards specified for low bit-rate voice (LBRV) CODECs, including the ITU-T internationally standardized G.7xx series.
- LBRV low bit-rate voice
- Many speech CODECs can be chosen. However, the selected speech CODEC determines the block size of speech samples and the algorithmic delay.
- the G.7xx decoding module 1110 provides speech decompression of signals received from the far end over the packet network.
- the decompressed speech is coupled into the network echo cancellation module 1103.
- the decompression algorithm of the G.7xx decoding module 1110 needs to match the compression algorithm of the G.7xx encoding module 1109.
- the G.7xx decoding module 1110 and the G.7xx encoding module 1109 are referred to as a CODEC (coder-decoder).
- CODEC coder-decoder
- the ITU CODECs include G.711, G.722, G.723.1,
- the companded 8-bit PCM data on the TDM channel input is converted into 16-bit linear PCM for processing in the processor 150 and is re-converted back into 8-bit PCM for outputting on the TDM channel output.
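As an illustration of the companded-to-linear conversion, the following sketch expands one 8-bit mu-law code word into a linear sample using the widely used G.711 reference arithmetic. It covers mu-law only (A-law uses a different table), and the function name is an assumption for this example.

```python
MULAW_BIAS = 0x84  # bias of 132 used by the common G.711 mu-law reference code


def mulaw_to_linear(code):
    """Expand one 8-bit mu-law code word into a signed linear sample."""
    code = ~code & 0xFF              # mu-law bytes are transmitted inverted
    sign = code & 0x80
    exponent = (code >> 4) & 0x07    # 3-bit segment number
    mantissa = code & 0x0F           # 4-bit step within the segment
    magnitude = (((mantissa << 3) + MULAW_BIAS) << exponent) - MULAW_BIAS
    return -magnitude if sign else magnitude
```

The expanded values span -32124 to +32124, so they fit in the 16-bit linear format used for processing.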
- a flow chart diagram of the telephony processing of linear data (Sin) from a near end to packet data on the network side at a far end is illustrated.
- Near-end input data Sin is provided to the integrated telecommunications processor 150.
- After echo cancellation is performed at step 1203 and/or if the echo cancellation module 1103 is enabled, the integrated telecommunications processor 150 jumps to the tone detection step 1205 where the data is coupled into tone detection module 1104. The processor 150 goes to step 1207. At step 1207, a determination is made whether a fax tone is present. If the fax tone is present at step 1207, the integrated telecommunications processor 150 jumps to step 1209 to provide fax processing. If no fax tone is present at step 1207, further interpretation of the result by the tone detection module occurs at step 1211.
- the echo cancellation disable tone has been detected and the energy of the tone is greater than a given predetermined threshold, which causes the echo cancellation module to be disabled so that it does not cancel newly arriving Sin signals.
- the Echo Canceller block is given an indication through a control signal to disable Echo Cancellation.
- the echo cancellation disable tone was not detected and the energy of the tone is less than the given predetermined threshold.
- the echo cancellation module is enabled or remains enabled if already in such state.
- the Echo Canceller block is given an indication through a control signal to enable Echo Cancellation. This may indicate the end of Echo Canceller Disable Tone.
- the predetermined threshold level is a cutoff level to determine whether or not an Echo Canceller Disable Flag should be turned OFF. If the Tone Energy drops below the predetermined threshold, the Echo Cancellation disable flag is turned OFF. This flag is coupled into the Echo Canceller module. The Echo Canceller module is enabled or disabled in response to the echo cancellation disable flag. If the Tone energy is greater than the predetermined threshold, then the processor jumps to step 1213 as described above. In either case, whether the echo cancellation disable flag is set true or false at step 1213 or step 1217, the next step in processing is the VAD module at step 1219.
- the data signal Sin is coupled into the voice activity detector module 1105 which is used to detect periods of voice/DTMF/tone signals and periods of silence that may be present in the data signal Sin.
- the processor 150 jumps to step 1221.
- At step 1221, a determination is made whether silence has been detected. If silence has been detected, the integrated telecommunications processor 150 jumps to step 1223 where an SID packet is prepared for transmission out as a packet on the packet network at the far end. If no silence is detected at step 1221, the processor couples the signal Sin into the ambient level control (ALC) module (not shown in FIG. 11). At step 1225, the ALC amplifies or de-amplifies the signal Sin to a constant level. The integrated telecommunications processor 150 then jumps to step 1227 where DTMF/Generalized Tone detection is performed by the DTMF/Generalized Tone detection module 1106. The processor goes to step 1229. At step 1229, a determination is made whether DTMF or tone signals have been detected.
- ALC ambient level control
- If DTMF or tone signals have been detected, the integrated telecommunications processor 150 generates DTMF or tone packets at step 1231 for transmission out the packet network at the far end. If no DTMF or tone signals are detected at step 1229, the signal is a voice/speech signal and the G.7xx encoding module 1109 encodes the speech into a speech packet at step 1233. A speech packet 1235 is then transmitted out the packet network side to the far end.
- In FIG. 13, a flow chart diagram of the telephony processing of packet data from the network side at the far end by the integrated telecommunications processor 150 into Rout signals at the near end is illustrated.
- the integrated telecommunications processor 150 receives packet data from the far end over the packet network 101.
- a determination is made as to what type of packet has been received.
- the integrated telecommunications processor 150 is expecting one of five types of packets.
- the five packet types that are expected are a fax packet 1303, a DTMF packet 1304, a tone packet 1305, and a speech packet or an SID packet 1306.
- If at step 1301 a determination has been made that a fax packet 1303 has been received, data from the packet is coupled into a fax demodulation module by the integrated telecommunications processor at step 1308. At step 1308, the fax demodulation module demodulates the data from the packet using fax demodulation into Rout signals at the near end. If at step 1301 a determination has been made that a DTMF packet 1304 has been received, the data from the packet is coupled into the DTMF generation module 1107 at step 1310. At step 1310, the DTMF generation module 1107 generates DTMF tones from the data in the packet as Rout signals at the near end.
- If the packet received is determined to be a tone packet 1305, the data from the packet is coupled into the tone generation module 1108 at step 1312. At step 1312, the tone generation module 1108 generates tones as Rout signals at the near end. If at step 1301 a determination has been made that speech or SID packets 1306 have been received, the data from the packet is coupled into the G.7xx decoding module 1110 at step 1314. At step 1314, the G.7xx decoding module 1110 decompresses the speech or SID data from the packet into Rout signals at the near end.
- the integrated telecommunications processor 150 jumps to step 1318. If at step 1318 the echo canceller flag is enabled, the Rout signals from the respective module are coupled into the echo cancellation module. These Rout signals are the Far End Input to the Echo Canceller whose echo, if not cancelled, rides on the Near End Signal when it gets transmitted to the other end. At step 1318, the respective Rout signal from a module, in conjunction with the Sin signal and the Echo Canceller Enable Flag from the near end, is used to perform echo canceling.
- the Echo Canceller Enable Flag is a binary flag which turns ON and OFF the Echo Canceling operation in step 1318. When this flag is ON, the NearEndIn signals are processed to cancel the potential echo of the FarEnd. When this flag is OFF, the NearEndIn signal bypasses the Echo Canceling unchanged.
- In FIG. 14A, a block diagram of the data flows and interaction between exemplary functional blocks of the integrated telecommunications processor 150 for telephony processing is illustrated.
- There are two data flows in the voice over packet (VOP) system provided by the integrated telecommunications processor 150.
- the two data flows are TDM-to-Packet and Packet-to-TDM which are both executed in tandem to form a full duplex system.
- the functional blocks in the TDM-to-Packet data flow include the Echo Canceller 1403, the tone detector 1404, the voice activity detector (VAD) 1405, the automatic level controller (ALC) 1401, DTMF detector 1405, and packetizer 1409.
- the Echo Canceller 1403 substantially removes a potential echo signal from the near end of gateway.
- the Tone Detector 1404 controls the echo canceller and other modules of the integrated telecommunications processor 150.
- the tone detector is for detecting the EC Disable Tone, the FAXCED tone, the FAXCNG tone and V21 '7E' flags.
- the tone detector 1404 can also be programmed to detect a given number of signaling tones.
- the VAD 1405 generates Silence Information Descriptor (SID) when speech is absent in the signal from the near end.
- the ALC 1401 optimizes volume (amplitude) of speech.
- the DTMF detector 1405 looks for tones representing DTMF digits.
- the Packetizer 1409 packetizes the appropriate payloads in order to send packets.
- the functional blocks in the Packet to TDM Flow include: the Depacketizer 1410, the Comfort Noise Generator (CNG) 1420, the DTMF Generator 1407, the PCM to linear converter 1421, and the optional Narrowband signal detector 1422.
- the Depacketizer 1410 determines the packet type and routes the packet appropriately to the CNG 1420, the PCM to linear converter 1421 or the DTMF generator 1407.
- the CNG 1420 generates comfort noise based on an SID packet.
- the DTMF generator 1407 generates DTMF signals of a given amplitude and duration.
- the optional Narrowband signal detector 1422 detects when it is undesirable for the echo canceller to cancel the echo of certain tones on Rin side.
- the PCM to Linear converter 1421 converts A-law/mu-law encoded speech into 16-bit linear PCM samples. However, this block can easily be replaced by a general speech decoder (e.g. G.7xx speech decoder) for a given communications channel by swapping out the appropriate firmware code.
- the TDM IN/OUT block 1424 is an A-law/mu-law to linear conversion block (i.e. 1102, 1103) which occurs at the TDM interface. This could be performed by hardware or can be programmed and performed by firmware.
- the integrated telecommunications processor is a modular system. It is easy to open new communication channels and support numerous channels simultaneously as a result. These functional modules or blocks of the integrated telecommunications processor 150 interact with each other to achieve complete functionality.
- Interfunctional-block (InterFB) data: all functional blocks of the integrated telecommunications processor 150 have permission to read this shared area in memory, but only a few blocks or modules of the integrated telecommunications processor 150 have permission to write into this shared area of memory.
- the InterFB data is a fixed (reserved) area in memory starting at a memory address such as 0x0050H for example. All the functional blocks or modules of the integrated telecommunications processor 150 communicate with each other, if needed, using this shared memory or InterFB data. The same shared memory area may be used for both
- the table below indicates a sample set of parameters that may be communicated between functional blocks in the integrated telecommunications processor 150.
- the column “Parameter Name” indicates the parameter while the “Function” column indicates the function the parameters assist in performing.
- the "Write/Read Access” column indicates what functional blocks can read or write the parameter.
- the echo canceller 1403 receives both the Sin signal and Rin signal in order to generate the Sout signal as the echo cancelled signal.
- the echo canceller 1403 also generates the Rout signal which is normally the same as Rin. That is, no further processing is performed to the Rin signal in order to generate the Rout signal in most cases.
- the echo canceller 1403 operates over both data flows in that it receives from the TDM end as well as data from the packet side.
- the echo canceller 1403 properly functions only when data is fully available in both the flows.
- the tone detector 1404 receives the output Sout from the echo canceller 1403.
- the tone detector 1404 looks for the EC Disable Tone, the FAXCED tone, the FAXCNG tone and the tones representing V21 '7E' flags.
- the tone detector functions on Sout data after the echo canceller 1403 has completed its data processing.
- the tone detector's main purpose is to control other modules of the integrated telecommunications processor 150 by turning them ON or OFF.
- the tone detector 1404 is basically a switching mechanism for the modules such as the Echo Canceller 1403 and the ALC 1401.
- the tone detector can write the ecdisable flag in the shared memory while the echo canceller 1403 reads it.
- the tone detector or Echo Canceller writes an ALCdisable flag in the shared memory while the ALC 1401 reads it. Most events detected by the tone detector are used by the echo canceller in one way or another. For example, the Echo Canceller 1403 is to turn OFF when an ecdisable tone is detected by the tone detector 1404. Modems usually send the /ANS signal (or ecdisable tone) to disable the echo cancellers in a network. When the tone detector 1404 of the integrated telecommunications processor 150 detects the ecdisable tone, it writes a TRUE state into the memory location representing ecdisable flag.
- the echo canceller 1403 reads the ecdisable flag to determine whether it is to perform echo cancellation or not. If it is disabled, the echo canceller 1403 generates Sout as Sin with no echo canceling signal added. The ecdisable flag is updated to a FALSE state by the echo canceller 1403 when the root mean squared (RMS) energy of Sin falls below -36 dBm, indicating no tone signals.
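The writer and reader roles for the ecdisable flag can be sketched as below. This is an illustration under stated assumptions: the `InterFB` class stands in for the shared memory area, the echo estimate is supplied by the caller rather than computed by an adaptive filter, and the mapping from 16-bit sample values to dBm (a full-scale sine taken as +3.17 dBm) is an assumed calibration that a real system would set for its own interface.

```python
import math

FULL_SCALE = 32768.0
FULL_SCALE_SINE_DBM = 3.17   # assumed calibration of a full-scale sine
EC_REENABLE_DBM = -36.0      # level below which the text clears the flag


class InterFB:
    """Hypothetical stand-in for the shared inter-functional-block memory."""
    def __init__(self):
        self.ecdisable = False


def rms_level_dbm(frame):
    """RMS level of a frame in dBm under the assumed calibration."""
    mean_sq = sum(x * x for x in frame) / len(frame) if frame else 0.0
    if mean_sq == 0.0:
        return float("-inf")
    # A full-scale sine has a mean square of FULL_SCALE**2 / 2.
    return FULL_SCALE_SINE_DBM + 10.0 * math.log10(mean_sq / (FULL_SCALE ** 2 / 2.0))


def tone_detector_update(shared, ec_disable_tone_detected):
    """Tone detector side: write TRUE when the disable tone is detected."""
    if ec_disable_tone_detected:
        shared.ecdisable = True


def echo_canceller(shared, sin_frame, echo_estimate):
    """Echo canceller side: read the flag, bypass or cancel, clear on low level."""
    if shared.ecdisable:
        if rms_level_dbm(sin_frame) < EC_REENABLE_DBM:
            shared.ecdisable = False
        return list(sin_frame)           # Sout = Sin, nothing subtracted
    return [s - e for s, e in zip(sin_frame, echo_estimate)]
```

In this sketch the frame that triggers the re-enable is still passed through unchanged; cancellation resumes on the following frame.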
- RMS root mean squared energy of Sin
- In certain cases it is undesirable for the ALC 1401 to modify the amplitude of a signal, such as when sending FAX data. In this case it is desirable for the ALC 1401 to be turned ON and OFF. In most cases an ANS tone is required to turn the ALC 1401 OFF.
- When the tone detector 1404 detects an ANS tone, it writes a TRUE state into the memory location for the ALC disable flag.
- the ALC 1401 reads the shared memory location for the ALC disable flag and turns itself ON or OFF in response to its state. Another condition that ALC disable flag may be turned ON could be a signal from the Echo Canceller saying there was no detected Near End signal. This may be the case when the Sout signal is below a given threshold level.
- When the tone detector detects an EC disable tone, it turns OFF the echo canceller 1403 (G.168). When the tone detector detects a FAXCED tone (ANS), it turns OFF the ALC 1401 (G.169) and provides a data bypass for FAX processing. When the tone detector detects a FAXCNG tone, it provides a data bypass for FAX processing. When the tone detector simultaneously detects three V21 '7E' Flags in a row, it provides a data bypass for FAX processing.
- the VAD 1405 is used to reduce the effective bitrate and optimize the bandwidth utilization.
- the VAD 1405 is used to detect silence from speech.
- the VAD encodes periods of silence by using a Silence Information Descriptor rather than sending PCM samples that represent silence. In order to do so, the VAD functions over frames of data samples of Sout.
- the frame size can vary depending on situations and needs of different implementations with a typical frame representing 80 data samples of Sout.
- When the VAD 1405 detects silence, it writes a voice_activity flag in the shared memory to indicate silence. It also measures the noise power level and writes a valid noise_power level into a shared memory location.
- In FIG. 14B, a flow chart of an algorithm for performing voice activity detection 1449 utilizing the voice activity detection module/processor 1405 is illustrated. Samples of Input Speech Signal x[n] are inputted into a framer 1450.
- the framer 1450 typically produces frames that have a length of 40 samples (5 milliseconds) or 80 samples (10 milliseconds) of pulse code modulated speech. After a frame of data has been formed by the framer 1450, the frame is analyzed by five different processes. These five processes include fast Fourier transform (FFT) processing 1451, zero crossing detection 1452, noise detection 1453, energy discrimination 1454, and instantaneous energy discrimination 1455. The processes operate on a frame by frame basis.
- FFT fast Fourier transform
- These processes 1451-1455 each set or clear a flag for the respective process (i.e. there are 5 flags), and the flags are used to make an intermediate voice activity detection decision at step 1460. Further, the intermediate voice activity detection decision 1460 can then weigh the processing steps 1451-1455 in a number of ways.
- the fast Fourier transform (FFT) process 1451 can set or clear the fast Fourier transform flag.
- the zero crossing detection process 1452 can set or clear the zero crossing flag.
- the noise detection process 1453 can set or clear the noise flag.
- the energy discrimination process 1454 can set or clear the energy flag.
- the instantaneous energy discrimination process 1455 can set or clear the instantaneous energy flag.
- the intermediate voice activity detection decision 1460 is set to indicate that voice has been detected. In general this decision could also have a weighting of a previous frame, or previous frames on the different flags. Otherwise, the intermediate voice activity detection decision 1460 is cleared indicating that no voice was detected such that silence is present.
- the voice activity detection algorithm 1449 jumps to a HangOver and Speech Kick In process 1461.
- HangOver processing and Speech Kick In processing are performed and the voice activity detection flag is either set or cleared in response thereto.
- the HangOver processing 1461 looks back over prior frames to determine if a series of past frames have the voice activity detection flags set or cleared. If the Voice Activity Detection in the past frame is set, then a HangOver counter is set to a given number (e.g. 4 or 5). If the past frame has the Voice Activity Detection Flag as zero (cleared), then the Hangover counter is decremented by 1.
- the Voice Activity Detection (VAD) Flag is not set to zero unless the HangOver Counter is Zero and the current Interim VAD Decision says that this frame is not a voice frame.
- This HangOver Processing ensures a smooth transition from speech to silence.
- the Speech Kick In looks for a set number of consecutive frames (e.g. three consecutive frames) where the Interim VAD flag has been declared to be 1 (going from silence to speech) before setting the Voice Activity Detection (VAD) Flag to 1. This ensures that a spurious declaration of speech is not made while transitioning from silence to speech.
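The HangOver and Speech Kick In behavior can be sketched as a small state machine. This is one interpretation, not the patented logic: the text's counter rules are read here as reloading the counter on every voiced frame while in the speech state and decrementing it on every unvoiced frame, and the class name and the defaults (a hangover of 4 frames, a kick-in of 3 frames) are taken from the examples in the text or assumed.

```python
class VadSmoother:
    """Turns per-frame interim VAD decisions into the final VAD flag."""

    def __init__(self, hangover_frames=4, kick_in_frames=3):
        self.hangover_frames = hangover_frames
        self.kick_in_frames = kick_in_frames
        self.hangover = 0      # frames of hold time remaining
        self.voice_run = 0     # consecutive interim voice decisions
        self.vad_flag = 0

    def update(self, interim):
        """Process one frame's interim decision (truthy = voice)."""
        self.voice_run = self.voice_run + 1 if interim else 0
        if self.vad_flag:
            if interim:
                self.hangover = self.hangover_frames   # reload while speech continues
            elif self.hangover > 0:
                self.hangover -= 1                     # hold the flag, count down
            else:
                self.vad_flag = 0                      # counter at zero and no voice
        elif self.voice_run >= self.kick_in_frames:
            self.vad_flag = 1                          # Speech Kick In satisfied
            self.hangover = self.hangover_frames
        return self.vad_flag
```

Feeding the interim sequence 1, 1, 1, 0, 0, 0, 0, 0, 0 yields 0, 0, 1, 1, 1, 1, 1, 0, 0: the flag turns on only at the third consecutive voiced frame and is held for four frames after speech stops.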
- a determination is made whether step 1461 has currently set or cleared the voice activity detection flag.
- If a determination is made at step 1462 that the voice activity detection flag is set, the algorithm 1449 jumps to step 1463.
- the voice activity detector algorithm activates an automatic level control if other conditions are met. It further sends a speech payload to be packetized and updates the voice activity detection flag for external interaction with other blocks of the integrated telecommunications processor 150. The algorithm 1449 then proceeds to the next frame. If at step 1462 a determination has been made that the voice activity detection flag is cleared, the algorithm 1449 jumps to step 1464.
- At step 1464, the voice activity detector algorithm disables the automatic level control and causes a silence insertion description payload to be prepared. It further updates the silence insertion description payload and the voice activity detection flag for external interaction with the other modules of the integrated telecommunications processor 150.
- the FFT processing 1451 is used to find a tone signal as distinguished from speech or silence.
- the fast Fourier transform processing 1451 can begin.
- an N point digital fast Fourier transform is performed. N can be 32 or 64 or any other power of 2.
- the digital fast Fourier transform at step 1470 converts the time domain data into a given number of frequency bins for the given frame of data.
- the FFT processing 1451 then jumps to step 1472.
- the squared values of adjacent bins are added together, yielding half the number of values (N/2).
- At step 1474, a bin peak finder process is performed which finds two peaks of the sums of adjacent bins' squared values obtained in the previous step, neglecting the zero frequency peak P0. P0, if a tone is present (e.g. a signaling tone), will have a very high energy level.
- the bin magnitude calculator 1472 generates N/2 values (N is the size of the FFT). Of the N/2 values generated by the bin magnitude calculator 1472, two peaks, P1 and P2 (e.g. having the highest energy values), are selected by the bin peak finder 1474. Peaks P1 and P2, if speech is present, could represent the high energy level speech harmonics.
- the processing 1451 then jumps to step 1476.
- the peak value difference is computed for comparison with the peak threshold to determine if the fast Fourier transform flag should be set or cleared.
- the 10 log10 difference between the zero frequency peak P0 and the residual peak sum is generated.
- the residual peak sum is determined by summing all the bins determined in step 1470 and subtracting the two peak values P1 and P2 therefrom. Thus, the residual peak sum equals (sum of all bins) - P1 - P2.
- the processing then jumps to step 1478.
- the peak value difference is compared with a pre-determined peak threshold. If at step 1478 it is determined that the peak value difference is greater than or equal to the peak threshold, then the fast Fourier transform flag is set to one indicating a tone is present.
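The bin magnitude calculation and peak finding (steps 1470 through 1474) can be sketched as follows. This is a partial illustration under assumptions: a direct DFT is used in place of a fast Fourier transform for brevity, the first paired value is treated as the zero frequency peak P0, and the function names are hypothetical. The final threshold comparison is left out because the text does not fully specify how the peak value difference is formed.

```python
import cmath
import math


def dft(frame):
    """Direct N-point DFT (a stand-in for the FFT; same result, slower)."""
    n = len(frame)
    return [sum(frame[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]


def paired_bin_energies(frame):
    """Square each bin magnitude and add adjacent bins, giving N/2 values."""
    energies = [abs(c) ** 2 for c in dft(frame)]
    return [energies[i] + energies[i + 1] for i in range(0, len(energies), 2)]


def find_peaks(values):
    """Return (P0, indices of the two largest non-zero-frequency values, residual)."""
    ranked = sorted(range(1, len(values)), key=lambda i: values[i], reverse=True)
    i1, i2 = ranked[0], ranked[1]
    p0 = values[0]  # assumed to be the zero frequency value
    residual = sum(values) - values[i1] - values[i2]  # sum of all bins - P1 - P2
    return p0, (i1, i2), residual
```

For a 32-sample frame holding a sinusoid centered on bin 5, the two peaks land on paired values 2 and 13 (bins 4 and 5, and the mirrored bins 26 and 27), and the residual is close to zero.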
- the flow diagram for the zero crossing detector 1452 is illustrated. After the framer 1450 frames the data input samples into a frame, the zero crossing detection 1452 begins at step 1480. At step 1480, the variable J, which is the sample number within the given frame, is initialized to zero. The zero crossing detector 1452 then jumps to step 1481.
- At step 1481, the frame length is compared with the variable J. If it is determined at step 1481 that the frame length is greater than J, then the zero crossing detector 1452 jumps to step 1482. If it is determined that the frame length is less than or equal to J at step 1481, then the zero crossing detector jumps to step 1484 which will be discussed later.
- At step 1482, the current data sample x[j] is multiplied by the previous sample x[j-1] and the product is compared to zero to determine if there is a sign reversal between adjacent samples. If step 1482 determines that there is no sign reversal between samples, then the zero crossing detector returns to step 1481.
- At step 1483, a running count of the zero crossings is incremented by one and the process performed by the zero crossing detector 1452 goes back to step 1481.
- At step 1484, a Root Mean Squared value of the zero crossings is determined by an equation.
- the Root Mean Squared value of the zero crossing is given by the equation: RMS_zero_crossing = alpha × zero_crossing_count + (1 - alpha) × RMS_zero_crossing.
- Alpha is a fraction less than 1.
- the zero crossing detector process 1452 then continues to step 1485. At step 1485 a determination is made whether the RMS zero crossing value is greater than a threshold value.
- If the RMS zero crossing value is greater than the threshold value, the zero crossing flag is set. Speech tends to have a high number of zero crossings. Thus, a greater number of zero crossings tends to indicate speech is present. If it is determined that the RMS zero crossing is less than or equal to the threshold value, the zero crossing flag is cleared. Then the Zero Crossing Detector proceeds to the next frame.
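The zero crossing steps above can be sketched as follows. The class name and the `alpha` and `threshold` defaults are assumptions for illustration; the sketch also starts the sign comparison at the second sample of each frame rather than carrying the last sample of the previous frame across the boundary.

```python
class ZeroCrossingDetector:
    """Counts sign reversals per frame and smooths the count across frames."""

    def __init__(self, alpha=0.5, threshold=10.0):
        self.alpha = alpha            # fraction less than 1
        self.threshold = threshold    # hypothetical decision threshold
        self.smoothed = 0.0           # the "RMS zero crossing" running value

    def process(self, frame):
        """Return the zero crossing flag (1 or 0) for one frame."""
        count = 0
        for j in range(1, len(frame)):
            # A negative product means adjacent samples differ in sign.
            if frame[j] * frame[j - 1] < 0:
                count += 1
        self.smoothed = self.alpha * count + (1.0 - self.alpha) * self.smoothed
        return 1 if self.smoothed > self.threshold else 0
```

Because the count is smoothed, a single frame with many crossings keeps influencing the flag for a few frames afterwards, decaying by a factor of (1 - alpha) each frame.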
- the noise detection process 1453 steps through two branches, one at step 1488 and another at step 1489.
- the autocorrelation of the frame is determined through the equation r[0].
- an autocorrelation of the frame using a delay of 10 samples is made. Thus, a 10th order correlation is made on this frame using the equation of block 1489.
- After completion of step 1488, the process jumps to step 1490.
- At step 1490, a root mean squared calculation of the autocorrelation r[0] is determined by the equation shown in block 1490.
- After the completion of step 1489, the process jumps to step 1491.
- At step 1491, a root mean squared calculation of the other correlation r[10] is made through the equation shown in block 1491.
- After completing the steps 1490 and 1491, the noise detection process jumps to step 1492.
- At step 1492, a determination is made as to whether the root mean squared value of the autocorrelation r[0] (i.e. r[0]_RMS) of the frame multiplied by a correlation threshold is greater than the root mean squared value of the autocorrelation of the frame using a tenth delayed sample (i.e. r[10]_RMS).
- If step 1492 makes the determination that noise is present, the noise flag is set by the noise detection process 1453. If at step 1492 the determination is made that noise was not present, the noise flag is cleared by the noise detection process 1453. In either case the process continues by processing the next frame of data.
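A minimal sketch of the noise detection branches follows. Several points are assumptions because the block equations are not reproduced in the text: the "RMS" values are modeled as the same exponential smoothing used for the zero crossings, the comparison at step 1492 is read as "noise is present when r[0]_RMS × threshold exceeds r[10]_RMS", and the defaults for `alpha` and `corr_threshold` are illustrative.

```python
import math


def autocorr(frame, lag):
    """Autocorrelation of one frame at the given lag: sum of x[n] * x[n - lag]."""
    return sum(frame[n] * frame[n - lag] for n in range(lag, len(frame)))


class NoiseDetector:
    """Flags frames whose lag-10 correlation is weak relative to their energy."""

    def __init__(self, alpha=0.5, corr_threshold=0.5):
        self.alpha = alpha
        self.corr_threshold = corr_threshold
        self.r0_rms = 0.0    # smoothed r[0]
        self.r10_rms = 0.0   # smoothed r[10]

    def process(self, frame):
        """Return the noise flag (1 or 0) for one frame."""
        a = self.alpha
        self.r0_rms = a * autocorr(frame, 0) + (1.0 - a) * self.r0_rms
        self.r10_rms = a * autocorr(frame, 10) + (1.0 - a) * self.r10_rms
        return 1 if self.r0_rms * self.corr_threshold > self.r10_rms else 0
```

A signal that repeats every 10 samples keeps most of its energy at lag 10 and is not flagged, while an uncorrelated signal has a lag-10 value near zero and is flagged as noise.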
- the process of the energy discriminator 1454 is illustrated to determine the amount of energy present in a frame.
- the energy discriminator starts at step 1494.
- the auto correlation of the frame of data r[0] is made.
- the equation for r[0] is illustrated in the block 1494.
- the energy discriminator 1454 jumps to step 1495.
- the logarithm of the autocorrelation of the frame is compared against an energy threshold.
- If it is determined in step 1495 that the logarithm of the autocorrelation of the frame is greater than an energy threshold, the energy discriminator 1454 sets the energy flag to 1 and jumps to the next frame. Thus, if the energy threshold is met, then there is a greater likelihood speech is present. If the logarithm of the autocorrelation of the frame is less than the energy threshold, the energy flag is cleared, the energy discriminator goes to the next frame and at step 1496 the energy threshold is updated in the energy discriminator process 1454. The energy threshold is updated to keep track of background noise. Thus, this step updates the energy threshold only when the energy flag is found to be set to zero.
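The energy discriminator can be sketched as below. The threshold update rule is an assumption, since the text says only that the threshold tracks background noise: here it moves a fraction `beta` of the way toward the quiet frame's log energy plus a `margin`. The class name and all default values are hypothetical.

```python
import math


class EnergyDiscriminator:
    """Compares log frame energy with a threshold that adapts during silence."""

    def __init__(self, threshold=3.0, beta=0.1, margin=0.5):
        self.threshold = threshold  # in log10 energy units
        self.beta = beta            # assumed adaptation rate
        self.margin = margin        # assumed headroom above the noise floor

    def process(self, frame):
        """Return the energy flag (1 or 0) for one frame."""
        r0 = sum(x * x for x in frame)           # autocorrelation at lag 0
        log_energy = math.log10(max(r0, 1e-12))  # floor avoids log of zero
        if log_energy > self.threshold:
            return 1                             # threshold left unchanged
        # Energy flag is zero: let the threshold follow the background level.
        self.threshold = ((1.0 - self.beta) * self.threshold
                          + self.beta * (log_energy + self.margin))
        return 0
```

Updating only on quiet frames keeps loud speech from dragging the threshold upward, which matches the text's statement that the update happens only when the energy flag is zero.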
- In FIG. 14G, a flow diagram of the process for the instantaneous energy discriminator 1455 is illustrated.
- the speech input samples are framed by the framer 1450.
- the steps 1465 and 1466 are begun in parallel within the instantaneous energy discriminator 1455.
- the autocorrelation of the frame is determined by the equation in block 1465.
- the previous autocorrelation calculation (i.e. prevR[0])
- the instantaneous energy level process jumps to step 1469.
- At step 1466, the autocorrelation of the frame is made using a 10th order delayed sample as shown by the equation illustrated in block 1466.
- a 10th order correlation is made using this equation.
- the process for the instantaneous energy discriminator jumps to step 1467.
- the root mean square calculation from the 10th sample is made. Additionally, the previous root mean square calculation of the correlation r[10] is updated. After completing step 1467, the process jumps to step 1468.
- the instantaneous energy discriminator process determines the difference between the root mean square value of the autocorrelation of the current frame's tenth delayed sample and the root mean square value of the autocorrelation of the previous frame's tenth delayed sample, using the equation in block 1468. After completion of the calculation in step 1468, the instantaneous energy discriminator process 1455 jumps to step 1469.
- the difference r10 corresponds to the difference of higher-order harmonics (representative of speech) between two consecutive frames (possibly of speech).
- If this value is greater than the previous frame's autocorrelation multiplied by a starting threshold, it more likely represents a speaker's change in pitch (e.g. a speaker goes from talking in a normal voice to a high-pitched voice) rather than an instantaneous burst of noise.
- If at step 1469 it is determined that the difference is less than or equal to the previous frame's autocorrelation multiplied by the starting threshold, then the instantaneous energy discriminator clears the instantaneous flag and goes on to process the next frame of data.
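The instantaneous energy discriminator steps above can be sketched as follows. The equations of blocks 1465 to 1468 are not reproduced in this text, so the exact forms used here are assumptions: the lag-10 autocorrelation is a plain sum of products, the root mean square is taken as the square root of its magnitude divided by the frame length, and all names are hypothetical.

```python
import math


def autocorr(frame, lag):
    """Autocorrelation of one frame at the given sample delay."""
    return sum(frame[n] * frame[n - lag] for n in range(lag, len(frame)))


class InstantaneousEnergyDiscriminator:
    """Hypothetical sketch of steps 1465 to 1469."""

    def __init__(self, start_threshold):
        self.start_threshold = start_threshold
        self.prev_r0 = 0.0      # prevR[0], previous frame's autocorrelation
        self.prev_rms10 = 0.0   # previous frame's RMS of r[10]

    def process(self, frame):
        r0 = autocorr(frame, 0)                    # step 1465
        r10 = autocorr(frame, 10)                  # step 1466
        rms10 = math.sqrt(abs(r10) / len(frame))   # step 1467 (assumed form)
        diff = rms10 - self.prev_rms10             # step 1468
        # Step 1469: compare against the previous frame's autocorrelation
        # scaled by the starting threshold.
        flag = 1 if diff > self.prev_r0 * self.start_threshold else 0
        self.prev_r0, self.prev_rms10 = r0, rms10  # update history
        return flag
```

Because the history starts at zero, the very first frame will usually set the flag; a real implementation would need some start-up handling that the text here does not describe.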
- the ALC 1401 reads the voice_activity flag and applies gain control if voice is detected. Otherwise if the voice_activity flag indicates silence, the ALC 1401 does not apply gain and passes Sout through without amplitude change as its output.
- the packetizer/encoder 1409 reads the voice activity flag to determine if a current frame of data contains a valid voice signal or not. If the current frame is voice, then the output from the ALC needs to be added into the PCM payload. If the current frame is silence and an SID has been generated by the VAD 1405, the packetizer/encoder 1409 reads the SID information stored in the shared memory in order for it to be packetized.
- the ALC 1401 functions in response to the VAD 1405.
- the VAD 1405 may look over the last one or more frames of data to determine whether or not the ALC information should be added to a frame or not.
- the ALC 1401 applies gain control if voice is detected; otherwise Sout is passed through without any change.
- the tone detector 1404 disables and enables the ALC 1401 as described above to comply with the G.169 specification. Additionally, the ALC 1401 is disabled when the Sout signal level goes below a certain threshold (-40 dBm for example) after echo cancellation by the echo canceller 1403. If the current frame contains valid voice data, then the output gain information from the ALC 1401 is added to the PCM payload by the packetizer. Otherwise, if silence is detected, the packetizer uses the SID information to generate packets to be sent as the send_packets.
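The gating of the ALC and the packetizer's choice of payload described above can be sketched as follows. This is an illustrative simplification: the function names, the flat `gain` argument, and the tuple-based payload representation are assumptions, and only the -40 dBm example threshold comes from the text.

```python
def apply_alc(sout, voice_activity, gain, level_dbm,
              tone_disable=False, min_level_dbm=-40.0):
    """Return the ALC output for one frame of echo-cancelled samples.

    The ALC is bypassed when the tone detector has disabled it, when the
    VAD reports silence, or when the signal level is below the threshold.
    """
    bypass = tone_disable or not voice_activity or level_dbm < min_level_dbm
    if bypass:
        return list(sout)  # pass Sout through without amplitude change
    return [gain * s for s in sout]


def select_payload(voice_activity, alc_out, sid):
    """Pick what the packetizer sends for the current frame."""
    if voice_activity:
        return ("PCM", alc_out)   # voice frame: ALC output goes into the PCM payload
    if sid is not None:
        return ("SID", sid)       # silence with an SID generated by the VAD
    return None                   # silence and no SID: nothing to packetize
```

For example, `apply_alc([1, 2], True, 2.0, -20.0)` scales the frame, while the same call with `level_dbm=-45.0` returns the samples unchanged.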
- the DTMF detector 1406 functions in response to the output from the ALC 1401.
- the DTMF detector 1406 uses an internal frame size of 102 data samples but it adapts to any frame size of data samples.
- DTMF signaling events for a current frame are recorded in an InterFB area of shared memory.
- High level programs use DTMF signaling events stored in the InterFB area. Typically the high level program reads all the necessary info and then clears the contents for future use.
- the DTMF detector 1406 may read the VAD_activity flag to determine if voice signals are detected. If so, the DTMF detector may not execute until other signal types, such as tones, are detected. If the DTMF detector detects that a current frame of data contains valid DTMF digits, then a special DTMF payload is generated for the packetizer. The special DTMF payload contains relevant information needed to faithfully regenerate DTMF digits at the other end.
- the packetizer/encoder generates DTMF packets for transmission over the send_packet output.
- the Packetizer/Encoder 1409 includes a packet header of 1 byte to indicate which data type is being carried in the payload. The payload format depends on the data being transported. For example, if the payload contains PCM data, then the packet will be considerably larger than an SID packet for generating comfort noise.
- the packetizing may be implemented as part of the integrated telecommunications processor or it may be performed by an external network processor.
- the Depacketizer/Decoder 1410 receives a stream of packets over rx_packet and first determines what type of packet it is by looking at the packet header. After making a determination as to the type of packet received, the appropriate decoding algorithm can be executed by the integrated telecommunications processor.
- the type of packets and their possible decoding functions include Comfort Noise Generation (CNG), DTMF Generation, and PCM/Voice decoding.
- the Depacketizer/Decoder 1410 generates frames of data which are used as Rin. In many cases, a single frame of data is generated by one packet of data.
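The one-byte packet header and the depacketizer's dispatch on packet type can be sketched as follows. The numeric type codes and the handler names are hypothetical; the source only states that a 1-byte header identifies the payload type and that the possible decoding functions are comfort noise generation, DTMF generation, and PCM/voice decoding.

```python
from enum import IntEnum


class PayloadType(IntEnum):
    """Hypothetical header codes; the source does not list the values."""
    PCM = 0
    SID = 1
    DTMF = 2


def packetize(ptype, payload):
    """Prefix the payload bytes with a 1-byte header naming the data type."""
    return bytes([ptype]) + payload


def depacketize(packet):
    """Read the header byte and choose the decoding function by name."""
    ptype = PayloadType(packet[0])
    handlers = {
        PayloadType.PCM: "pcm_voice_decode",
        PayloadType.SID: "comfort_noise_generate",
        PayloadType.DTMF: "dtmf_generate",
    }
    return handlers[ptype], packet[1:]
```

A PCM packet carries a whole frame of samples after the header, whereas an SID packet carries only a short noise description, which is why the former is much larger.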
- the comfort noise generator (CNG) 1420 receives commands from the depacketizer/decoder 1410 to generate a "comfortable" pink noise in response to receiving an SID frame as a payload in a packet on the rx_packet.
- the comfort noise generator (CNG) 1420 generates the "comfortable" pink noise at a level corresponding to the noise power indicated in the SID frame.
- the comfort noise generated can have any spectral characteristics and is not limited to pink noise.
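Generating comfort noise at the level indicated by an SID frame can be sketched as follows. This sketch uses white Gaussian noise rather than pink noise, which the text permits since the noise may have any spectral characteristics. Treating the SID noise power as a mean-square sample value, and the function name itself, are assumptions.

```python
import math
import random


def comfort_noise(noise_power, n_samples, seed=None):
    """Return n_samples of white noise whose mean-square value is noise_power."""
    rng = random.Random(seed)
    raw = [rng.gauss(0.0, 1.0) for _ in range(n_samples)]
    raw_power = sum(x * x for x in raw) / n_samples
    scale = math.sqrt(noise_power / raw_power)  # normalize to the requested level
    return [scale * x for x in raw]
```

Scaling by the measured power of the generated frame makes each output frame match the requested level exactly, at the cost of removing the natural frame-to-frame variation of the noise.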
- the DTMF Generator 1407 receives commands from the depacketizer and generates DTMF tones in response to the depacketizer receiving a DTMF payload in a packet on rx_packet.
- the DTMF tones generated by the DTMF Generator 1407 correspond to amplitude levels, key, and possibly duration of the corresponding DTMF digit described in the DTMF payload.
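Regenerating a DTMF digit from the key, amplitude, and duration carried in a DTMF payload can be sketched as follows. The row and column frequencies are the standard DTMF pairs; the 8000 Hz sample rate, the equal weighting of the two tones, and the function name are assumptions made for illustration.

```python
import math

# Standard DTMF (row, column) frequency pairs in Hz.
DTMF_FREQS = {
    "1": (697, 1209), "2": (697, 1336), "3": (697, 1477), "A": (697, 1633),
    "4": (770, 1209), "5": (770, 1336), "6": (770, 1477), "B": (770, 1633),
    "7": (852, 1209), "8": (852, 1336), "9": (852, 1477), "C": (852, 1633),
    "*": (941, 1209), "0": (941, 1336), "#": (941, 1477), "D": (941, 1633),
}


def dtmf_tone(key, amplitude, duration_s, sample_rate=8000):
    """Return samples of the dual tone for one key.

    Each sinusoid is weighted by 0.5 so the peak never exceeds amplitude.
    """
    low, high = DTMF_FREQS[key]
    n_samples = int(round(duration_s * sample_rate))
    step = 2.0 * math.pi / sample_rate
    return [
        amplitude * 0.5 * (math.sin(step * low * n) + math.sin(step * high * n))
        for n in range(n_samples)
    ]
```

For example, `dtmf_tone("5", 1000.0, 0.05)` yields 400 samples mixing 770 Hz and 1336 Hz.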
- FIG. 15 illustrates an exemplary memory map for the global buffer memory 210 to which each of the core processors 200 has access.
- the program memory 204 and the data memory 202 for each of four core processors 200A-200D (Core 0 to Core 3) are also illustrated in Figure 15 as being stacked upon each other.
- the program memory 204C and the data memory 202C for the core processor 200C (Core 2) is expanded in Figure 15 to show an exemplary memory map.
- Figure 15 also illustrates the file registers 413 for one of the core processors, core processor 200C (Core 2).
- the memory of the integrated telecommunications processor 150 provides for flexibility in how each communication channel is processed. Firmware and data can be swapped in and out of the core processors 200 when processing a different job. Each job can vary by channel, by frame, by data blocks or otherwise with changes to the firmware. In one embodiment, each job is described for a given frame and a given channel. By providing the functionality in firmware and swapping the code into and out of program memory of the core processors 200, the functionality of the integrated telecommunications processor 150 can be easily modified and upgraded.
- Figure 15 also illustrates the interrelationship between the global buffer memory 210, data memory 202 for the core processors 200, and the register files 413 in the signal processing units 300 of each core processor 200.
- the multichannel memory movement engine 208 flexibly and efficiently manages the memory mapping so as to extract the maximum efficiency out of each of the algorithm signal processors 300 for a scalable number of channels. That is, the integrated telecommunications processor 150 can support a varying number of communication channels, which is scalable by adding additional core processors, because the signal processing algorithms and data stored in memory are easily swapped into and out of many core processors. Furthermore, the memory movement engine 208 can sequence through different signal processing algorithms to provide differing module functionality for each channel.
- All algorithm data and code segments are completely relocatable in any memory space in which they are stored. This allows processing of each frame of data to be completely independent from the processing of any other frame of data for the same channel. In fact, any frame of data may be processed on any available signal processor 300. This allows maximum utilization of the processor resources at all times.
- Frame processing can be partitioned into several pieces corresponding to algorithm specific functional blocks such as those for the integrated telecommunications processor illustrated in Figures 11-14.
- the "fixed" (non-changing) code and data segments associated with each of these functional blocks can be independently located in a memory space which is not fixed, and only one copy of these segments need be kept regardless of the number of channels which are to be supported. This data can be downloaded and/or upgraded at any time prior to its use.
- a table of pointers, for example, can be used to specify where each of these blocks currently resides in a memory space.
- dynamic data spaces required by the algorithms, which are modifiable, can be allocated at run-time and de-allocated when no longer needed.
- DMA can be utilized if the code and/or data segments for a functional block must be transferred from one memory space to another memory space in order to reduce the overhead associated with processor intervention in such transfer. Since the code and data blocks required by any functional block are completely independent of each other, "chains" of DMA transfers can be defined and executed to transfer multiple blocks from one memory space to another without processor intervention. These "chains" can be created or updated when needed based on the current processing requirements for a particular channel using the "catalog" of functional blocks currently available. A DMA module creating a description of DMA transfers can optimize the use of the destination memory space by locating the segments wherever necessary to minimize wasted space.
- the Global buffer memory 210 includes an Algorithm Processing (AP) Catalog 1500, Dynamic Data Blocks 1515, Frame Data Buffers 1520, Functional-Block (FB) & Script Header Tables 1525, Channel Control Structures 1530, DMA Descriptors List 1535, and a Channel Execution Queue 1540.
- Figure 16 is a block diagram illustrating another exemplary memory map for the global buffer memory 210 of the integrated telecommunications processor 150 and the inter-relationship of the blocks contained therein.
- the Algorithm Processing (AP) Catalog 1500 includes channel independent, algorithm specific constant data segments, code data segments and parameter data segments for any algorithm which may be required in the integrated telecommunications processor system. These algorithms include telecommunication modules for Echo cancellation (EC), tone detection and generation (TD), DTMF detection and generation (DTMF), G.7xx CODECs, and other functional modules.
- Examples of the code data segments include DTMF code 1501, TD code 1502, and EC code 1503 for the DTMF, TD and EC algorithms respectively.
- Examples of the algorithm specific constant data segments include DTMF constants 1504, TD constants 1505, and EC constants 1506 for the DTMF, TD and EC algorithms respectively.
- the parameter data segments include DTMF parameters 1507, TD parameters 1508, and EC parameters 1509 for the DTMF, TD and EC algorithms respectively.
- the Algorithm Processing (AP) Catalog 1500 also includes a set of scripts (each containing a script data, script code, and a script DMA template) for each kind of frame processing required by the system.
- the same script may be used for multiple channels, if these channels all require the same processing.
- the scripts do not contain any channel specific information.
- Figure 15 illustrates script 1 data 1511A, script 1 code 1512A, and a script 1 DMA template 1513A through script N data 1511N, script N code 1512N, and script N DMA template 1513N.
- the script 1 blocks (script 1 data 1511A, script 1 code 1512A, script 1 DMA template 1513A) in the AP catalog 1500 define the functional blocks required to accomplish specific processing of a frame of data of any channel which requires the processing defined by this script, together with the addresses in the program memory 204 where the functional block code should be transferred and in the data memory 202 where the data segments should be transferred. Alternatively, these addresses in the program memory 204 and data memory 202 could be determined at run time by a core memory management function.
- the script 1 blocks also specify the order of execution of the functional blocks by one of the core processors 200.
- the script 1 code 1512A for example may define the functional blocks and order of execution required to accomplish echo cancellation and DTMF detection.
- script 1 blocks can specify "conditional" data transfer and execution, such as a data transfer or an execution which depends on another functional block's results.
- conditional data transfers may include those surrounding the functional blocks such as whether or not call progress tones are detected.
- the script DMA template associated with each script block is used to construct the one or more channel specific DMA descriptors in the DMA descriptors list 1535 in the global buffer memory 210.
- the global buffer memory 210 also includes a table of Functional Block and Script Headers referred to as the FB and Script Header tables 1525.
- the FB and Script Header tables 1525 include the size and the global buffer memory starting addresses for each of the functional block segments and script segments contained in the AP Catalog 1500.
- the DTMF header table includes the size and starting addresses for the DTMF code 1501, the DTMF constants 1504 and the DTMF parameters 1507.
- a script 1 header table includes the size and starting addresses for the script 1 data 1511A, the script 1 code 1512A, and the script 1 DMA template 1513A.
- FB and Script Headers table 1525 in essence points to these blocks in the AP catalog 1500 including others such as the EC Code 1503, the EC constants 1506 and the EC Parameters 1509.
- the contents of the FB and Script Header tables 1525 are updated whenever a new AP catalog 1500 is loaded or an existing AP catalog 1500 is updated in the global buffer memory 210.
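The header tables described above amount to a lookup from segment name to size and starting address, rebuilt whenever the catalog changes. A minimal sketch follows. Packing the segments contiguously from a base address, along with the type, function, and segment names, are assumptions; the source only says that sizes and starting addresses are recorded.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SegmentHeader:
    start: int  # starting address in the global buffer memory
    size: int   # segment size in words


def build_header_table(catalog, base=0):
    """Build a name -> SegmentHeader table for an ordered AP catalog.

    catalog is a list of (segment_name, size) pairs. Segments are assumed
    to be packed back to back starting at base.
    """
    table = {}
    address = base
    for name, size in catalog:
        table[name] = SegmentHeader(start=address, size=size)
        address += size
    return table
```

Because code only ever reaches a segment through this table, a catalog can be reloaded at different addresses and the table rebuilt without touching the scripts that refer to the segments by name.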
- the global buffer memory also has channel specific data segments consisting of dynamic data blocks 1515 and frame data buffers 1520.
- the dynamic data blocks 1515 illustrated in the exemplary map of Figure 15 include the dynamic data blocks for channel n (CHn) through channel p (CHp).
- the type of dynamic data blocks for each channel corresponds to the functional modules used in each channel.
- channel n has EC dynamic data blocks, TD dynamic data blocks, DTMF dynamic data blocks, and G.7xx codec dynamic data blocks.
- the dynamic data blocks required for channel 10 are ch10-DTMF, ch10-EC and ch10-TD; those required for channel 102 are ch102-EC and ch102-G.7xx; and that required for channel 86 is ch86-EC.
- the frame data buffers 1520 include channel specific data segments for each channel for the far in data, far out data, near in data and near out data.
- the near in data and near out data are for the PSTN network side while the far in data and the far out data are for the packet network side.
- n channels may be supported such that there may be n sets of channel specific dynamic data segments and n sets of channel specific frame buffer data segments. In Figure 16, the channel specific frame data segments include ch10-Near In data, ch10-Near Out data, ch10-Far In data, ch10-Far Out data, ch102-Near In, ch102-Far In, ch102-Near Out and ch102-Far Out in the frame data buffers 1520.
- the channel specific data segments and the channel specific frame data segments allow the integrated telecommunications processor 150 to process a wide variety of communication channels having differing parameters at the same time.
- the set of channel control structures 1530 in the global buffer memory 210 includes all information required to process the data for a particular channel.
- This information includes the channel endpoints (e.g. source and destination of TDM data, source and destination of packet data) and a description of the processing required (e.g. echo cancellation, VAD, DTMF, tone detection, coding, decoding, etc., to use). It also contains pointers to locate the data resources required for processing (e.g. the script, the dynamic data blocks, the DMA descriptor list, the TDM (near in and near out) buffers, and the packet data (far in and far out) buffers). Statistics regarding the channel are also maintained in the channel control structure. This includes such things as the number of frames processed, the channel state (e.g.
- the channel control structures include channel control structures for channel 10 and channel 102 each of which point to respective dynamic data blocks 1515 and frame data buffers 1520.
- the DMA Descriptor lists 1535 in the global buffer memory 210 define the source address, destination address, and size for every data transfer required between the global buffer memory 210 and the program memory 204 and data memory 202 for processing the data of a specific channel.
- n sets of DMA descriptor lists exist for processing n channels.
- Figure 15 illustrates the DMA descriptors list 1535 as including CHm DMA descriptors list through CHn DMA descriptors list.
- the DMA Descriptor Lists 1535 include CH 10 - DMA descriptors and CH 102 - DMA descriptors.
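Building a channel specific DMA descriptor list from a channel independent script DMA template can be sketched as follows. The template layout (segment name plus core destination address), the two lookup dictionaries, and all names are hypothetical; the source only states that each descriptor holds a source address, a destination address, and a size.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DmaDescriptor:
    source: int  # address in the global buffer memory
    dest: int    # address in a core's program or data memory
    size: int    # number of words to transfer


def build_descriptors(template, shared_segments, channel_segments):
    """Resolve a script DMA template into descriptors for one channel.

    template:         list of (segment_name, core_dest_address)
    shared_segments:  name -> (start, size) for channel independent segments
    channel_segments: name -> (start, size) for this channel's dynamic data
    """
    descriptors = []
    for name, dest in template:
        if name in channel_segments:
            start, size = channel_segments[name]
        else:
            start, size = shared_segments[name]
        descriptors.append(DmaDescriptor(source=start, dest=dest, size=size))
    return descriptors
```

The same template resolved with another channel's `channel_segments` gives that channel's list, which matches the text's point that scripts carry no channel specific information.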
- the global buffer memory 210 further has a Channel Execution Queue 1540.
- the Channel Execution Queue 1540 schedules and monitors processing jobs for all the core processors 200 of the integrated telecommunications processor 150. For example, when a frame of data for a particular channel is ready to be processed, a "management function" creates or updates the DMA descriptor list for that channel based on the Script and block addresses found in the FB headers of the FBH table 1525 and/or channel control structure found in the script block 1530. The job is then scheduled for processing by the Channel Execution Queue 1540.
- the DMA descriptor list 1535 includes the transfer of the script itself from the global buffer memory 210 to the data memory 202 and program memory
- the core addresses are specified in such a way that they are applicable to ANY core which may process the job.
- the same DMA descriptor list may be used to transfer data to any one of the cores in the system. In this way, all necessary information to process a frame of data can be constructed ahead of time, and any core which may then become available can perform the processing.
- Scheduled job 1 points to the Ch 10 - DMA descriptors in the DMA Descriptor list 1535 for frame 40 of channel 10.
- the scheduled job n points to the Ch 102 - DMA descriptors in the DMA Descriptor list 1535 to process frame 106 of channel 102.
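The scheduling idea above, in which each queued job carries only a reference to a prebuilt DMA descriptor list and any idle core may take the next job, can be sketched as follows. The class, the dictionary job layout, and the core identifiers are assumptions for illustration.

```python
from collections import deque


class ChannelExecutionQueue:
    """Hypothetical FIFO of jobs, each pointing at a DMA descriptor list."""

    def __init__(self):
        self._jobs = deque()

    def schedule(self, channel, frame, descriptor_list):
        self._jobs.append(
            {"channel": channel, "frame": frame, "dma": descriptor_list}
        )

    def take(self):
        """Return the oldest job, or None when the queue is empty."""
        return self._jobs.popleft() if self._jobs else None


def dispatch(queue, idle_cores):
    """Hand queued jobs to whichever cores are idle, in order."""
    assignments = []
    for core in idle_cores:
        job = queue.take()
        if job is None:
            break  # no data ready on any channel: remaining cores stay idle
        assignments.append((core, job["channel"], job["frame"]))
    return assignments
```

With frame 40 of channel 10 and frame 106 of channel 102 queued, `dispatch(queue, ["core2", "core0", "core3"])` assigns the first job to core2 and the second to core0, leaving core3 idle.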
- the upper portion of the program memory 204C and data memory 202C illustrates an example of the program memory 204C including script code 1550, DTMF code 1551 for the DTMF generation and detection, and EC code 1552 for the echo cancellation module.
- the code stored in the program memory 204 varies depending upon the needs of a given communication channel. In one embodiment, the code stored in the program memory 204 is swapped each time a new communication channel is processed by each core processor 200. In another embodiment, only the code that needs to be swapped out, removed or added is changed in the program memory 204 each time a new communication channel is processed by each core processor 200.
- the lower portion of the program memory 204C and data memory 202C illustrates the data memory 202C which includes script data 1560, interfunctional block data area 1561, DTMF constants 1504, DTMF Parameters 1507, CHn DTMF dynamic data 1562,
- These constants, variables, and parameters (i.e. data) stored in the data memory 202 vary depending upon the needs of a given communication channel.
- the data stored in the data memory 202 is swapped each time a new communication channel is processed by each core processor 200.
- FIG. 15 illustrates the Register File 413 for the core processor 200A (core 0).
- the register file 413 includes a serial port address map for the serial port 206 of the integrated telecommunications processor 150, a host port address map for the host port 214 of the integrated telecommunications processor 150, core processor 200A interrupt registers including DMA pointer address, DMA starting address, DMA stop address, DMA suspend address, DMA resume address, DMA status register, and a software interrupt register, and a semaphore address register. Jobs in the channel execution queue 1540 load the DMA pointer in the file registers 412 of the core processor.
- Figure 17 is an exemplary time line diagram of processing frames of data.
- the integrated telecommunications processor processes multiple frames of multiple channels.
- the time required to process a frame of data for any particular channel is in most cases much shorter than the time interval to receive the next complete frame of data.
- the time line diagram of Figure 17 illustrates two frames of data for a given channel, Frame X and Frame X+1, each requiring about twelve units of time to receive.
- the frame processing time is typically shorter and is illustrated in Figure 17 for example as requiring two units each to process Frame X and Frame X+1.
- the processing time for each frame is similar. Note that there is about ten units of delay time between the completion of processing of Frame X and the start of processing of Frame X+1. It would be an inefficient use of resources for a processor to sit idle during this delay time between received frames waiting for a new frame of data to be received in order to start processing.
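The timing figures of Figure 17 can be turned into a short worked calculation. With about twelve units to receive a frame and two units to process it, a core dedicated to one channel is busy for only one sixth of the time. The per-core channel count below is an idealized upper bound that ignores DMA and scheduling overhead; it is an inference from those figures, not a number stated in the source, and the function names are hypothetical.

```python
def core_utilization(frame_period, processing_time):
    """Fraction of time a core is busy when dedicated to one channel."""
    return processing_time / frame_period


def idle_time(frame_period, processing_time):
    """Time a dedicated core waits between frames of its one channel."""
    return frame_period - processing_time


def max_channels_per_core(frame_period, processing_time):
    """Idealized number of channels one core can interleave (no overhead)."""
    return frame_period // processing_time
```

For the Figure 17 values, `idle_time(12, 2)` gives the ten units of delay noted above, and `max_channels_per_core(12, 2)` gives six.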
- the integrated telecommunications processor 150 processes jobs for other channels and their respective frames of data instead of sitting idle between frames for one given channel.
- the integrated telecommunications processor 150 processes jobs which are completely channel and frame independent as opposed to processing one or more dedicated channels and their respective frames.
- Each frame of data for any given channel can be processed on any available core processor 200.
- FIG. 18 is an exemplary time line diagram of how one or more core processors 200A-200N of the integrated telecommunications processor 150 process jobs on frames of data for multiple communication channels.
- the arrows 1801A-1801E in Figure 18 represent jobs or idle time for the core processor 1 200A.
- the arrows 1802A-1802D represent jobs or idle time for the core processor 2 200B.
- the arrows 1803A-1803E represent jobs or idle time for the core processor N 200N.
- Arrows 1801D and 1803C illustrate idle time for core processor 1 and core processor N respectively. Idle times occur for a core processor only when there is no data available for processing on any currently active channel.
- the Ch### nomenclature above the arrows refers to the channel identifier of the job that is being processed over that time period by a given core processor 200.
- the Fr### nomenclature above the arrows refers to the frame identifier for the respective channel of the job that is being processed over that time period by the given core processor 200.
- the jobs, including a job description, are stored in the channel execution queue
- all channel specific information is stored in the Channel Control Structure, and all required information for processing the job is contained in the (channel independent) script code and script data, and the (channel dependent) DMA descriptor list which is constructed prior to scheduling the job.
- the job description stored in the channel execution queue therefore, need only contain a pointer to the DMA descriptor list.
- Core processor 200A processes job 1801A, job 1801B, job 1801C, waits during idle 1801D, and processes job 1801E.
- the arrow or job 1801A is a job which is performed by core processor 1 200A on the data of frame 10 of channel 5.
- the arrow or job 1801B is a job on the data of frame 2 of channel 40 by the core processor 1 200A.
- the arrow or job 1801C is a job on the data of frame 102 of channel 0 by the core processor 1 200A.
- the arrow or job 1801E is a job on the data of frame 11 of channel 87 by the core processor 1 200A.
- core processor 1 200A is idle for a short period of time during arrow or idle 1801D and is otherwise used to process multiple jobs.
- Figure 18 illustrates an example of how job processing of frames of multiple telecommunication channels can be distributed across multiple core processors 200 over time in one embodiment of the integrated telecommunications processor 150.
- the number of channels supportable by the integrated telecommunications processor 150 is scalable.
- the processing power in each core processor 200 may be increased, for example, by faster hardware (such as faster transistors with narrower channel lengths) or by improved software algorithms.
- the present invention has many advantages.
- One advantage of the present invention is that telephony processing is integrated into one processor.
- Another advantage of the present invention is that improved telephone communication channels are provided between a time division multiplexed (TDM) telephone network and a packetized network.
- Another advantage of the present invention is that all the telecommunications modules couple together as a unit and the interrelationships among different modules can then be exploited.
- the present invention enables aggregating a large number of TDM channels by providing all Telephony functions, compression, decompression and transceiving as separate packet channels over a packet network.
- the control mechanism of the present invention can process the data inputs and outputs of different TDM channels and sequence them efficiently for channel based signal processing in the hardware.
- While the present invention has been described in particular embodiments, it may be implemented in hardware, software, firmware or a combination thereof and utilized in systems, subsystems, components or sub-components thereof.
- When implemented in software, the elements of the present invention are essentially the code segments to perform the necessary tasks.
- the program or code segments can be stored in a processor readable medium or transmitted by a computer data signal embodied in a carrier wave over a transmission medium or communication link.
- the "processor readable medium" may include any medium that can store or transfer information.
- Examples of the processor readable medium include an electronic circuit, a semiconductor memory device, a ROM, a flash memory, an erasable ROM (EROM), a floppy diskette, a CD-ROM, an optical disk, a hard disk, a fiber optic medium, a radio frequency (RF) link, etc.
- the computer data signal may include any signal that can propagate over a transmission medium such as electronic network channels, optical fibers, air, electromagnetic, RF links, etc.
- the code segments may be downloaded via computer networks such as the Internet, Intranet, etc. In any case, the present invention should not be construed as limited by such embodiments, but rather construed according to the claims.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
- Telephonic Communication Services (AREA)
- Advance Control (AREA)
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA002422197A CA2422197A1 (en) | 2000-09-09 | 2001-09-05 | Voice activity detector for integrated telecommunications processing |
AU2001288793A AU2001288793A1 (en) | 2000-09-09 | 2001-09-05 | Voice activity detector for integrated telecommunications processing |
EP01968553A EP1319226A2 (en) | 2000-09-09 | 2001-09-05 | Voice activity detector for integrated telecommunications processing |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US23151000P | 2000-09-09 | 2000-09-09 | |
US60/231,510 | 2000-09-09 | ||
US09/938,104 US20020116186A1 (en) | 2000-09-09 | 2001-08-23 | Voice activity detector for integrated telecommunications processing |
US09/938,104 | 2001-08-23 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2002021507A2 true WO2002021507A2 (en) | 2002-03-14 |
WO2002021507A3 WO2002021507A3 (en) | 2002-05-30 |
Family
ID=26925179
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2001/027596 WO2002021507A2 (en) | 2000-09-09 | 2001-09-05 | Voice activity detector for integrated telecommunications processing |
Country Status (6)
Country | Link |
---|---|
US (1) | US20020116186A1 (en) |
EP (1) | EP1319226A2 (en) |
CN (1) | CN1473321A (en) |
AU (1) | AU2001288793A1 (en) |
CA (1) | CA2422197A1 (en) |
WO (1) | WO2002021507A2 (en) |
Families Citing this family (74)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2350532B (en) * | 1999-05-28 | 2001-08-08 | Mitel Corp | Method to generate telephone comfort noise during silence in a packetized voice communication system |
US7012901B2 (en) * | 2001-02-28 | 2006-03-14 | Cisco Systems, Inc. | Devices, software and methods for generating aggregate comfort noise in teleconferencing over VoIP networks |
US7130281B1 (en) * | 2001-03-30 | 2006-10-31 | Cisco Technology, Inc. | Devices, softwares and methods with improved performance of acoustic echo canceler in VoIP communication |
EP1391106B1 (en) * | 2001-04-30 | 2014-02-26 | Polycom, Inc. | Audio conference platform with dynamic speech detection threshold |
US7941313B2 (en) * | 2001-05-17 | 2011-05-10 | Qualcomm Incorporated | System and method for transmitting speech activity information ahead of speech features in a distributed voice recognition system |
US7203643B2 (en) * | 2001-06-14 | 2007-04-10 | Qualcomm Incorporated | Method and apparatus for transmitting speech activity in distributed voice recognition systems |
US20030110029A1 (en) * | 2001-12-07 | 2003-06-12 | Masoud Ahmadi | Noise detection and cancellation in communications systems |
US7043006B1 (en) * | 2002-02-13 | 2006-05-09 | Aastra Intecom Inc. | Distributed call progress tone detection system and method of operation thereof |
US20030174657A1 (en) * | 2002-03-18 | 2003-09-18 | Wenlong Qin | Method, system and computer program product for voice active packet switching for IP based audio conferencing |
US20030212550A1 (en) * | 2002-05-10 | 2003-11-13 | Ubale Anil W. | Method, apparatus, and system for improving speech quality of voice-over-packets (VOP) systems |
EP1443498B1 (en) * | 2003-01-24 | 2008-03-19 | Sony Ericsson Mobile Communications AB | Noise reduction and audio-visual speech activity detection |
US20050015244A1 (en) * | 2003-07-14 | 2005-01-20 | Hideki Kitao | Speech section detection apparatus |
KR20050045764A (en) * | 2003-11-12 | 2005-05-17 | 삼성전자주식회사 | Apparatus and method for recording and playing voice in the wireless telephone |
JP4601970B2 (en) * | 2004-01-28 | 2010-12-22 | 株式会社エヌ・ティ・ティ・ドコモ | Sound / silence determination device and sound / silence determination method |
JP4490090B2 (en) * | 2003-12-25 | 2010-06-23 | 株式会社エヌ・ティ・ティ・ドコモ | Sound / silence determination device and sound / silence determination method |
KR100546780B1 (en) * | 2003-12-26 | 2006-01-25 | 한국전자통신연구원 | Voice over packet system using a plurality of digital signal processors and speech processing method therein |
US20050216260A1 (en) * | 2004-03-26 | 2005-09-29 | Intel Corporation | Method and apparatus for evaluating speech quality |
US8315865B2 (en) * | 2004-05-04 | 2012-11-20 | Hewlett-Packard Development Company, L.P. | Method and apparatus for adaptive conversation detection employing minimal computation |
US7756594B2 (en) * | 2004-06-14 | 2010-07-13 | Microsoft Corporation | Systems and methods for parsing flexible audio codec topologies |
US7254535B2 (en) * | 2004-06-30 | 2007-08-07 | Motorola, Inc. | Method and apparatus for equalizing a speech signal generated within a pressurized air delivery system |
US7139701B2 (en) * | 2004-06-30 | 2006-11-21 | Motorola, Inc. | Method for detecting and attenuating inhalation noise in a communication system |
US7155388B2 (en) | 2004-06-30 | 2006-12-26 | Motorola, Inc. | Method and apparatus for characterizing inhalation noise and calculating parameters based on the characterization |
US7590065B2 (en) * | 2004-08-04 | 2009-09-15 | Microsoft Corporation | Equal-opportunity bandwidth regulation |
US20060031607A1 (en) * | 2004-08-05 | 2006-02-09 | Microsoft Corporation | Systems and methods for managing input ring buffer |
US7917356B2 (en) | 2004-09-16 | 2011-03-29 | AT&T Corporation | Operating method for voice activity detection/silence suppression system |
US7706901B2 (en) * | 2004-10-01 | 2010-04-27 | Microsoft Corporation | Low latency real-time audio streaming |
KR100677396B1 (en) * | 2004-11-20 | 2007-02-02 | 엘지전자 주식회사 | A method and a apparatus of detecting voice area on voice recognition device |
US20060149536A1 (en) * | 2004-12-30 | 2006-07-06 | Dunling Li | SID frame update using SID prediction error |
EP1681670A1 (en) * | 2005-01-14 | 2006-07-19 | Dialog Semiconductor GmbH | Voice activation |
WO2006104576A2 (en) * | 2005-03-24 | 2006-10-05 | Mindspeed Technologies, Inc. | Adaptive voice mode extension for a voice activity detector |
US20060241937A1 (en) * | 2005-04-21 | 2006-10-26 | Ma Changxue C | Method and apparatus for automatically discriminating information bearing audio segments and background noise audio segments |
US7808936B2 (en) * | 2005-05-09 | 2010-10-05 | J2 Global Communications, Inc. | Systems and methods for facsimile echo cancellation |
US20070033042A1 (en) * | 2005-08-03 | 2007-02-08 | International Business Machines Corporation | Speech detection fusing multi-class acoustic-phonetic, and energy features |
US7962340B2 (en) * | 2005-08-22 | 2011-06-14 | Nuance Communications, Inc. | Methods and apparatus for buffering data for use in accordance with a speech recognition system |
CN101507349B (en) * | 2006-08-22 | 2012-08-22 | 株式会社Ntt都科摩 | Radio resource opening/controlling method, radio base station and mobile station |
KR100932913B1 (en) * | 2007-12-06 | 2009-12-21 | 한국전자통신연구원 | Complex switch and switching method for processing IP data and voice signal simultaneously |
CN101859568B (en) * | 2009-04-10 | 2012-05-30 | 比亚迪股份有限公司 | Method and device for eliminating voice background noise |
US8560313B2 (en) * | 2010-05-13 | 2013-10-15 | General Motors Llc | Transient noise rejection for speech recognition |
JP2011015032A (en) * | 2009-06-30 | 2011-01-20 | Brother Industries Ltd | Communication apparatus |
WO2011069293A1 (en) * | 2009-12-10 | 2011-06-16 | 华为技术有限公司 | Method, apparatus and system for speech coding and decoding |
US8626498B2 (en) * | 2010-02-24 | 2014-01-07 | Qualcomm Incorporated | Voice activity detection based on plural voice activity detectors |
EP3493205B1 (en) | 2010-12-24 | 2020-12-23 | Huawei Technologies Co., Ltd. | Method and apparatus for adaptively detecting a voice activity in an input audio signal |
CN102971789B (en) * | 2010-12-24 | 2015-04-15 | 华为技术有限公司 | A method and an apparatus for performing a voice activity detection |
EP2828854B1 (en) | 2012-03-23 | 2016-03-16 | Dolby Laboratories Licensing Corporation | Hierarchical active voice detection |
CN103116402B (en) * | 2013-02-05 | 2016-01-20 | 威盛电子股份有限公司 | There is computer system and the sound control method of voice control function |
CN105379308B (en) | 2013-05-23 | 2019-06-25 | 美商楼氏电子有限公司 | Microphone, microphone system and the method for operating microphone |
US10020008B2 (en) | 2013-05-23 | 2018-07-10 | Knowles Electronics, Llc | Microphone and corresponding digital interface |
US9711166B2 (en) | 2013-05-23 | 2017-07-18 | Knowles Electronics, Llc | Decimation synchronization in a microphone |
US20140358552A1 (en) * | 2013-05-31 | 2014-12-04 | Cirrus Logic, Inc. | Low-power voice gate for device wake-up |
CN104424956B9 (en) | 2013-08-30 | 2022-11-25 | 中兴通讯股份有限公司 | Activation tone detection method and device |
US9502028B2 (en) | 2013-10-18 | 2016-11-22 | Knowles Electronics, Llc | Acoustic activity detection apparatus and method |
US9147397B2 (en) | 2013-10-29 | 2015-09-29 | Knowles Electronics, Llc | VAD detection apparatus and method of operating the same |
DE102014207417A1 (en) * | 2014-04-17 | 2015-10-22 | Robert Bosch Gmbh | Interface unit |
CN105261375B (en) | 2014-07-18 | 2018-08-31 | 中兴通讯股份有限公司 | Activate the method and device of sound detection |
US9826558B2 (en) * | 2014-08-25 | 2017-11-21 | Echostar Technologies L.L.C. | Wireless mute device and method |
US9704507B2 (en) * | 2014-10-31 | 2017-07-11 | Ensequence, Inc. | Methods and systems for decreasing latency of content recognition |
US9667801B2 (en) | 2014-12-05 | 2017-05-30 | Facebook, Inc. | Codec selection based on offer |
US9729601B2 (en) | 2014-12-05 | 2017-08-08 | Facebook, Inc. | Decoupled audio and video codecs |
US10506004B2 (en) * | 2014-12-05 | 2019-12-10 | Facebook, Inc. | Advanced comfort noise techniques |
US9729287B2 (en) | 2014-12-05 | 2017-08-08 | Facebook, Inc. | Codec with variable packet size |
US10469630B2 (en) | 2014-12-05 | 2019-11-05 | Facebook, Inc. | Embedded RTCP packets |
US9729726B2 (en) | 2014-12-05 | 2017-08-08 | Facebook, Inc. | Seamless codec switching |
WO2016118480A1 (en) | 2015-01-21 | 2016-07-28 | Knowles Electronics, Llc | Low power voice trigger for acoustic apparatus and method |
CN104581538B (en) * | 2015-01-28 | 2018-03-02 | 三星电子(中国)研发中心 | The method and apparatus to abate the noise |
US10121472B2 (en) | 2015-02-13 | 2018-11-06 | Knowles Electronics, Llc | Audio buffer catch-up apparatus and method with two microphones |
CN106328169B (en) * | 2015-06-26 | 2018-12-11 | 中兴通讯股份有限公司 | A kind of acquisition methods, activation sound detection method and the device of activation sound amendment frame number |
US9478234B1 (en) | 2015-07-13 | 2016-10-25 | Knowles Electronics, Llc | Microphone apparatus and method with catch-up buffer |
US10651827B2 (en) * | 2015-12-01 | 2020-05-12 | Marvell Asia Pte, Ltd. | Apparatus and method for activating circuits |
US11455985B2 (en) * | 2016-04-26 | 2022-09-27 | Sony Interactive Entertainment Inc. | Information processing apparatus |
FR3054362B1 (en) | 2016-07-22 | 2022-02-04 | Dolphin Integration Sa | SPEECH RECOGNITION CIRCUIT AND METHOD |
US10180820B2 (en) | 2016-09-30 | 2019-01-15 | Hewlett Packard Enterprise Development LP | Multiply-accumulate circuits |
TWI713016B (en) * | 2019-01-03 | 2020-12-11 | 瑞昱半導體股份有限公司 | Speech detection processing system and speech detection method |
CN110083465B (en) * | 2019-04-26 | 2021-08-17 | 上海连尚网络科技有限公司 | Data transmission method between boarded applications |
CN111405131B (en) * | 2020-03-30 | 2021-04-20 | 深圳震有科技股份有限公司 | Method, system and storage medium for detecting far-end off-hook signal |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0734012A2 (en) * | 1995-03-24 | 1996-09-25 | Mitsubishi Denki Kabushiki Kaisha | Signal discrimination circuit |
US5598466A (en) * | 1995-08-28 | 1997-01-28 | Intel Corporation | Voice activity detector for half-duplex audio communication system |
WO2000017856A1 (en) * | 1998-09-18 | 2000-03-30 | Conexant Systems, Inc. | Method and apparatus for detecting voice activity in a speech signal |
Family Cites Families (40)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0243561B1 (en) * | 1986-04-30 | 1991-04-10 | International Business Machines Corporation | Tone detection process and device for implementing said process |
US5142677A (en) * | 1989-05-04 | 1992-08-25 | Texas Instruments Incorporated | Context switching devices, systems and methods |
US4969118A (en) * | 1989-01-13 | 1990-11-06 | International Business Machines Corporation | Floating point unit for calculating A=XY+Z having simultaneous multiply and add |
US5325425A (en) * | 1990-04-24 | 1994-06-28 | The Telephone Connection | Method for monitoring telephone call progress |
US5341374A (en) * | 1991-03-01 | 1994-08-23 | Trilan Systems Corporation | Communication network integrating voice data and video with distributed call processing |
US5241492A (en) * | 1991-05-06 | 1993-08-31 | Motorola, Inc. | Apparatus for performing multiply and accumulate instructions with reduced power and a method therefor |
US5734789A (en) * | 1992-06-01 | 1998-03-31 | Hughes Electronics | Voiced, unvoiced or noise modes in a CELP vocoder |
JPH06150023A (en) * | 1992-11-06 | 1994-05-31 | Hitachi Ltd | Microcomputer and system thereof |
US5452289A (en) * | 1993-01-08 | 1995-09-19 | Multi-Tech Systems, Inc. | Computer-based multifunction personal communications system |
US5459814A (en) * | 1993-03-26 | 1995-10-17 | Hughes Aircraft Company | Voice activity detector for speech signals in variable background noise |
US6154828A (en) * | 1993-06-03 | 2000-11-28 | Compaq Computer Corporation | Method and apparatus for employing a cycle bit parallel executing instructions |
JP3532975B2 (en) * | 1993-09-27 | 2004-05-31 | 株式会社ルネサステクノロジ | Microcomputer and method of executing instructions using the same |
JPH07225593A (en) * | 1994-02-10 | 1995-08-22 | Fuji Xerox Co Ltd | Sound processor |
US5574927A (en) * | 1994-03-25 | 1996-11-12 | International Meta Systems, Inc. | RISC architecture computer configured for emulation of the instruction set of a target computer |
US5499272A (en) * | 1994-05-31 | 1996-03-12 | Ericsson Ge Mobile Communications Inc. | Diversity receiver for signals with multipath time dispersion |
US5541917A (en) * | 1994-09-12 | 1996-07-30 | Bell Atlantic | Video and TELCO network control functionality |
JP3579843B2 (en) * | 1994-10-24 | 2004-10-20 | 日本テキサス・インスツルメンツ株式会社 | Digital signal processor |
US5530663A (en) * | 1994-11-14 | 1996-06-25 | International Business Machines Corporation | Floating point unit for calculating a compound instruction A+B×C in two cycles |
WO1996018153A1 (en) * | 1994-12-08 | 1996-06-13 | Intel Corporation | A method and an apparatus for enabling a processor to access an external component through a private bus or a shared bus |
US5727194A (en) * | 1995-06-07 | 1998-03-10 | Hitachi America, Ltd. | Repeat-bit based, compact system and method for implementing zero-overhead loops |
FI110826B (en) * | 1995-06-08 | 2003-03-31 | Nokia Corp | Eliminating an acoustic echo in a digital mobile communication system |
FI105001B (en) * | 1995-06-30 | 2000-05-15 | Nokia Mobile Phones Ltd | Method for Determining Wait Time in Speech Decoder in Continuous Transmission and Speech Decoder and Transceiver |
JP2931890B2 (en) * | 1995-07-12 | 1999-08-09 | 三菱電機株式会社 | Data processing device |
US5983253A (en) * | 1995-09-05 | 1999-11-09 | Intel Corporation | Computer system for performing complex digital filters |
US6058408A (en) * | 1995-09-05 | 2000-05-02 | Intel Corporation | Method and apparatus for multiplying and accumulating complex numbers in a digital filter |
JP3767930B2 (en) * | 1995-11-13 | 2006-04-19 | 沖電気工業株式会社 | Information recording / reproducing method and information storage device |
US5826072A (en) * | 1995-11-13 | 1998-10-20 | Oasis Design, Inc. | Pipelined digital signal processor and signal processing system employing same |
US5881060A (en) * | 1996-05-30 | 1999-03-09 | Northern Telecom Limited | Integrated cellular voice and digital packet data telecommunications systems and methods for their operation |
US5774849A (en) * | 1996-01-22 | 1998-06-30 | Rockwell International Corporation | Method and apparatus for generating frame voicing decisions of an incoming speech signal |
JP3658072B2 (en) * | 1996-02-07 | 2005-06-08 | 株式会社ルネサステクノロジ | Data processing apparatus and data processing method |
US5940785A (en) * | 1996-04-29 | 1999-08-17 | International Business Machines Corporation | Performance-temperature optimization by cooperatively varying the voltage and frequency of a circuit |
DE19625569A1 (en) * | 1996-06-26 | 1998-01-02 | Philips Patentverwaltung | Signal processor |
WO1998006030A1 (en) * | 1996-08-07 | 1998-02-12 | Sun Microsystems | Multifunctional execution unit |
DE19639703C2 (en) * | 1996-09-26 | 1999-05-20 | Siemens Ag | Method and arrangement for echo cancellation |
KR100201776B1 (en) * | 1996-11-06 | 1999-06-15 | 김영환 | Adaptive equalizer |
US5880984A (en) * | 1997-01-13 | 1999-03-09 | International Business Machines Corporation | Method and apparatus for performing high-precision multiply-add calculations using independent multiply and add instruments |
DE69831991T2 (en) * | 1997-03-25 | 2006-07-27 | Koninklijke Philips Electronics N.V. | Method and device for speech detection |
US6029267A (en) * | 1997-11-25 | 2000-02-22 | Lucent Technologies Inc. | Single-cycle, soft decision, compare-select operation using dual-add processor |
US5995122A (en) * | 1998-04-30 | 1999-11-30 | Intel Corporation | Method and apparatus for parallel conversion of color values from a single precision floating point format to an integer format |
US6330660B1 (en) * | 1999-10-25 | 2001-12-11 | Vxtel, Inc. | Method and apparatus for saturated multiplication and accumulation in an application specific signal processor |
- 2001
- 2001-08-23 US US09/938,104 patent/US20020116186A1/en not_active Abandoned
- 2001-09-05 AU AU2001288793A patent/AU2001288793A1/en not_active Abandoned
- 2001-09-05 CN CNA018184464A patent/CN1473321A/en active Pending
- 2001-09-05 CA CA002422197A patent/CA2422197A1/en not_active Abandoned
- 2001-09-05 EP EP01968553A patent/EP1319226A2/en not_active Withdrawn
- 2001-09-05 WO PCT/US2001/027596 patent/WO2002021507A2/en not_active Application Discontinuation
Non-Patent Citations (1)
Title |
---|
VARADA S ET AL: "HARDWARE STRATEGIES FOR END-POINT DETECTION" PROCEEDINGS OF THE SOUTHCON CONFERENCE. FORT LAUDERDALE, MAR. 7 - 9, 1995, NEW YORK, IEEE, US, 7 March 1995 (1995-03-07), pages 163-167, XP000531183 ISBN: 0-7803-2577-X * |
Also Published As
Publication number | Publication date |
---|---|
CN1473321A (en) | 2004-02-04 |
US20020116186A1 (en) | 2002-08-22 |
WO2002021507A3 (en) | 2002-05-30 |
EP1319226A2 (en) | 2003-06-18 |
AU2001288793A1 (en) | 2002-03-22 |
CA2422197A1 (en) | 2002-03-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA2422203C (en) | Tone detection for integrated telecommunications processing | |
US20020116186A1 (en) | Voice activity detector for integrated telecommunications processing | |
US6738358B2 (en) | Network echo canceller for integrated telecommunications processing | |
US6598155B1 (en) | Method and apparatus for loop buffering digital signal processing instructions | |
US6446195B1 (en) | Dyadic operations instruction processor with configurable functional blocks | |
JPH0585149U (en) | Digital speaker telephone | |
US6832306B1 (en) | Method and apparatus for a unified RISC/DSP pipeline controller for both reduced instruction set computer (RISC) control instructions and digital signal processing (DSP) instructions | |
CN1971710B (en) | Single-chip based multi-channel multi-voice codec scheduling method | |
Ogunfunmi et al. | Speech over VoIP networks: Advanced signal processing and system implementation | |
WO2002021780A1 (en) | Integrated telecommunications processor for packet networks | |
JP2003324372A (en) | Improved acoustic echo cancellation | |
Nishitani et al. | LSI signal processor development for communications equipment | |
EP0122594A2 (en) | Line circuit with echo compensation | |
CN1972308A (en) | Method for opening channel of DSP-based single-chip multi-channel multi-voice codec | |
Mishra et al. | Efficient hardware-software co-design for the G. 723.1 algorithm targeted at VoIP applications | |
KR20000057739A (en) | Subband echo canceller and method therefor | |
JPH11331047A (en) | Multiple channel echo erasing device raving compander | |
KR20020016650A (en) | Method and apparatus for combining corded and cordless telephones for telephone conferencing and intercom | |
KR19980048460A (en) | Echo Cancellation Method and Apparatus Using Linear Prediction Coefficient in Mobile Communication Systems | |
Dahlberg | Evaluation of a Floating Point Acoustic Echo Canceller Implementation | |
CN117577123A (en) | Echo cancellation device based on audio coder and decoder and electronic terminal | |
JPH1041860A (en) | Echo canceller | |
Park | 170 MIPS Real-Time Adaptive Digital Filter Board | |
Andreyev et al. | DSP UNITS FOR IP-TELEPHONY SYSTEMS | |
Casale et al. | Optimal architectural solution using DSP processors for the implementation of an ADPCM transcoder |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states | Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PH PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW |
AL | Designated countries for regional patents | Kind code of ref document: A2 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
121 | Ep: the epo has been informed by wipo that ep was designated in this application | |
AK | Designated states | Kind code of ref document: A3 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PH PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW |
AL | Designated countries for regional patents | Kind code of ref document: A3 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | |
WWE | Wipo information: entry into national phase | Ref document number: 2422197 Country of ref document: CA |
WWE | Wipo information: entry into national phase | Ref document number: 2001968553 Country of ref document: EP |
WWE | Wipo information: entry into national phase | Ref document number: 018184464 Country of ref document: CN |
WWP | Wipo information: published in national office | Ref document number: 2001968553 Country of ref document: EP |
REG | Reference to national code | Ref country code: DE Ref legal event code: 8642 |
NENP | Non-entry into the national phase | Ref country code: JP |
WWW | Wipo information: withdrawn in national office | Ref document number: 2001968553 Country of ref document: EP |