REAL-TIME SOFTWARE RECEIVER
STATEMENT OF GOVERNMENT INTEREST This invention was made with United States Government support from the Office of Naval Research (ONR) under contract number N00014-02-J-1822 and from the National Aeronautics and Space Administration (NASA) under contract numbers NCC5-563, NAG5- 11819, and NAG5-12089. The United States Government has certain rights in the invention.
BACKGROUND OF THE INVENTION
This invention relates generally to software radio receivers, and more specifically to a software receiver for positioning systems.
A typical positioning system receiver, such as is used in the Global Positioning System (GPS), includes an antenna, a radio frequency (RF) section, a correlator, a signal tracking and demodulation component, and a component to compute the navigation solution. The antenna, which is possibly followed by a pre-amplifϊer, receives L-band GPS signals. The RF section filters and down converts the GHz GPS signal to an intermediate frequency in the MHz range. The RF section also digitizes the signal. The correlator separates the down- converted signal into different channels (ten or more in modern receivers) allocated to each satellite. For each satellite, the correlator mixes the Doppler-shifted intermediate frequency signal to baseband by correlating it with a local copy of the carrier replica signal and it distinguishes the particular satellite by correlating the signal with a pseudo-random number (PRN) code. Software routines cause the carrier replica and PRN replica signals to track the actual received signal, extract the navigation message, and compute the navigation solution. Baseband mixing is a multiplication of an input signal by a complex exponential where the frequency of the complex exponential approximately matches that of the input signal. The resultant signal is centered at baseband. A complex signal can be broken down into cosine and sine signal components, resulting in separate in-phase and quadrature components. The frequency of the baseband mixed signal must be controllable to within a few millihertz in the case of a phase-locked loop for use in a precision navigation system, and
the baseband mixed signal must have a continuously varying phase. i a hardware correlator, local oscillators generate cosine and sine signal components that have precise frequency control and a continuous phase. Generating cosine and sine signal components on the fly with the correct frequency and phase is too time consuming to be feasible for a software correlator. Instead, the software correlator generates cosine and sine signal components on a grid of frequencies off-line. These signal components must be stored on a time grid of points sampled at the RF front-end sampling frequency, for example, at 5.714 MHz for one particular RF front-end hardware configuration, and the signals must last for a typical accumulation period , e.g., for a 0.001 second coarse/acquisition (C/A) PRN code period when working with GPS LI civilian signals. It takes tens of gigabytes of memory or more in order to brute-force store all frequencies on a one mHz grid ranging from -10 KHz to +10 KHz, which is the needed frequency range when tracking GPS satellites from a terrestrial receiver, and additional storage is required to store a grid of possible. starting phases at each frequency point.
PRN code mixing is a multiplication of a baseband mixed signal by a prompt +1/-1 PRN code or by a +2/0/-2 early-minus-late PRN code, where the code timing and frequency approximately match that of the input signal. The resultant signal is a constant in the case of prompt PRN code mixing, and an approximately linear function of the code timing error in the case of early-minus-late mixing. A receiver accumulates both of these correlation outputs. The magnitude of the prompt accumulation indicates signal strength and whether a signal has been detected, and its in-phase (real) and quadrature (imaginary) components are used to measure carrier phase and Doppler shift. The magnitude of the early-minus-late accumulation measures the code timing error; it will be zero when the timing error is zero.
The code phase of the baseband mixing signal must be controllable to within a percent or less of a PRN code chip for use in a precision navigation system. In a hardware correlator, local oscillators generate the prompt and early-minus-late PRN code replicas. A software correlator can either compute and store PRN code replicas, or compute them in real-time.
The current Global Positioning System is slated to realize expanded capabilities that include new civilian codes on the L2 frequency, a new L5 frequency, and new codes (M-cpde, CL and CM codes) on the L2 frequency. Some of these upgrades are slated to start within one to three years. A hardware correlator requires hardware modifications in order to use these
new signals. In the near term, a receiver designer will be faced with a complex trade-off in order to decide whether the extra complexity is worth the improved performance that will accrue only very slowly as new GPS satellites replace older models. One way to avoid the complex trade-off is to use a software receiver that can receive and process new signals without the need for a new correlator chip set.
A software receiver is flexible because its software components can be easily modified. One application of a software receiver is to merge together numerous devices that use wireless digital communication protocols to form a single-device. For example, a cell phone, GPS receiver, and Personal Data Assistant (PDA) could become a single device that plays the role of all three. Another use of a software receiver is to shorten development and to-market times for new wireless devices. For example, as new frequencies and codes are added to GPS, a software receiver having a software correlator simply needs to be reprogrammed, while a hardware approach would require a brand new correlator chip design. New PRN codes can be used simply by making software changes. Thus, software receiver technology lessens the risks involved for designers during the period of transition to the new signals. Furthermore, a software receiver could be reprogrammed to use the Galileo system (European GPS) or GLONASS (Russian GPS).
In the recent past, GPS software receivers have been developed that either post-process stored signals or operate in real-time. Previous real-time software receivers function with a limited number of channels (4-6) or require high-end computer speeds or digital signal processor (DSP) chips such as are disclosed in Real-Time GPS Software Radio Receiver, Akos et al., ION NTM 2001, 22-24 January 2001, Long Beach, CA, pp. 809-816 (Akos 2001a), and Global Positioning System Software Receiver (gpSrx) Implementation in Low Cost/Power Programmable Processors, Akos et al., ION GPS 2001, 11-14 September 2001, Salt Lake City, UT, pp. 2851-2858, both incorporated herein in their entireties by reference. Therefore, it is an object of the present invention to create a software receiver that operates in real-time and is not restricted to a severely limited number of channels or to a very fast processor.
Another object of the present invention is to minimize the number of sine and cosine signal components that must be stored.
A further object of the present invention is to process incoming signals through bit-wise parallelism.
A still further object of the present invention is to process over-sampled signals by use of bit-wise parallelism. A still further object of the present invention is to use very long over-sampled PRN codes efficiently in a bit-wise parallel software receiver.
SUMMARY OF THE INNENTION The objects set forth above as well as further and other objects are addressed by the present invention. The solutions and advantages of the present invention are achieved by the illustrative embodiment described herein below.
The software receiver system and method of the present invention enable the efficient execution of a set of algorithms, that perform software correlation on data sampled from incoming channels, on a general purpose processor. The system and method of the present invention provide for either PRN code storage or computation of PRN codes in real-time. PRN code storage is appropriate for PRN codes that have short periods, such as the GPS coarse/acquisition codes, which are 1023 chips long. In this case, the system and method of the present invention pre-compute over-sampled replicas of entire PRN code periods and store them for orderly and efficient retrieval, such as in a table. This table can include a selection of code start times as measured relative to the sample times at which RF data are available from the receiver front end. There is a separate table for each unique PRN code.
The system and method of the present invention can also generate over-sampled versions of the prompt and early-minus-late PRN codes in real-time through use of an over- sampling function described herein. The values of the over-sampling function can be located in a specially designed table that can be generic across PRN codes. The length of the specially designed table can be independent of the length of the PRN code whose replica is being used to process a given received signal. The system and method of the present invention include techniques for efficiently calculating indices into the specially designed table that enable rapid, real-time table look-up.
The system of the present invention includes a software correlator that can mix the received signal to baseband, compute baseband PRN correlations through bit-wise parallelism and look-up tables using either the tabulated or real-time-generated PRN codes, and compute accumulations through bit-wise parallelism and processor instructions or look-up tables. Bit- wise parallelism allows the processing of multiple data samples simultaneously as the multiple bits of a given word of computer data. For example, for 32-bit words, the software correlator can process up to 32 samples at a time. Bit-wise parallelism can optimally operate when each signal in question can be represented by only a few bits, which is normally the case in RF digital signal processing of navigation signals. The bit-wise parallel operations of the present invention can save computation time in comparison to integer mathematical correlation operations. If, for example, four accumulations are required per sample, integer mathematics requires six multiplications and four additions per sample (except for the last sample). At a sampling rate of, for example, 5.714 MHz this translates into 57, 140 integer operations per PRN code period. In the illustrative embodiment, 33,500 bit-wise parallel operations are necessary per PRN code period when the RF signal has a 2-bit representation. This operation count is further reduced to approximately 16,750 bit-wise parallel operations per PRN code period when the RF signal has a 1-bit representation. Thus, there can be a savings of almost a factor of two to almost a factor of four in the operation count. The system and method of the present invention also include a table of pre-computed baseband mixing sine waves, algorithms that can produce correlation accumulation outputs that are equivalent to what would be produced by a continuously variable sine wave, and a method of use of the table and algorithms. Thus, in the present invention, a relatively small set of sine wave values need to be pre-computed and saved, which can conserve computer memory and processing time.
The present invention also includes a system and method for tracking the phase of PRN code replicas in software in order to track the timing of any given "chip" of the PRN code replica as measured with respect to a pre-specified set of sample times at which the basic raw data comes out of the RF front end (a chip is an element of a PRN code). The PRN code phase is kept track of via a variable for each channel, that indicates the PRN code start time
with respect to the RF sample times. The system and method of the present invention allow for the synchronization of the measurements of PRN code phase, carrier phase, and carrier frequency for each satellite relative to these sample times.
The method for tracking the phase of each PRN code replica and the phase of each carrier replica includes the steps of latching all the C/A code phases, carrier phases, epoch counters, and carrier frequencies for each satellite at a pre-specified time, and computing the pseudo range to each satellite using the C/A code phase and epoch counters. The method also includes the step of tracking and updating code and carrier phases by estimating code chipping rate and carrier Doppler shift inputs. The method further includes the step of computing the code phase at the pre-specified time for each satellite as a function of the updated code chipping rate and the pre-specified time. The method further includes the step of computing the carrier phase at the pre-specified time as a function of the updated carrier phase, the Doppler shift, and the pre-specified time. The timing of the PRN code phase (or chip location) is the most fundamental of GPS measurements for use in navigation data processing. The monitoring of these times in software allows complete control of the precision with which they can be measured, and it allows precise synchronization of these times with the measurement times of data from other sensors, such as inertial measurement units. This feature gives an enhanced ability to develop what are known as deeply coupled systems that must fuse GPS data with data from other types of sensor systems. The software correlator of the present invention can advantageously be easily adapted to accept signals at any frequency, new PRN codes, or even signals for different types of devices. Thus, the same processing hardware could use the software correlator to implement such devices as a GPS receiver, a cell phone, or both. To allow for new codes, new frequencies, and new types of functionality, small changes can be made in the software correlator, or different versions of the software correlator can be run on the same processor. Hardware-correlator-based receivers of the prior art can deal only with frequencies and PRN codes that are hard-wired into their designs. Also, the system and method of the present invention could be implemented within systems such as GLONASS receivers, cell phones and cell base stations, pagers, wireless Ethernet (e.g. 802.11x standards), Bluetooth™, Blackberry® wireless internet devices, and satellite radio/phones (e.g. INMARSAT®). In
fact, the system and method of the present invention are applicable to any sort of telecommunication system/device that uses spread spectrum, code division multiple access (CDMA) PRN codes for the transmission of information, either wired or wireless.
For a better understanding of the present invention, together with other and further objects thereof, reference is made to the accompanying drawings and detailed description. The scope of the present invention is pointed out in the appended claims.
DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWING FIG. 1 is a schematic block diagram of the hardware environment of a typical software receiver;
FIGs. 2A and 2B are schematic diagrams of bit-wise mappings of signal and carrier replica sign and magnitude bits to computer data words;
FIG. 2C is a graphic representation of a plot of bit- wise parallel radio frequency signal and PRN code replica storage and mixing; FIG. 2D is a graphic representation of a plot of sections of prompt, early, late, and early-minus-late PRN code signals and 16-bit word representations of their over-sampled equivalents;
FIGs. 3 A and 3B are data flow diagrams illustrating the bit-wise parallelism process (replicated twice, once for the in-phase carrier replica and once for the quadrature carrier replica) of the present invention;
FIG. 3C is a graphic representation of a plot of a prior art optimal 2-bit representation of a sine wave presented to enhance the reader's understanding of the present invention;
FIGs. 4A and 4B are flowcharts of the method for computing correlation accumulations through bit-wise parallel computations of the present invention; FIG. 5 is a schematic diagram of a look-up table layout as a function of code time offset and chip bit pattern;
FIG. 6 is a graphic representation of a plot illustrating the timing relationship between data sample words and the sequence of prompt code chips that defines an accumulation interval;
FIG. 7 is a flowchart of the method for computing bit-wise parallel representations of the over-sampled prompt PRN code replica and the over-sampled early-minus-late PRN code replica for an entire accumulation interval using the real-time over-sampled PRN code generation algorithm. FIG. 8 is a graphic representation of a plot that illustrates the location in time at which the code phase of each signal is computed; and
FIGs. 9A and 9B are graphic representations of plots of correlations of the true sampled code with prompt (FIG. 9A) and early-minus-late (FIG. 9B) versions of the true and table look-up codes, the latter being generated by the new real-time over-sampled PRN code generator.
DETAILED DESCRIPTION OF THE INVENTION
The present invention is now described more fully hereinafter with reference to the accompanying drawings, in which the illustrative embodiment of the present invention is shown. The following configuration description is presented for illustrative purposes only. Any computer configuration satisfying the speed and interface requirements herein described may be suitable for implementing the system of the present invention. The equations herein are stated in general terms, but have parameters that are specific to the GPS LI C/A signal for illustrative purposes only. For example, the 0.001 sec. accumulation interval seen in many of the equations is the nominal C/A code period. Also, the C/A PRN code of the illustrative embodiment can be replaced by the PRN code of any other CDMA signaling system.
By way of introductory explanation, RF signal processing equations and terms are herein provided. The time-domain LI C/A signal received from, for example, a satellite, is represented by: y(tι) = j A]D]kCJ[ .00l -^^-))co [ωIFt1 - ,(*.)] + », (1)
where t, is the sample time, A. is the amplitude, Djk is the navigation data bit, C^f is the C/A code, τ k and τ k+. are the start times of the received kth and k + V C/A code
periods, ω is the intermediate frequency corresponding to the LI carrier frequency, }(t,) is the carrier phase perturbation due to accumulated delta range, n. is the receiver noise, and the subscript j refers to a particular GPS satellite. The summation is over all visible GPS satellites. The negative sign in front of φ( comes from the high-side mixing that occurs in the RF front-end that has been used in the illustrative embodiment. The signal in equation (1) is the output of a typical RF front end.
A GPS receiver works with correlations between the received signal and a replica of it The correlations are used to acquire and track the signal. The replica is composed of two parts, the carrier replica and the C/A PRN code replica. Two carrier replica signals are used, an in-phase signal and a quadrature signal. When mixed with the received signal from the RF front end they form the in-phase and quadrature baseband mixed signals represented by: ~[φ
jk+ώ
Domk(t
τ -χ
Jk)] } (2)
where equations (2) and (3) apply during the k'h C/A code period. In these equations χjk and Tjk+ϊ are the receiver's estimates of the start times of the k and k+V code periods, φ k is the estimated carrier phase at time χ≠ , and >Domk 1S trΛe estimated carrier Doppler shift during the kth code period.
A typical receiver computes the estimates χjk, χjte+1, ^, a d β)Oomι by various conventional means that are described in GPS Receivers, A J. Nan Dierendonck, Global Positioning System: Theory and Applications. B. W. Parkinson and J. J. Spilker, Jr , Eds., Vol. I, American Institute of Aeronautics and Astronautics, 1996, Chapter 8, pp. 329-406 (Dierendonck), incorporated herein in its entirety by reference. These include open-loop acquisition methods and closed-loop signal tracking methods such as a delay-locked loop to
compute _fc and jM and a phase-locked loop or a frequency-locked loop to compute φ k and Soopp ■ The software receiver developed herein uses conventional techniques for forming these estimates.
Both prompt and early-minus-late correlations are needed to track the carrier frequency, carrier phase, and code phase in a GPS receiver. A typical receiver uses the PRN code and carrier replicas to compute the following in-phase and quadrature correlation accumulations:
Iβ(A) = ∑ y(tj)CJ[0.001(^r—^)] cosiωσt, -[φjk + ώDιφt,Jk(t, -fjk)]} (4)
β (Δ) -[φ
jk + ώ
Doppjk(t, -χ
jk)]} (5)
where is the index of the first RF front-end sample time that obeys iβ ≤ tik and Nk+1 is the total number of samples that obey x β < t_ < i +\ ■ The time offset A causes the replica
PRN code to play back early if it is positive and late if A is negative. The prompt correlations are defined by equations (4) and (5) with Δ = o. The early-minus-late correlations are
_ ere
eml is the spacing between the early and late PRN carrier replicas. The present invention described herein is an efficient technique for the receiver to accumulate
}k and
]k in software. Referring now to FIG. 1, the operational platform of the software receiver 10 of the present invention includes an antenna 11, conventional RF front-end 13, a data acquisition (DAQ) system 17, a microprocessor 16, a software correlator 19, and application-specific code 15. Conventional RF front-end 13 interfaces with antenna 11 and with (DAQ) system 17. DAQ system 17 includes a system of shift registers and a data buffer. Microprocessor 16 executes software correlator 19, which includes a set of specially developed bit-wise parallel algorithms, and application-specific code 15, such as the GPS navigation and tracking functions. In the illustrative embodiment, conventional GPS software functions (signal
tracking, data extraction, navigation solution, etc.) are provided by the MITEL® GPSArchitect software ported to RTLL UX® (see A Coming of Age for GPS: A R7UNUX BASED GPS RECEIVER, Ledvina et al.. Proceedings of the Workshop on Real Time Operating Systems and Applications and Second Real Time Linux Workshop Tin conjunction with IEEE RTSS 2000^ November 27-28, 2000), but can be provided by any equivalent configuration.
Continuing to refer to FIG. 1, conventional RF front-end 13 can, for example, be a MITEL® GP2015 RF front-end, which down converts the nominal 1.57542GHz GPS signal 12 to an intermediate frequency of (88.54/63) x 106 Hz ~ 1.4053968254 MHz and then performs analog-to-digital conversion. The resultant, digitized signal data 21 has a predetermined number of bits/sample, such as two binary bits/sample, a sign bit and a magnitude bit, or one bit/sample. The shift registers in the DAQ system 17 parallelize the magnitude and sign data bit streams into separate words, which the DAQ system 17 reads into the memory of microprocessor 16 using DMA. To make the process of reading data into the microprocessor 16 more efficient and to prepare for efficient correlation calculations, DAQ system 17 can read a pre-specified number of bits of buffered samples, such as thirty-two bits, at a time. The exemplary thirty-two bits include sixteen sign bits and sixteen magnitude bits.
Referring now to FIGs. 1 and 2A, the shift registers in DAQ system 17 (FIG. 1) buffer signal data 21 (FIGs. 1 and 2A) and pack signal sign 21 A (FIG. 2A) and signal magnitude 2 IB (FIG. 2A) into separate words, that represent the integer values ±1 and ±3 as is shown in Table 1. In the case of a 1-bit signal, the bit stream representing the samples is packed into successive words to prepare the signal for bit-wise parallel processing. DAQ system 17 also provides for accurate timing by, for example, synchronizing signal sign 21 A and signal magnitude 21B to a (40/7) x 10
6Hz ~ 5.714 MHz clock signal, which can be, for example, a third output from conventional RF front-end 13 (FIG. 1). DAQ system 17 can convert the 5.71424MHz clock signal down to 357.14KHz by use of, for example, a divide-by-16 counter for a 16-bit word, which can provide a signal indicating when the buffer is full. DAQ system 17 can use any method for providing a buffer full indication.
Table 1. The sign and magnitude combinations of the sample RF output of the conventional RF front-end and their corresponding values.
With further reference to FIG. 1, in the illustrative embodiment, the DAQ system 17 can consist of an interface card and driver software that can be compatible with, for example, a 1.73GHz AMD ATHLON® processor running RTLINUX®, but could be compatible with any operating system and any processor that can accommodate real-time operations. The interface card can, for example, be a NATIONAL WSTOUMENTS® PCI-DIO-32HS digital I/O card. Pertinent features of this card are the thirty-two digital input lines, DMA, and availability of a driver for RTLINUX®, perhaps gotten from the suite of open source drivers and application interface software for interface cards known as COMEDI (COntrol and MEasurement and Device Interface). Modifications to the conventional COMEDI driver for the PCI-DIO-32HS card include increasing the number of input bits from sixteen to thirty- two, enabling DMA, and modifying the driver to support continuous interrupt-driven acquisition.
With still further reference to FIG. 1, microprocessor 16 can be, for example, a 1.73 GHz AMD ATHLON™ processor running the RTXINUX® operating system, but any operating system and processor that can accommodate real-time operations can be used. Low latency interrupt responsiveness, the ability to execute threads at regular intervals, with the kernel having a possibility of being the lowest priority thread, and reliable execution of time- critical code are among features of an operating system that could enhance the performance of the system of the present invention. The use of RTLINUX® is presented herein for illustrative purposes only.
Continuing to refer to FIG. 1, analogous to a hardware correlator that takes input directly from the RF front end in serial fashion, software correlator 19 reads from a shared memory buffer that both software correlator 19 and DAQ system 17 can access, the former to read data, and the latter to write data. The shared memory buffer can be implemented as a
DMA memory space and a circular buffer. In the illustrative embodiment in which the system and method of the present invention are used in a GPS (or similar) environment, microprocessor 16 can store the most recent twenty-one milliseconds of signal data 21 (FIGs. 1 and 2A) in the circular buffer, but could store more or less. The present invention does not fix the size of the circular buffer, nor the amount of RF data that can be stored there. The circular buffer allows the processing of code periods that start and stop at different times for different satellites during different iterations of a regularly scheduled program thread. DMA memory space can be written to directly by DAQ system 1 using a DAQ software driver, which fills the circular buffer. Communication between software correlator 19 and application-specific code 15 can be performed using operating system-provided shared memory capability. For example, the mbuff driver, included with RTLINUX®, can be used to create and manage this shared memory space. Any memory management system that accommodates real-time processing can be used. If the mbuff driver is used, kernel modules can share memory and the kernel can be restricted from swapping the shared memory space to long-term storage.
Continuing with the analogy to hardware correlation, and still referring primarily to FIG. 1, in hardware correlation, the correlator receives frequency and phase information from tracking and acquisition loops that are part of application code, and Numerically Controlled Oscillators (NCOs) generate signals that correspond to the written frequencies and phases. In contrast, software correlator 19 includes simulated carrier and code NCOs that receive their frequency commands from application-specific code 15. Software correlator 19 uses these frequency commands to reconstruct carrier replica signal 25 (FIG. 3 A) and prompt PRN code 29 and early-minus-late PRN code 35 (FIG. 3A) which it mixes with the signal data 21 (FIG. 2A) resulting in fully mixed prompt integrand 31 and fully mixed early-minus-late integrand 33 (FIG. 3A).
To further continue the analogy, a hardware correlator generates in real-time a particular C/A code replica at the correct Doppler shifted frequency and phase. In contrast, software correlator 19 can generate C/A codes off-line and store them in a memory table, the pre- computed over-sampled PRN code table 28 (FIG. 3A). The pre-computed over-sampled PRN code table 28 is used to select PRN codes with the correct timing relationship to the sample
times of signal data 21 (FIG. 3 A). The codes are then used to form correlations with baseband mixed signals 23 (FIG. 3 A), the result from which is summed to produce the standard in-phase and quadrature, summed prompt accumulation 45 (FIG. 3B) and summed early-minus-late accumulation 47 (FIG. 3B) that are equivalent to what would be produced by a continuously variable sine wave. These are provided to application-specific code 15, such as conventional GPS software that executes signal tracking and navigation functions. In a second approach, software correlator 19 can generate the PRN carrier replicas on-line at the code chipping rate and can use tabulated functions to re-sample the code at the sample rate of the RF front-end for purposes of calculating accumulations. Real-time over-sampled PRN code generator 30A (FIG. 3A) is used in place of pre-computed over-sampled PRN code table 28 (FIG. 3 A) in this latter approach. This latter method can be used with longer PRN codes, such as the new civilian GPS L2 CL codes.
With still further reference to FIG. 1, since the received LI raw signal 12 can have an uncertain carrier phase, software correlator 19 computes both in-phase (I) and quadrature (Q) accumulations, as defined in equations (4) and (5). Software correlator 19 begins the accumulation process by using carrier replica signal 25 (FIG. 3A), which it gets from pre- stored carrier replica table 30 (FIG. 3 A). The carrier replicas in this table fall on a rough frequency grid, and they all start with a particular phase, for example a phase of zero. The baseband mixing process involves selecting a carrier replica signal 25 (FIG. 3A) from carrier replica table 30 (FIG. 3A) that is at the frequency that is as close to "ideal" as possible. In the case of a 175 Hz grid spacing, the baseband mixing process selects a signal that is maximally within ±87.5 Hz of the ideal signal. The rough frequency grid can have a spacing of, for example, 175 Hz but could be larger or smaller depending on (a) the frequency range needed to cover, for example, ±10 Khz, (b) the amount of space available for storing pre-computed signals, and (c) other design decisions. The pre-computed signals in carrier replica table 30 (FIG. 3A) each may occupy 18032-bit words in order to be guaranteed to cover the full 5,714 RF front-end samples that occur in one PRN code period for any possible code period start time within the thirty-two samples of the initial word. Thus, 180*4=720 bytes could be required for each bit of each pre-computed carrier replica signal 25 that is stored in the table. The sine and cosine waves of carrier replica signals 25 (FIG. 3 A) each have 2-bit
representations, which translates into a storage requirement of 2880 bytes for the carrier replica signals 25 at a given Doppler shift. There are 115 Doppler shifts that may be stored in order to cover the -10 KHz to +10 KHz range with a 175 Hz grid spacing. This translates into 323 Kbytes of storage for all of the carrier replica signals 25. This approach avoids the need to pre-compute sine waves with a prohibitively large number of possible frequencies and phase offsets and it avoids the need to compute sine waves in real-time. Instead, the errors created by using pre-defined sine wave replicas are compensated for by post-processing calculations, as described below.
In any case, and continuing to refer to FIG. 1, the resulting accumulations are cos[(ω
IF -ω
Bk)(t, - t
θBk)] (6)
where ωgk is the grid frequency that is closest to the estimated frequency ωDomk and where t0gk is the time at which this carrier replica signal 25 (FIG. 3 A) has zero carrier phase
Software correlator 19 rotates these accumulations in order to create accurate approximations of what would have been computed had the estimated carrier phase time history in equations (4) and (5) been used:
Ilk(A) = Isk(A)cos(Aφavsk) + Qsk(A)sin(Aφavsk) (8)
Qjk(A) -= -Isk(A)sin(Aφ→) + Qak(A)cos(Aφm ) (9) where Aφ k is the average phase difference between the grid carrier phase and the estimated carrier phase averaged over the accumulation interval:
Note that equations (8), (9), and (10) are an illustrative example of how software correlator (19) can rotate its I and Q accumulations in order to correct for phase and frequency
errors in its table of pre-computed carrier replica signals. There exist other formulas that yield equivalent results, and this patent disclosure covers all such techniques. The validity of equations (8) and (9) is dependent on the assumption that
1 -cos[-(ω - ώDotøkKτβ+i - tj )] « 1 (11) For example, a 175 Hz grid spacing and a nominal C/A PRN code period of 0.001 sec yields a value on the left-hand side of inequality (11) of 0.04, which respects the assumed limit.
Note that equations (8) and (9) can be derived from equations (4) and (5) as follows. First, the carrier phase of the grid signal in the arguments of the cosine and sine terms of equations (6) and (7) are added to and subtracted from the arguments of the cosine and sine terms in equations (4) and (5). Next, trigonometric identities are used to split the resulting cosine and sine terms into sums of products of cosine and sine functions. In each product, one of the terms involves an argument like the arguments in the trigonometric terms in equations (6) and (7). The other trigonometric terms are then approximated by either cos(Aφ
m/gjk) or sin^Φ
avgk) ■ These approximations are valid because of the inequality in equation (11) and because the average of
interval is zero.
A decrease in the carrier to noise ratio C/No, which characterizes the receiver's sensitivity, is caused by the use of an inexact baseband mixing frequency. The worst-case decrease is expressed as a function of the frequency grid spacing Δ/ and is given by
where Δ/ is in units of Hz, and Eis the integration period. Thus, a A/ of 175 Hz causes aworst-case C/Νo loss of 0.11 dB for T= 0.001 sec.
Referring now to FIGs. 2A, 2B, and 3 A, PR codes (composed of prompt PRΝ codes 29 (FIG 3 A) and early-minus-late PRΝ codes 35 (FIG. 3A)) are either pre-computed or generated in real-time. Pre-computing involves, for each satellite, computing an entire PRΝ code, storing the PRΝ code appropriately for easy retrieval, and referencing the PRΝ code,
possibly by means of indices that are computed based on, for example, the incoming RF signal data 21 (FIGs. 2A and 3 A). Pre-computing can be most advantageously used when the PRN code is not very long. Generating PRN codes in real-time can be a more appropriate solution when the PRN codes are very long (and thus would require an unacceptable amount of storage), or perhaps when too many PRN codes are required for the amount of storage available, or for any other reason, but real-time PRN code generation can entail an additional computational cost. Both pre-computing and real-time determination of PRN codes are described herein with respect to a bit-wise parallel implementation.
Continuing to refer primarily to FIGs. 2A, 2B, and 3A, in order to perform bit-wise parallel operations, software correlator 19 (FIG. 1) stores pre-computed carrier replica sign 25 A (FIGs. 2B and 3 A) and carrier replica magnitude 25B (FIGs. 2B and 3 A) in data words. Simple representations of signal data 21 (FIGs. 2 A and 3 A) and carrier replica signal 25 (FIGs. 2B and 3 A) in terms of one, two, or more bits are suitable for using bit-wise parallelism to perform the calculations described herein. Bit-wise parallel operations work with representations of the data that store successive samples in successive bits of a word. For example, thirty-two samples (bits) of the RF front-end output are stored in two N= 32-bit words, signal sign 21 A (FIGs. 2 A and 3 A) and signal magnitude 2 IB (FIGs. 2A and 3 A), or simply a single 32-bit word if signal data 21 consists of a single data bit. Carrier replica sign 25A (FIGs. 2B and 3A) and carrier replica magnitude 25B (FIGs. 2B and 3A) are stored, for example in tables, in separate words, with each 32-bit word storing thirty-two sign or magnitude bits that tabulate to thirty-two successive samples of the corresponding cosine or sine wave. Similarly, tables can store prompt PRN code 29 (FIG. 3A) and early-minus-late PRN code 35 (FIG. 3 A), which are composed of prompt PRN code sign 29A (FIG. 3 A), early-minus-late PRN code sign 35A (FIG. 3A), and early-minus-late PRN code zero mask 35B (FIG. 3A). The data words that comprise the bit-wise parallel representations of these three signal types, the original RF signal data 21 (FIGs. 2A and 3A), the carrier replica signal 25 (FIGs. 2B and FIG. 3A), and the de-spreading prompt PRN code 29 (FIG. 3A) and early- minus-late PRN code 35 (FIG. 3A), are the inputs to the calculations of software correlator 19 (FIG. 1).
Further continuing primarily to refer to FIGs. 2A, 2B, and 3 A, many intermediate calculated quantities and at least three types of intermediate signals are also stored in bit-wise parallel format. First there are the in-phase and quadrature baseband mixed signals 23 (FIG. 3 A), whose 3-bit representations for the illustrative embodiment are stored as baseband mixed sign 23A (FIG. 3A), baseband mixed high magnitude 23B (FIG. 3A), and baseband mixed low magnitude 23C (FIG. 3A). The second bit-wise parallel signal type is the folly mixed integrand, of which there are four signals: in-phase and quadrature folly mixed prompt integrand 31 (FIG. 3 A) and in-phase and quadrature folly mixed early-minus-late integrand 33 (FIG. 3A). The former are stored as 3-bit representations in the illustrative embodiment as fully mixed prompt integrand sign 31 A (FIG. 3 A), fully mixed prompt integrand high magnitude 3 IB (FIG. 3 A), and folly mixed prompt integrand low magnitude 3 IC (FIG. 3 A). The latter are stored as 3.5-bit representations in the illustrative embodiment as folly mixed early-minus-late integrand sign 33 A (FIG. 3A), folly mixed early-minus-late integrand high magnitude 33B (FIG. 3A), folly mixed early-minus-late integrand low magnitude 33C (FIG. 3 A), and folly mixed early-minus-late integrand zero mask 33D (FIG. 3 A). This representation is called a 3.5-bit representation because the sign, high-magnitude, and low- magnitude bits are ignored if the corresponding zero mask bit has the value zero. The third bit-wise parallel signal type is a value word, of which there are two types: prompt integrand value words 27 (FIG. 3B) and early-minus-late integrand value words 37 (FIG. 3B). Each folly mixed integrand is used to construct value words, one word for each possible value that the integer integrand can take on. There are eight possible values for the integrands of the illustrative embodiment: -1, -2, -3, -6, 1, 2, 3, and 6 for the in-phase and quadrature folly mixed prompt integrands 31 (FIG. 3 A) and -2, -4, -6, -12, 2, 4, 6, and 12 for the in-phase and quadrature folly mixed early-minus-late integrands 33 (FIG. 3A). Each bit-wise parallel value word contains a one bit for each sample time when the integrand value equals the value of the value word, but it contains a zero bit for all other sample times. The storage of raw data and intermediate results in bit-wise parallel format allows the EXCLUSIVE OR operations that are involved in mixing to operate .on thirty-two samples at a time if microprocessor 16 (FIG. 1) has a bit-wise EXCLUSIVE OR command. Other bit-wise commands are used to perform additional software correlation operations in parallel on sets of two thirty-two samples.
At this point, the problem of over-sampling is introduced. Referring now to FIGs. 2C and 2D, the problem of over-sampling is illustrated with respect to bit- wise parallelism as follows. There is normally more than one RF data sample per PRN code chip. The three successive -1 values 73 (FIG. 2C) at sample times ti to t3 all occur during the same PRN code chip as do the four successive +1 values 75 (FIG 2C) at times through tη. The difference in the number of samples for the two code chips arises because the PRN code chip period is not an integer multiple of the sample period. Analogously, referring to FIG. 2D, where sample interval Δts 63 is less than actual PRN code chip length Δtc 65, over-sampling is indicated because the RF sampling frequency/ = lΛ t, is greater than the PRN code chipping frequency fc = II Atc. PRN codes for CDMA signaling are sequences of +1 and -1 values, the elements of which are chips. Over the time intervals of interest, a carrier replica progresses through its chips at a constant chipping rate of/c = 1/Atc chips/second. The time interval zltc is the actual PRN code chip length 65 (FIG. 2D). Software correlator 19 (FIG. 1) normally receives PRN code, and attempts to align it with the prompt replica version of the code, prompt PRN code 29 (FIG. 2D). It makes use of the signal's correlation with prompt PRN code 29 (FIG. 2D) and with early-minus-late PRN code 35 (FIG. 2D) in order to determine a chipping rate that tends to align prompt PRN code 29 (FIG. 2D) as desired. Conventional methods for determining ; are well-known in the art. Chips of early code 69B (FIG. 2D) start and stop 0.5Atemι seconds before the corresponding chips of prompt PRN code 29 (FIG. 2D), and the chips of late code 69C (FIG. 2D) start and stop 0.5 Atemt seconds after prompt PRN code 29 (FIG. 2D). Early-minus-late PRN code 35 (FIG. 2D) is the difference between early code 69B (FIG. 2D) and late code 69C (FIG. 2D). Example segments of these four types of replica codes are depicted in FIG. 2D.
Referring to FIGs. 1, 2A, 2C, and 2D software correlator 19 (FIG. 1) receives, through conventional RF front end 13 and DAQ system 17, signal data 21 (FIG. 1), the raw data 12 (FIG. 1) source of which is sampled at the rate/ = 1/Ats Hz. In order to process the resulting RF signal data 21, software correlator 19 (FIG. 1) needs prompt PRN code 29 (FIG. 2D) and early-minus-late PRN code 35 (FIG. 2D) replicas sampled at the same times as raw signal 12 (FIG 1). FIG. 2D depicts sixteen sample times as vertical dash-dotted lines. Referring to FIG. 2D, prompt PRN code 29 (FIG. 2D) can be represented by its prompt PRN code sign
29 A (FIG. 2D) at the sample times. The bit value one represents +1, and the bit value zero represents -1. Prompt PRN code sign 29 A (FIG. 2D), shown at the sixteen sample times ~ starting with three Is, continuing with ten Os, and finishing with another three Is — is a 16-bit word stored as the integer 215+214+213+22+21+2° = 57351. Early-minus-late PRN code 35 (FIG. 2D) requires a 1.5-bit representation. A zero mask bit is set to zero if early-minus-late PRN code 35 takes on the value zero, and it is set to one if early-minus-late PRN code 35 equals +2 or -2. Early-minus-late PRN code zero mask 35B (FIG. 2D) at sixteen sample times shown in FIG. 2D is equivalent to 213+212+22 = 12292. A 2's sign bit is set to one if early-minus-late PRN code 35 (FIG.2D) equals +2 at the sample time, and it is set to zero if the code equals -2. The 2's sign bit is irrelevant if the corresponding early-minus-late PRN code zero mask 35B (FIG. 2D) bit equals zero. Early-minus-late PRN code sign 35A (FIG. 2D) for sixteen sample times contains X values that indicate bits whose values are irrelevant because the corresponding early-minus-late PRN code zero mask 35B (FIG 2D) bits are zero. In an illustrative embodiment, all the X values become zero, thus the equivalent integer for early-minus-late PRN code sign 35 A (FIG. 2D) is 22 = 4.
Continuing to refer to FIG. 3A, an alternative to taking the prompt PRN code 29 and early-minus-late PRN code 35 from pre-computed over-sampled PRN code table 28 is to generate prompt PRN code sign 29 , early-minus-late PRN code sign 35N and early-minus- late PRN code zero mask 35B using real-time over-sampled PRN code generator 30A (FIG 3A). Shown in FIG. 3A are two circles and a loose arrow with a quarter circle pointer. These are the symbols for a switch and indicate the ability of the system to choose possible alternate sources of PRN code. Using the real-time over-sampled PRN code generator 30A includes a step of generating the PRN code chips in real-time by conventional means. For example, the GPS civilian L2 CL and CM codes are generated by a 27-bit feedback shift register (see The New L2 Civil Signal, R.D. Fontana et al., Proceedings of the ION GPS 2001. September 11- 14, 2001, Salt Lake City, UT, pp. 617-631). The method further includes the steps of choosing chip values from the PRN code, where the chip values correspond to a data interval that contains the samples of a data word and where the chips have a known timing relative to the data interval, transforming the relative timing into a time grid index, and translating the PRN code chip values and the time grid index for the data interval into the PRN code's over-
sampled bit-wise parallel format. These latter steps can be carried out efficiently by using a table look up function. One table each for prompt PRN code sign 29 A, early-minus-late PRN code sign 35A, and early-minus-late PRN code zero mask 35B can include integer values that constitute the bit-wise parallel representation of the PRN code for the sample times associated with the data word in question. Indices into each 1 -dimensional table are functions of (a) the time offset between the first PRN code chip and the first sample time of the given data word, and (b) the bit pattern of the PRN code chips that span the sample times of the data word. The sizes of the tables are independent of the period of the PRN code that is being over-sampled. The tables can be re-used for multiple PRN codes in a multi-channel receiver. The computation and use of the tables are discussed in more detail later.
Continuing with the description of bit-wise parallelism with respect to the operations of software correlator 19 (FIG. 1), and continuing to refer to FIG. 3A, the specially-developed algorithms described herein make use of bit-wise parallelism so that a single programming language statement, such as a C code command, can partially process up to thirty-two samples at a time. Previously referred-to carrier replica signal 25 in the form of cosine and sine signals are stored as binary carrier replica sign 25 A and carrier replica magnitude 25B. The format of this representation is defined in Table 2 and illustrated in FIG. 3C which is a reconstructed carrier and carrier replica in the form of representative sine signal 51 (FIG. 3C) shown in optimal 2-bit representation 53 (FIG. 3C) that has the minimum square error. The format of Table 2 assumes that the cosine and sine signals have an amplitude of approximately 2.4. Note that other representations, beyond 2-bit representation, are possible. In general, more bits yield a better SNR, but can also require a larger number of computations for the correlation operations.
baseband mixing carrier sine wave replicas and the values that they represent
Continuing to refer to FIG. 3A, multiplication of the RF front-end output representation, the signal sign 21 A and signal magnitude 21B, of Table 1 by the sine wave representation, carrier replica sign 25A and carrier replica magnitude 25B, of Table 2 yields baseband mixed signals 23, consisting of baseband mixed sign 23 N baseband mixed high magnitude 23B, and baseband mixed low magnitude 23C, that can take on the values -6, -3, -2, -1, +1, +2, +3, and +6, as shown in Table 3. Baseband mixed high magnitude 23B is simply signal magnitude 2 IB, and baseband mixed low magnitude 23 C is carrier replica magnitude 25B. Thus, these two magnitude bits are available without the need for computation. Baseband mixed sign 23A is the result of an EXCLUSIVE OR operation between signal sign 21 A and carrier replica sign 25 A. Notice how the relationship of the sign bit value with the actual sign gets reversed from that of Tables 1 and 2.
Table 3. Sign, high-magnitude, and low-magnitude combinations of the baseband- mixed signal and their corresponding signal values.
Continuing to refer to FIG. 3 A, and continuing to describe the bit-wise parallel algorithms, the required amount of storage for tables of pre-computed prompt PRN code 29 and early-minus-late PRN code 35 can be greatly reduced by making two simplifications. First, the prompt PRN code 29 is stored as prompt PRN code sign 29 A. This representation is shown in Table 4. The early-minus-late PRN code 35, on the other hand, is stored in a two-bit representation (actually a 1.5 bit representation): early-minus-late PRN code sign 35A and early-minus-late PRN code zero mask 35B, as denoted in Table 5. Note that the X in the first column of Table 5 indicates that zero or one can be placed in this location without affecting the corresponding code value. The X signifies a lack of effect of the sign bit on the code
value when the zero mask bit equals zero. This is why the early-minus-late PR code 35 representation is referred to as a 1.5-bit representation. This X value will affect the corresponding folly mixed early-minus-late integrand sign 33 A, but it will not affect any of the early-minus-late value words because the zero value in the corresponding zero mask location will null out the corresponding bit of all early-minus-late value words.
Table 4. Sign bits of the prompt C/A code and the corresponding prompt signal values.
Table 5. Sign and zero mask bit combinations of the early-minus-late PRN code 35 and the corresponding signal values.
Another simplification in the pre-computed over-sampled PRN code table 28, and continuing to refer to FIG. 3A, can be to ignore code Doppler shift variations. All signals in the table are assumed to have zero Doppler shift; i.e., all C/A codes in the table assume that Xjkw~Tjk- 0.001 sec. Note that the period of 0.001 is applicable for accumulations that use the foil 1023 chips of the C/A code only. Any other type of code or accumulation interval may have a different period. The code phase errors due to this assumption can be eliminated by choosing a replica code from the pre-computed over-sampled PRN code table 28 whose midpoint occurs at the desired midpoint time (f_*+£ _,fc+ι)/ . The only other effect of this assumption can be a small correlation power loss, which is no more than 0.014 dB if the magnitude of the Doppler shift is less than 10 KHz. The pre-computed over-sampled PRN code table 28 should include a selection of different phases, for example fourteen, as measured relative to a signal sample spacing of, for example, 175 nsec. This translates into a
code phase spacing of, for example, 12.5 nsec, which equals a pseudo range measurement digitization level of 3.8m, or a maximum measurement error of 1.9m. The number of phases in the pre-computed over-sampled PRN code table 28 is dependent upon the design of the system and no set number of phases is required by the present invention. Referring to FIG 6, suppose that pre-computed over-sampled PRN code table 28 stores over-sampled bit-wise parallel representations of chips C(l) through C(M). The table must allow for the retrieval of over-sampled bit-wise parallel code replicas for a range of start times of code chip C(l) that span the entire first data sample word in the accumulation interval W\ 95 (FIG 6). The table may contain code replicas whose different phases yield start times that span only a single sample interval of data word W\ 95 (FIG 6), which is only l/«s of the required number of start times. In this case the software correlator may apply bit shift operations to a tabulated PRN code replica from that sample interval in order to generate the over-sampled bit-wise parallel PRN code replica that applies when chip C(l) starts in a different sample interval of data word ^i 95 (FIG 6). Continuing to refer to FIG. 3 A, and further continuing to describe the bit-wise parallel algorithms, prompt PRN code 29 and early-minus-late PRN code 35 replicas can be mixed with the baseband mixed signals 23 to form folly mixed prompt integrand 31 by an EXCLUSIVE OR operation and bit re-definitions. An EXCLUSIVE OR between prompt PRN code sign 29A and baseband mixed sign 23 A produces folly mixed prompt integrand sign 31 A given in Table 6. The folly mixed prompt integrand high magnitude 3 IB and folly mixed prompt integrand low magnitude 3 IC are baseband mixed high magmtude 23B and baseband mixed low magnitude 23C, also given in Table 6. Note that the Table 6 representation is identical to that of Table 3 except for the inversion in the meaning of the sign bits. The number of magnitude bits is dependent upon the design of the system and no set number of magnitude bits is required by the present invention. A change in the number of magnitude bits will cause a change in the number of entries of the equivalent of Table 6 and it will affect the possible values of the integrand.
Table 6. Sign, high-magnitude, and low-magnitude bit combinations of the folly mixed prompt integrand 31 and its corresponding values.
Still continuing to refer to FIG. 3 A, the mixing of the early-minus-late PRN code 35 with the baseband mixed signals 23 forms fully mixed early-minus-late integrands 33. Fully mixed early-minus-late integrand sign 33 A is an EXCLUSIVE OR between early-minus-late PRN code sign 35A and baseband mixed sign 23 A. Fully mixed early-minus-late integrand high magnitude 33B and folly mixed early-minus-late integrand low magnitude 33C are, as above, baseband mixed high magnitude 23B and baseband mixed low magnitude 23C. Fully mixed early-minus-late integrand zero mask 33D is early-minus-late PRN code zero mask 35B. The resulting representation is given in Table 7. As in Table 5, each X entry in the table indicates that the corresponding bit can be either zero or one without affecting the corresponding integrand value.
1 1 1 1 +12
Table 7. Sign, high-magnitude, low-magnitude, and zero mask bit combinations of the fully mixed early-minus-late integrands 33 and their corresponding values.
Referring now to FIGs. 3A, 3B, 4A, and 4B, the method for computing in-phase and quadrature accumulations for every accumulation period, for example every millisecond for GPS C/A code, by use of bit-wise parallelism includes the steps of selecting carrier replica signal 25 (FIG. 3 A) according to the proximity of its frequency to the desired frequency, and representing sample signal data 21 (FIG. 3 A) and carrier replica signal 25 (FIG. 3 A) from at least one channel as bits in signal sign 21A (FIG. 3A) and, if present, signal magnitude 21B (FIG. 3A) and carrier replica sign 25A (FIG. 3A) and carrier replica magnitude 25B (FIG. 3 A) (method step 101, FIG. 4A). Note that carrier replica signal 25 (FIG. 3 A) is chosen so that its frequency is close to the correct signal frequency. The method also includes the step of mixing signal data 21 (FIG. 3 A) to baseband by computing in-phase and quadrature baseband mixed sign 23 A (FIG. 3 A) and in-phase and quadrature baseband mixed high and low magnitude 23B/C (FIG. 3 A) (method step 103, FIG. 4A). The method forther includes the steps of selecting PRN code from pre-computed over-sampled PRN code table 28 (FIG. 3A) or of computing it using real-time over-sampled PRN code generator 30A (FIG 3A), representing prompt PRN code 29 (FIG. 3A) as prompt PRN code sign 29A (FIG. 3A), and representing early-minus-late PRN code 35 (FIG. 3 A) from as early-minus-late PRN code sign 35A (FIG. 3A) and early-minus-late PRN code zero mask 35B (FIG 3A) (method step 105, FIG. 4A). The method forther includes the step of de-spreading in-phase and quadrature baseband mixed signal 23 (FIG. 3A) by mixing it with prompt PRN code 29 (FIG. 3 A) and early-minus-late PRN code 35 (FIG 3 A), resulting in in-phase and quadrature folly mixed prompt integrands 31 (FIG. 3 A), and folly mixed early-minus-late integrands 33 (FIG. 3 A) (method step 107, FIG. 4A). The method forther includes the step of using prompt value word logic 27A (FIG. 3B) to compute prompt integrand value words 27 (FIG. 3B) from the in-phase and quadrature folly mixed prompt integrands 31 (FIG. 3 A). The method forther includes the step of using early-minus-late value word logic 37A (FIG. 3B) to compute early- minus-late integrand value words 37 (FIG. 3B) from the fully mixed early-minus-late
integrands 33 (FIG, 3B) (method step 109, FIG. 4A). The method further includes the steps of summing over each prompt integrand value word 27 and early-minus-late integrand value word 37 (FIG. 3B) the number of one bits (or zero bits) using one bits summation table 38 (FIG. 3B) or using a processor command if available (method step 111, FIG. 4B), and summing, over the accumulation interval, the number of one bits (or zero bits) in each prompt integrand value word 27 and early-minus-late integrand value word 37 to produce prompt accumulations 41 (FIG. 3B) and early-minus-late accumulations 49 (FIG. 3B) (method step 113, FIG. 4B). The method forther includes the step of multiplying prompt accumulations 41 (FIG. 3B) and early-minus-late accumulations 49 (FIG. 3B) by corresponding values 41A and summing the results over the value words of each signal for an entire accumulation interval to yield in-phase and quadrature summed prompt accumulations 45 (FIG. 3B) and summed early-minus-late accumulations 47 (FIG. 3B) (method step 115, FIG.4B) that are stored for use by acquisition techniques or tracking loops. The method forther includes the step of rotating the in-phase and quadrature summed prompt accumulations 45 (FIG. 3B) and summed early-minus-late accumulations 47 (FIG. 3B) (method step 117, FIG.4B) to simulate a condition in which baseband mixing had been performed using cosine and sine signal replicas with the correct frequency and phase. If there are more channels to process (decision step 119, FIG. 4B), the method includes the step of repeating the previous steps beginning at method step 101, FIG. 4A. If there are no more channels to process (decision step 119, FIG. 4B), the method includes the step of setting parameters for the next accumulation period, including storing current C/A code phases, epoch counters, carrier phases, and carrier Doppler shifts (method step 121, FIG. 4B). If the time period to wait until the next accumulations need to be calculated has not expired (decision step 123, FIG. 4B), the method includes the step of sleeping until the expiration of the time period (method step 125, FIG. 4B). If the time period has expired (decision step 123, FIG. 4B), the method includes the step of repeating the previous steps beginning at method step 101, FIG. 4A. The length of the time period depends on the nominal accumulation period. It is set to be less than this period, normally between 50% to 90% of this period, to reduce the possibility that accumulations are missed for any channels.
Referring again to FIGs. 3B and 4A, method step 109 (FIG. 4A) calls for computing value words. This computation starts by performing bit-wise parallel Boolean logic for each of the possible values in the right-hand column of the prompt integrand representation in Table 6. A 32-bit prompt integrand value word 27 (FIG. 3B) is computed for each thirty-two samples and each row of Table 6. The prompt integrand value word 27 (FIG. 3B) contains ones for the sample times when the actual integrand equals the corresponding value in the right-hand column of Table 6, and zeros for the remaining times when the actual integrand does not equal this value. The prompt integrand value words 27 (FIG. 3B) corresponding to the possible Table 6 values are formed by method step 109 (FIG. 4A) as follows: MINUSONE = NOT(SIGN) AND [NOT(HIGHMAG) AND NOT(LOWMAG)] (13)
MLNUSTWO = NOT(SIGN) AND [NOT(fflGHMAG) AND LOWMAG] (14)
MLNfUSTHREE = NOT(SIGN) AND [HIGHMAG AND NOTfLOWMAG)] (15)
MLNUSSIX = NOT(SIGN) AND [HIGHMAG AND LOWMAG] (16)
PLUSONE = SIGN AND [NOT(HIGHMAG) AND NOTQLOWMAG)] (17) PLUSTWO = SIGN AND [NOT(HIGHMAG) AND LOWMAG] (18)
PLUSTHREE = SIGN AND [HIGHMAG AND NOT(LOWMAG)] (19)
PLUSSLX = SIGN AND [HIGHMAG AND LOWMAG] (20)
Continuing to refer to FIGs. 3 A, 3B, 4A, and 4B, method steps 109 (FIG. 4A), 111 (FIG 4B), and 113 (FIG. 4B) call for operations for the folly mixed early-minus-late integrands 33 (FIG. 3 A) that are similar to those for the folly mixed prompt integrands 31 (FIG. 3A). Early-minus-late integrand value words 37 (FIG. 3B) correspond to values that are double those of the prompt integrand value words 27 (FIG. 3B), i.e., the MINUS SIX word becomes the ML USTWELVE word. Also, an additional AND operation must be performed with the zero mask bits of Table 7 in order to mask out sample times when the early and late PRN codes cancel each other. Possible formulas for the method step 109 (FIG. 4A) computation of these early-minus-late integrand value words 37 (FIG. 3B) are as follows: MINUSTWO = [ZEROMASK AND NOT(SIGN)] AND [NOT(HIGHMAG) AND NOT(LOWMAG)] (21)
MfNUSFOUR= [ZEROMASK AND NOT(SIGN)] AND |NOT(fflGHMAG) AND LOWMAG] (22)
ML USSLX = [ZEROMASK AND NOT(SIGN)] AND [HIGHMAG AND NOT(LOWMAG)] (23) MTNUSTWELVE = [ZEROMASK AND NOT(SIGN)] AND [HIGHMAG AND
LOWMAG] (24)
PLUSTWO = [ZEROMASK AND SIGN] AND [NOT(HIGHMAG) AND NOT(LOWMAG)] (25)
PLUSFOUR= [ZEROMASK AND SIGN] AND [NOT(HIGHMAG) AND LOWMAG] (26)
PLUSSLX = [ZEROMASK AND SIGN] AND [HIGHMAG AND NOT(LOWMAG)] (27)
PLUSTWELVE = [ZEROMASK AND SIGN] AND [HIGHMAG AND LOWMAG] (28) Additional zero masking can occur in the first and last words of an accumulation interval. This is true because the start and stop times of an accumulation interval do not normally fall at the boundaries of data words. Therefore, the bits in the first word that precede the accumulation interval may need to get zero masked as might the bits in the last word that come after the end of the accumulation interval. Referring primarily to FIGs. 3B and 4B, the one bits counting operations of method step
111 (FIG 4B) form the count of the number of one bits in each of the eight value words. If there are no such counting operations in the instruction set of microprocessor 16 (FIG. 1), the counting can be accomplished using a table look-up. In the case of a table look-up, prompt integrand value words 27 and early-minus-late integrand value words 37 (FIG. 3B) can be used as addresses in one bits summation table 38 (FIG. 3B), and one bits summation table 38 (FIG. 3B) can output the number of one values (or zeros) in the address. For example, if the table look-up operation is called BITSUM, the following computations can be performed to compute one-bits counts:
ONESCOUNT = BΓTSUM(VALUEWORD) (29)
where the output of the table ONESCOUNT is the number of one bits in the word VALUEWORD. This operation is repeated for each of the prompt integrand value words 27 (FIG. 3B) and early-minus-late integrand value words 37 (FIG. 3B) in. order to accomplish method step 111 (FIG. 4B). Selection of table width, for example 16-bit or 32-bit, depends on the amount of memory available and other design decisions. If the table width is smaller than the number of bits in a value word, then multiple calls of the table are used in order to sum up the total number of one values in a given value word. Each call takes as input only a portion of the bits in the value word.
Continuing to refer primarily to FIGs. 3B and 4B, the accumulation operations of method steps 113 (FIG. 4B) and 115 (FIG. 4B) sum the one bit counts for each prompt integrand value word 27 (FIG. 3B) and for each early-minus-late integrand value word 37 (FIG. 3B) over the entire accumulation interval, multiply each result by the value 41 A (FIG. 3B) that is associated with the value word, and sum all of these scaled value accumulations to form the accumulations of equations (6) and (7), summed prompt accumulation 45 (FIG. 3B) and summed early-minus-late accumulation 47 (FIG. 3B). For example, the following computations can be performed to compute the in-phase summed prompt accumulation 45 in equation (6) as follows:
Nw Nw
Iχ/k ) = - Σ ONESCOUNT(-l)fo/ -2 ∑ ONESCOUNT(-2)/„/ 1=1 l=\
-3 ∑ ONESCOUNT(-3)/p; - 6 ∑ ONESCOUNT(-6)/p /=1 t=l
Nw Nw + ∑ ONESCOUNT(+l)//,/ + 2 ∑ ONESCOUNT(+2)//,/
+ 3 ∑ ONESCOUNT(-ι-3)/p/ + 6 ∑ ONESCOUNT(+6)/i? (30)
where / is the index of successive bit-wise parallel data words in the accumulation interval, Nw is the total number of data words in the interval, and ONESCOUNT(£)_p is the ones count for the corresponding value word 41 (FIG. 3B) associated with value k 41 A (FIG. 3B) for the /th data word interval and the in-phase summed prompt accumulation 45 (FIG. 3B). The quadrature summed prompt accumulations 45 (FIG. 3B) and the in-phase and
quadrature summed early-minus-late accumulations 47 (FIG. 3B) are calculated in a similar manner. The only difference is in the actual ONESCOUNT values used and, for the case of early-minus-late signals, the set of lvalues 41 A (FIG. 3B).
Continuing to refer primarily to FIGs. 4A and 4B, the method of the present invention can be adapted to work with a different number of bits in the representation of the RF front- end output and of the baseband mixed signals. An increase above two bits can make the logic more complex and may decrease the time savings over straight integer arithmetic. A decrease to a 1-bit representation can have the opposite effect. For example, if the RF front-end uses 1- bit digitization rather than 2-bit digitization while carrier replica signal 25 (FIG. 2B) retains its 2-bit digitization, then the operation count can decrease by a factor of almost two for the 1- bit method, which can make the logic execute about 4.2 times faster than straight integer
I arithmetic.
Returning to the discussion of determining PRN code, and now referring again FIGs. 2C, 2D, and 3A, the real-time generation of bit-wise parallel over-sampled prompt PRN code sign 29A (FIGs. 2D and 3 A), early-minus-late PRN code sign 35 A (FIGs. 2D and 3 A), and early-minus-late PRN code zero mask 35B (FIGs. 2D and 3 A) can be carried out by real-time over-sampled PRN code generator 30A (FIG. 3A). The inputs to this calculation are the actual PRN code chip length 65 (FIG. 2D), Δtc, the sample interval 63 (FIG. 2D), Ats, the nominal early-to-late code delay 61 (FIG. 2D), Δtgmi, the end time of the first code chip relative to the first sample time, or put another way, the time lag Δto, 67 (FIG. 2D) from the first RF sample time to the end time of the first prompt PRN code chip, and prompt code chips 91 (FIGs. 2D and 6). The outputs are the three integers that store the prompt PRN code sign 29A (FIGs. 2D and 3 A), early-minus-late PRN code zero mask 35B (FIGs. 2D and 3 ), and early-minus-late PRN code sign 35A (FIGs. 2D and 3 A), which are all in bit-wise parallel format.
Referring again to FIGs. 2C and 2D, table look-ups can be used to translate a PRN code and its timing information to bit-wise parallel representations of its over-sampled prompt and early-minus-late versions. - The required table look-ups can be simplified by recognizing that the following parameters are substantially constant, for the purposes of this calculation: sampling interval 63 (FIG. 2D), Δts, the nominal chip length , Δtcmn!, the early-minus-late
code delay 61 (FIG. 2D), Δtemι, used by software correlator 19 (FIG. 1), and the maximum number of chips that span a data word of microprocessor 16 (FIG 1). The difference between the actual chipping rate Δfc (reciprocal of Δtc) and the nominal chipping rate Δfcmm (reciprocal of Δtcnon that is used for the above simplification can be accommodated by correcting time lag 67 (FIG. 2D), Δto, for the average effects of Doppler shift, a procedure discussed later. Using the simplification, each look-up table has two variable inputs: the actual set of prompt code chips 91 (FIGs. 2D and 6) and time lag 67 (FIG. 2D), Δto. A table look-up procedure for each signal component yields a single integer result for prompt PRN code sign 29A (FIGs. 2D and 3 A), another single integer result for early-minus-late PRN code zero mask 35B (FIGs. 2D and 3A), and yet another single integer result for early-minus-late PRN code sign 35A (FIGs. 2D and 3A).
Time lag 67 (FIG. 2D), Δto, can take on an infinite number of values in the continuous range:
- Δteml < ) ≤
Δtc
( 1) This range's lower limit guarantees that the end time of the first late chip occurs no earlier than the first sample time. A lower time lag 67 (FIG 2D) Δto value would make the first chip irrelevant to the prompt PRN code 29 (FIG. 2D), early code 69B (FIG. 2D), and late code 69C (FIG. 2D) at all of the sample times. The upper limit in equation (31) guarantees that the start time of the first late chip occurs no later than the first sample. A larger value of Δto would leave the late code 69C (FIG. 2D) at the first sample time undefined based on the available code chips.
Referring now to FIG. 5, to create an electronically processable table, the continuous range of Δto values can be replaced with a discrete grid having m equally spaced points per sample interval 63 (FIG. 2D), Δts. The integer m is chosen to be large enough so that the granularity Δt m gives sufficient PRN code timing resolution. In GPS applications m is usually chosen to be large enough so that (cΔtJm) is on the order of several meters or less, where c is the speed of light, but reasonably sized because the table sizes are usually proportional to m. Given a choice of m, the grid of relative end times of the first prompt code period is:
kAt A = ~^i for k = k > - k (32) where the limits
provide foil coverage of the interval defined in equation (31). The floor() function rounds to the nearest integer in the direction of -co. This k
mi„ value can cause the minimum Δt
0Ato fall slightly below the lower limit in equation (31), which can cause memory inefficiency, but this value is advantageous because it may simplify some forther computations. The size for each table can be a function of the maximum number of code chips that may fall within a data, word's sample range. Given Δto, bit information for the following number of code chips is required in order for the prompt PRN code 29 (FIG. 2D), early code 69B (FIG. 2D), and late code 69C (FIG. 2D) to be folly specified at all of the data word's sample times:
where n
s is the number of data samples that can be stored in bit-wise parallel format in each word. It is clear from equation (34) that l(Δto) is a non-increasing function of Δto. Therefore, the maximum number of required chips occurs at the minimum value of Δto:
L = l(Δtokmin) (35) The size of each table can be determined from the parameters kmi„, kmax, and E. The grid contains kω - km∞rkmin+l) different time offsets of the first code chip. At each of these grid points there are 2L possible combinations of the code chips. Thus, each table optimally contains ktoty.2L entries, and each entry is optimally an unsigned integer in the range from 0 to' 2M'-1.
Continuing to refer to FIG. 5, each table can be stored as an array with a single index. The first 2L entries correspond to the 2L different possible chip sequences that can occur at Δto = **™ . the next 21 entries correspond to Δt0 = Δt0^kmin+ι-) , and so forth. The tabulated bit sequences for a fixed Δto are ordered by interpreting the sequence as a binary index counter with the first chip being the most significant counter bit and the EΛ chip being the least significant bit. The integer elements of the table can be the x(i) table elements 81 with corresponding code time offset 83 Δtok, and corresponding bit sequence 85 of the chips. The array index of a given x(ή table element 81 can be computed based on its code time offset 83 -dtofcgrid index k and its corresponding bit sequence 85. The corresponding bit sequence 85 associated with the array index consists of the chip values C(l), C(2), C(3), ... C(L). The C(j) chip values are either zero or one, with zero representing a -1 PRN code value and one representing a + 1 PRN code value, and they are listed in order of increasing time. The corresponding array index of the x(i) table element 81 is:
i[k,C(l),C(2),C(3),...,C(L)]^ l + (k-kmin) x 2L + ∑C(j)2L-J (36)
;=ι This equation can be inverted to give the code time offset 83 grid index k and the corresponding bit sequence 85 as functions of the x(i) table element 81 index i: = kmin + floor[(i - l)/2L] (37a) mod[(/ - l),2L]
C(j,i) = mod floor nL-j for/ = l, 2, 3, ..., Z (37b)
where modty^z) =y - zxfloor(y/z) is the usual remainder function. Continuing to refer to FIG. 5, the following computations generate the x(i) table elements 81 entries of the three tables. Given i, the corresponding code time offset 83 grid index k(i) is computed from equation (37a) and is used to generate three sequences of chip indices: k(i) At
s j («, ) = 2 + floor
■{ n-l - for H = 1, 2, 3, ..., n
s • (38a) m Δt
c
j
e(n,i) = 2 + floor\ t- 1- 3, ..., «
s (38b)
Jl(n,i) = 2 + floor n-1- for « = 1, 2, 3, ..., n
s (38c)
where « is the index of the sample time within the over-sampled data word. The integer jp(n,i) is the index of the PRN code chip that applies at sample n for the prompt PRN code 29 (FIG. 2D). The mtegersy
' e(rc,t) andj^n ) are defined similarly for the early code 69B (FIG. 2D) and late code 69C (FIG. 2D), respectively. The formulas in equations (38a)-(38c) amount to time measurements of each sample given in units of chip lengths past the first chip. These indices, in turn, can be used to determine the chip values that apply at the sample times: C
p(n,i) = C[j
p(n,i);i] for« = 1, 2, 3, .... n
s (39a) C
e(n,i)=- C[j
e(n,i);i] for« = 1, 2, 3, ..., n
s (39b)
C,(/ϊ,ϊ) = CU,(n,i);f\ for n = 1, 2, 3, ..., ns (39c) where Cp(n,i) is the over-sampled prompt PRN code 29 (FIG. 2D), and Ce(n,i) and Cχ»,7) are, respectively, the early code 69B (FIG. 2D) and late code 69C (FIG. 2D). Each of these code bit values is either zero or one, as dictated by the outer mod( ,2) operation in equation (37b). These over-sampled chip values can, in turn, be used to formulate tabulated functions xP(i), Xemh f), and xemns(j) that generate the unsigned integers that constitute the bit-wise parallel code representations of the three tables:
xp( ) = ∑Cp(n,i) x 2"' (40a)
XemlzmV) = ∑mod{[Ce(n,i) + C1(n,i)],2} 2n"' (40b)
«=1
Xemlls OO = ∑ mod{[Ce (n, i) + C/ (II, ι)l 2} x Ce (n, i) x 2n* n (40c)
»=1 where xp(i) is the entry of the prompt sign table, xe km( ) is the entry of the early-minus- late zero mask table, and xemi2s(i) is the entry of the early-minus-late 2's sign table. Note that the formula used in equation (40c) is only an example illustrative embodiment of the early- minus-late 2's sign table calculation. It places zeros in all of the X entries of early-minus-late
PRN code sign 35 A (FIG. 2D). There exist alternate formulas that are equally correct but that do not place zeros in the X entries.
The table layout in FIG. 5 is only an illustrative embodiment of how one can construct a table that can be used to translate PRN code chip values and timing information into data words that store the bit- wise parallel representations of the over-sampled prompt PRN code sign 29A (FIG. 2D), early-minus-late PRN code zero mask 35B (FIG. 2D), and early-minus- late PRN code sign 35 A (FIG. 2D). Other table layouts are also possible. Possible illustrative index calculations are described below for indexing into the tables for PRN code retrieval during accumulation calculations If another table layout is used, then different indexing calculations might be needed. Furthermore, different indexing calculations can be used even for the illustrative table layout shown in FIG. 5.
Referring now primarily to FIG. 6, accumulation calculations, as have been previously outlined herein and elsewhere, work with a fixed sequence of code chips. The prompt version of this sequence has a specified timing relationship to the incoming RF signal data 21 (FIG. 2A). This relationship can be pre-determined by a code search algorithm if software receiver 10 (FIG. 1) is in acquisition mode or by its delay-locked loop if it is in tracking mode. Software correlator 19 (FIG. 1) can calculate an accumulation using prompt code chips 91 (FIG. 6) C(l) through C(M). The timing of the prompt replicas of prompt code chips 91 (FI 6) can define the accumulation interval. The chip sequence starts at start lag 93 (FIG. 6) Δt
start seconds past the first sample of data word
95 (FIG. 6), it chips at the constant chipping rate/ = lΛdt
c, and it ends at end time 97 (FIG. 6), which occurs Δt
staXMΔt
c seconds after the first sample of data word
95 (FIG. 6). The end of the A/
11 prompt code chip can occur during data word W
N 99 (FIG 6), which implies that Δt
st∞t + MΔt
c
N = ceil (41) \ n„Δt„ where the ceilQ function rounds to the nearest integer towards +∞. Some of the initial bits of data word W\ 95 (FIG. 6) and some of the final bits of data word WN 99 (FIG. 6) may not be included in the accumulation. Let n^a be the number of initial bits of data word Wι 95 (FIG. 6) that are excluded, and let riaf be the number of final bits of data word WN 99 (FIG. 6) that are excluded. The timing relationship in FIG. 6 implies that these numbers are:
At
= ceill s,tart (42a)
At,
n^ =-nsN - ceil(Δt^ +t MAtA (42b)
These sample counts can be used to develop additional zero mask words that software correlator 19 (FIG. 1) uses to properly process the first and last data words during its bit-wise parallel accumulation calculations, as defined in A 12-Chωmel Real-Time GPS LI Software Receiver, B. M. Ledvina et al., Proceedings of the ION National Technical Meeting. January 22-24, 2003, Anaheim, CA. and Bit-Wise Parallel Algorithms for Efficient Software Correlation Applied to a GPS Software Receiver, B.M. Ledvina et al., to appear in the IEEE Transactions on Wireless Communications, 2003, both incorporated herein in their entirety by reference. Note that equations (41)-(42b) and all related timing considerations herein use the following code chip start/stop convention: a sample is correlated with a particular code chip if the start time of the code chip coincides exactly with the sample time, but it will not get correlated with that chip if its sample time coincides exactly with the end time of the code chip. Continuing to refer to FIG. 6, efficiently determining the correct xP(i), xemkm(J), and
X
emi
2s(i) bit-wise parallel code representations for the Ndata words
95 (FIG. 6) through data word W
N 99 (FIG. 6) involves making an efficient determination of the correct table index /
'„ that corresponds to data word W
v for κ= 1, ..., N, where the table index i
v is a function of start lag Δt
start 93 (FIG. 6), actual PRN code chip length Δt
c 65 (FIG. 6), v, and prompt code chips C(0), C(l), C(2), ..., C( +1) 91 (FIG. 6). The chip value C(0) 94 (FIG. 6) is needed in order to specify the late code 69C (FIG. 2D) at the initial few samples of the accumulation, and the chip value C(M+1) 96 (FIG. 6) is needed to specify the early code 69B (FIG. 2D) at the final few samples. Additional constants that can be used in order to determine the i
v indices are Δt
s, n
s, m, L, k
mi„, k
max, and nominal chip length Δt
cnom, which has been used to generate the three x(f) tables.
The first step of the index calculation procedure pre-computes and stores a table of candidate integers for the final summation term that appears on the right-hand side of equation (36). This table takes the form:
L τ .
Δi(μ) = ∑C(μ +j ~L-l) 2L~-> for μ= 1, 2, 3, ..., (M+L+Ϊ) (43)
/=!
This computation requires the undefined chip values C(-L+l), C(-L+2), C(-L+3), ..., C(- 1), and C( +2), C(M+3), C(M+4), ..., C(M+L). The value zero can be used for each of these undefined chips because they can affect the over-sampled codes only for the first ^ samples of data word
samples of data word WN 99 (FIG. 6), none of which are part of the accumulation. The table of equation (43) can be constructed by using the following iterative procedure:
Δi(ϊ) = C(0) (44a)
M(μ) = mod 2Δi(μ - Ϊ),2L] + C(μ - 1) for μ = 2, 3, 4, ..., (M+2) (44b) (μ) = mod 2.M(μ-\),2L for i = ( +3), (M+4), ..., ( +L+l) (44c)
Note that the mod(2 , ^) operation in the latter two equations can be replaced by a single truncated leftward bit shift.
In many cases prompt code chips 91 (FIG. 6) C(0), C(l), C(2), ... can be generated as the output of a feedback shift register or a system of such registers. For example, the new GPS civilian L2 signals can be generated this way. In this case, each iteration of equation (44b) can be interleaved with an iteration of the shift register calculations. Shift-register generation of PRN codes is well-known in the art.
An alternative to building up the previously-described table is to calculate the index component only for one data word at a time. Suppose that Δiv is the correct index component for data word W and that μv is the auxiliary index that would have been used to determine .4/ v from the Δi(μ) table had the table existed. In order to calculate zli i for data word ITv+i, μvt-i is computed (procedure defined herein), feedback shift register calculations that generate C(μ„), C(μv+\), C(μv+2), ..., C μv^ι -1) are iterated, and the resulting chip values are used to perform (μ^i - μ ) iterations of equations (44b) or (44c). Determination of the correct index into the xp(i), xemizm(i), and xemi2s(f) tables for data word Wv can be reduced to the determination of two quantities. One is the time offset index &vthat causes zlto/cvfrom equation (32) to match the true time offset for data word Wv as closely as possible. The other quantity is the auxiliary table index μv. It constitutes an index
for the sequence of actual code chips that are associated with data word Wv. Given these two quantities, the correct index for the three x(i) tables is iv = \ + (kv ~kmin) x2L +Ai(μv) for v= 1, 2, 3 ..., N (45)
The auxiliary index μv is determined by the position of the Wv data word relative to the PRN code chip sequence. Once that position has been ascertained, the index kv can be calculated from the position relative to the Wv samples of the L code chips that are associated with the index μv.
A time integer can keep track of the number of fine-scale time units in a given interval.
The fine-scale time unit is a small fraction of the sample interval 63 (FIG. 2D), Δt
s: Δt
f -= ^- (46) m
f where w/is the integer number of fine-scale time intervals per sample interval 63 (FIG. 2D), Δt
s. This number is chosen large enough, for example m > 2mN, to preclude any significant build-up of timing errors during an accumulation interval due to the finite time resolution Δtf. Nis the number of data words in the accumulation interval. The calculation of the k
v values over one accumulation interval involves approximately N iterative time increments, each of which has a resolution of Δtf. If /w/obeys the inequality given above, then the cumulative timing errors due to the finite precision ΔtfViiW be less than the timing error caused by the finite timing precision of the x(i) tables. Normally it is possible to make mf much larger than 2mN and still keep all of the relevant calculations within the size limits of a 32-bit signed integer. If mfis a power of two, a rightward bit shift operation can be used to implement integer division by mf. Time unit Δtf can be used to define an integer that approximately keeps track of the code/sample time offset
data word W
v:
Δt„ tnfAtn„ kfv ~ round(-^) = round( f v ) (47)
Atf Ats where the roundQ function rounds up or down to the nearest integer. The time lag 67 " (FIG. 2D), Δt0v, is the amount by which the end time of PRN code chip C(jUvL) lags the first sample time of data word Wv . The algorithm that iteratively determines kfv tries to keep the relationship in equation (47) exact, but using only integer operations can allow small errors to
build up. Note that kfjmf≡ kjm, as implied by a comparison of equations (32) and (47). This relationship can be used to determine kv from an iteratively determined kfV. Several constants are required by the iterative procedure that determines kfV, kv, and μv. The first five constants are used to account for the difference between the nominal chip length Δtcnom, used to generate the x(i) tables, and the actual chip length 65 (FIG. 2D), ltcused in the accumulation: kfinid - round[(ns - l)mf/2] (48a) λ = Δtc -Δtcnom
*C
fa -= round(λb
βx) (48e) where the signQ function returns +1 if its input argument is positive, zero if the argument is zero, and -1 if the argument is negative. The index kmt
d is approximately half the length of a data word as measured in units of Δt
f seconds. During an accumulation, the rational factor af
jJb
βX gets multiplied by the time offset between the end time of the first code chip and the midpoint of the data word. The result is a time perturbation that removes the average effect of the difference between the actual and nominal PRN code chipping rates. The time perturbation can be used to compute a corrected kf
V value:
Equation (48d) picks bf* to equal a power of two so that the integer division by bβx in equation (49) can be accomplished using a rightward bit shift operation. The roundQ operation in equation (49) can be accomplished as part of the division if one first adds siga.(afi
x)xbfiJ2 to the quantity
bit shift that constitutes division by bβ
X. This approach can give the correct k ψ- because the signs of (Jc
fmia- fcf
v) and bfl
x are both positive and because the rightward bit shift has the effect of rounding the
signed division result towards zero. An alternate implementation of the round function could be used for applications that do not guarantee kfmid > fy
v. Such applications are normally associated with L < 2 PRN code chips per data word.
Five additional constants can be used to define the kfv and μv iterations:
Lfyp - round(-^ — -) (50a)
Δtc mj-Δtc
Δkjc - round(- (50b) At, f»fLtypAtc
Akftyp = round( ) - nsmf (50c)
Δt,
kfmin (50d)
kfmax (50e)
The constant Ltyp is the typical number of code chips per data word. It is the nominal increment to //„ρer data word. The constant Δkf
c equals the number of fine-scale time intervals per PRN code chip. The constant Δk
β is used to adjust kf
V up or down if kf
V falls outside of the limits: kf
mi ≤ kf
v< kj
ma
x- The constant Δkf p is the nominal increment to kf
V per data word. The limits kfi„
m and kf
ma
x are approximately the limits k
mιn and k
max from equations (33a) and (33b) re-scaled to the new fine time scale and adjusted for the difference between the nominal code chipping rate of the x(i) tables and the actual chipping rate of the accumulation. The extra -2 term on the right-hand side of equation (33 a) is compensated for by the increment to k
mi„ on the right-hand side of equation (50d) and the decrement to k
max on the right-hand side of equation (50e). The original -2 term and the increment and decrement have been included because they ensure that &/ values which respect the limits in equation (50d) and (50e) are transformed into A: values that respect the limits in equations (33a) and (33b).
The iteration begins by initializing kf\ and μ
\ for the first data word. The nominal initial values are:
-Δt. λ start
Mltiom = fl oή + 1 + L (51b)
At c J
It is possible that kβ„
om from equation (51a) can violate its upper limit kf^
ax. Therefore, the following conditional adjustment can be implemented in order to finish the initialization. r _
\
kflnom ~ Akfc if
kfmax <
kβnom
Mir* f kflnom — "'/max μi=- (52b) Mlnom 1 n kfmax < kflnom
Given this initialization, the calculation of (kβ,μ
2), (kβ,μ
3), (kf,μX), ..., (kfN,μX) proceeds according to the following iteration: kfxmom
2,3,4,...,N (53a)
Mvnom = M(v-Ϊ) + Ltyp f°r - 2, 3, 4, ,N (53b)
kfvnom + kfi ttkfvnom < kfmm
«/„ — fvnom i "f k ' /m.mm < — k fm-vnom — <k "ft ax for v= 2, 3, 4, .., N (54a)
, k'"fvnom — Ak fc xfk /max < ^ k frvnom
Mvnom + 1 ** * wιow < ' min
My = i 'vnom if *.» ≤ £raom ≤ ^max for v= 2, 3, 4, ..., N (54b) r^vnom ~ ϊ */max < "yvnom
The table look-up calculations finish with the computation of kv, iv, and the actual table look-ups:
k
v = round
"mkfifixΦfv) for ι= 1, 2, 3..., N (55)
The roundQ operation in equation (55) can be implemented by adding rø/2 to before the rightward bit shift that constitutes division by mf. The result of the division will be the correct value of kv for any sign of kfyβXkfv) if the computer works with 2's
compliment notation for signed integers and if the rightward bit shift fills in from the left with the 2's compliment sign bit, i.e., with the left-most bit.
Given &„from equation (55) and _uvfrom equation (54b), one can use equation (45) to compute iv. This value, in turn, can be used to index into the tables to determine the Prompt PRN code sign 29A (FIGs. 2D and 3A), xpv> the early-minus-late PRN code zero mask 35B (FIGs. 2D and 3A), Λremfemv_ and the early-minus-late PRN code sign 35A (FIGs. 2D and 3 A), Xemi2sv, that correspond to data word Wv: xPv = x (iy) for v= 1, 2, 3 ..., N (56a)
Xembmv = Xemlzm(lv) for V= 1, 2, 3 ..., N (56b) Xeml2sv = Xeml2s(ϊv) for V= 1, 2, 3 ..., N (56c)
The conditionals in equations (54a) and (54b) can be reduced to a single conditional per data word during normal operation to improve efficiency. This can be done because the sign of kflyp in equation (53 a) is fixed for a given accumulation interval. (Normally the sign of Aktyp does not vary from accumulation interval to accumulation interval or from channel to channel for a given receiver because the only variable quantity that affects Δkβyp is actual chip length 65 (FIG. 2D), Δtc, which normally does not vary significantly.) If Δkjfyp < 0, then the proper formula for determining kfv and μv can be chosen by considering the inequality kfv„om
< kfimn. Conversely, i Δkfyφ > 0, then the proper formula can be determined by considering the inequality kfmom > kfi»∞. The decision about which condition to check can be made at the beginning of the accumulation because Δkfy,p is calculated prior to execution of the iteration in equations (53a)-(56c).
When using a processor that creates instruction pipelines, "if statements can disrupt the pipeline. In this case equations (54a) and (54b) can be replaced with the following computations: min[ , sign(kfmom - kfmin )] Δkftyp < 0
Vfv 0 Δkβyp = 0 for v= 2, 3, 4, ..., N (57a) max[Q, sign(kfmom - kfmax )] Δkftyp > 0 kfv = kfmom - η/yΔkβ for y= 2, 3, 4, ..., N (57b)
Mv = Mvnom ~ ηfv for "= 2> 3> 4> -» N (57c)
The minQ and maxQ functions return, respectively, the minimum or maximum of their two input arguments. The variable 77/ is normally zero, in which case equations (57b) and (57c) leave kfv equal to fmom and μv equal to μmom. The value of ψv is -1 if Akfyp < 0 and kfvnom < kfmm, and +1 if Akftyp > 0 and fvnom > kpnax- In both of these cases ψv causes equation (57b) and (57c) to perform the necessary adjustments to kfv and μv. Note that efficient code may not execute the conditional in equation (57a) once per data word. Instead, its accumulation iterations could be performed in one of three different iterative loops, depending on the value of Akftyp. Additional economies can be had in the first and third conditional clauses of equation (57a). The value of -77/yfor the first condition is equal to the sign bit of the 2's compliment representation of kfmom - kfintn. Similarly, +ηfV oτ the third condition is equal to the sign bit of the 2's compliment representation of kfinax - kfvnom- In either case, ηfV (or its negative) can be computed in two operations.
Summarizing real-time over-sampled PRN code generator 30A (FIG. 3A) and referring now to FIG. 1, to compute prompt PRN code 29 (FIG. 3 A) and early-minus-late PRN code 35 (FIG. 3 A) for an entire accumulation interval, the method includes the steps of iterating equations (44a)-(44c) (method step 201, FIG. 7) to construct the table of Δi(μ) values. The method forther includes the step of computing the auxiliary constants (method step 203, FIG. 7) in equations (48a)-(48e) and (50a)-(50e). The method forther includes the step of initializing kβ and μi (method step 205, FIG. 7) by evaluating equations (51a)-(52b). The method forther includes the step of iterating equations (53a), (53b), (57a)-(57c), (49), (55), and (45) (method step 207, FIG. 7) to compute, for each iteration, kfmom, μmom, Ηfv, kfv, μv, kv, and iv. The method forther includes the step of iterating equations (56a)-(56c) (method step 209, FIG. 7) to compute, for each iteration, xpv, Xemizmv, and xemi2sv. As mentioned already, it may prove efficient to interleave the equations (44a-c) iterations and the accompanying shift register iterations between the iterations that compute kfvnom through xemi2sv- In this scenario /κcan be computed from equation (57c). Afterwards, the shift register iterations that generate code chips C(βvi -1) though C(μv-Ϊ) can be
performed, and these chip values can be used to iterate equations (44a-c) from μ^ to μvin order to determine Δi(μv) from Ai(μ»ι).
The software correlator 19 (FIG. 1) of the present invention can advantageously be easily modified to work with signals at different frequencies, new PRN codes, or even signals for different types of devices. Thus, the same hardware could use the software correlator 19 (FIG. 1) to implement such devices as a GPS receiver, a cell phone, or both. To allow for new codes, new frequencies, and new types of functionality, small changes can be made in the software correlator 19 (FIG 1), or different versions of the software correlator 19 (FIG. 1) can be run on the same processor. The changes involve using a different baseband mixing frequency and a different PRN code in the correlation, and perhaps changes that would provide the new signals of interest to the software correlator 19 (FIG. 1). In order for the present invention to work with signals at different frequencies, new PRN codes, or signals for different devices, two fundamental changes need to be made. First, the baseband mixing frequency must be tailored to that of the signal data 21, which also involves pre-computing and storing sine and cosine tables at this new frequency. Second, new pre-computed over- sampled PRN code tables 28 (FIG. 3A) must be constructed. The size of the new tables should match the over-sampled accumulation period, or at least one over-sampled period of the PRN code. As an alternative to generating new pre-computed over-sampled PRN code tables 28 (FIG. 3 A), the new PRN codes can be generated in real-time by over-sampled PRN code generator 30A (FIG 3 A). Also, the system and method of the present invention could be implemented within systems such as GLONASS receivers, cell phones and cell base stations, pagers, wireless Ethernet (e.g. 802.11x standards), Bluetooth™, Blackberry® wireless internet devices, and satellite radio/phones (e.g. INMARSAT®). In fact, the system and method of the present invention are applicable to any sort of telecommunication system/device that uses spread spectrum, code division multiple access (CDMA) pseudo random number codes for the transmission of information, either wired or wireless.
Referring now to FIG. 8, navigation calculations require measured values of the PRN code phase 55, carrier phase, and carrier frequency.- The measurements for all tracked satellites must be taken at exactly the same time. A time interval counter (TIC) function provides a periodic timing scheme to synchronize these measurements at time tπc 56. At
time tTIC 56m the TIC fonction latches all of the PRN code phase 55, carrier phases, and carrier frequencies along with the code epoch counters, and software correlator 19 (FIG. 1) makes these available to application-specific code 15 (FIG. 1), for example, GPS receiver software. GPS receiver software uses the code phase and epoch counters to compute the pseudo range to each satellite. Software correlator 19 (FIG. 1) keeps track of the code and carrier phase of each signal as determined by the code chipping rate and the carrier Doppler shift inputs. The quantity / fc, the estimated code chipping rate of software receiver 10 for satellite j during its k PRN code period, can be determined either by an acquisition search procedure, or if tracking, by a delay-locked loop. Likewise, ώoo pj . he associated carrier Doppler shift, can be defined by an acquisition procedure or, if tracking has commenced, by a phase-locked loop or a frequency-locked loop. These determinations are made by application-specific code 15 (FIG. 1). Software correlator 19 (FIG. 1) can use these two frequencies to update quantities that keep track of its code and earner phases according to the formulas:
1023
£_*+! - £_*+• (58)
/ cj'
Φjk+l ~~ Φjk + β>D ' Xτjk+l ~ Tjk) (59)
Software correlator 19 (FIG. 1) can keep a running track of these quantities and can initialize these iterations as part of the signal acquisition calculations that it carries out in conjunction with application-specific code 15 (FIG. 1). The quantities TJQ and jς, are either sent to software correlator 19 by application-specific code 15, or they are initialized arbitrarily by software correlator 19 and application-specific code 15 executes feedback control of fcβ and &jβoppβ to force the sequences defined by equations (58) and (59) to converge to appropriate values. Information about the previously-described conventional method can be found in Dierendonck.
The TIC time tΩC 56 (FIG. 8) can occur at, for example, the millisecond boundaries of the receiver clock. At each time tΗC 56, the PRN code phase 55 (FIG. 8) of each signal is computed in the following manner:
where ψ
jΥIC is the PRN code phase 55 (FIG. 8) in chips of signal j at TIC time t
πc 56
(FIG. 8). The epoch counters, which are simply a running total of the number of code periods 57 (FIG. 8), are incremented at each code start/stop time.
The carrier phase calculation at time t
τlc 56 (FIG. 8) is similar to the PRN code phase 55 (FIG. 8) calculation: φ
jTJC —
~ Tjk+l) (61) where φ
ΎJC is the carrier phase at time t
nc 56. The Doppler shift that gets returned at time t
nc 56 (FIG. 8) is ώ
DomM .
With respect to the performance of the system and method of the present invention, a sample screen-shot from the illustrative embodiment of the present invention is provided in Table 8. This table illustratively shows the tracking of nine channels. The roof-mounted LI antenna of the illustrative embodiment can have a pre-amp with 26dB of gain. The software correlator 19 (FIG. 1) of the present invention can provide positional accuracy on the order of 10-15 meters when working in conjunction with application specific software 15 (FIG 1).
12 20 219 -3086 -3483 27.5 0.3 CCBF 9.6 -1.000
Table 8. Numerical GPS tracking data output by the illustrative embodiment. Two comparison tests illustrate the performance of the system and method of the present invention. In the first test, a first configuration includes a MITEL® GP2021 hardware correlator, but is in all other ways identical to a second configuration that includes the software correlator 19 (FIG. 1) of the present invention. The two configurations differ in SNR by less that ldB and in navigation solutions by no more than 5-10 meters. In the second test, timing studies using the system of the present invention show that processing six channels uses only about 20% of the processor's capacity, while Akos 2001a report a real-time software GPS receiver that would require 100 % of the capacity a 1.73 GHz microprocessor to implement a 6-channel GPS receiver when processing data from an RF front-end with a sampling frequency of 5.714 MHz.
Referring now to FIGs. 9A and 9B, among other indicators that could assess the accuracy of the PRN code generated by real-time dver-sampled PRN code generator 30A (FIG. 3A), which includes prompt PRN code sign 29A (FIG. 3A), early-minus-late PRN code sign 35A (FIG. 3 A), and early-minus-late PRN code zero mask 35B (FIG. 3 A), is the low distortion of the generated codes versus the true codes. FIG. 9A, generated for prompt code comparisons, shows juxtaposed plots of the autocorrelation fonction 80 (FIG. 9A) of the sampled true code and the cross correlation fonction 82 (FIG. 9A) between the sampled true code and the sampled code as generated from the new xp table. FIG. 9B shows a similar comparison for early-minus-late PRN code 35 (FIG. 3 A). In either case, the nominal chipping frequency is fcnθm = \IΔtcnom = 1.023 MHz, the sampling frequency is/ = \IΔts = 12.199 MHz, and the code timing resolution of the table is m = 12, i.e., l/12m of a sample. This resolution translates into approximately 1/143"* of a code chip. The actual chipping rate differs from the nominal by (fc -f om) = 1.5 Hz. This Doppler shift of the code chipping rate corresponds to a significant non-zero range rate between the receiver and the transmitter, 438 m/sec. The correlations are accumulated over 10230 code chips, and the code is a time-multiplexed version of a pair of the new GPS civilian L2 CL and CM codes. The data word indexing t c 5 calculations for this example use a code chip start time resolution of ΔtJIO , i.e., nif= 10 .
It is obvious from FIGs. 9A and 9B that prompt PRN code 29 (FIG. 3 A) and early- minus-late PRN code 35 (FIG. 3 A) distortion is very small. The correlations produced using the new tables are virtually identical to those produced from the exact code. The low distortion of the new approach is best characterized by two parameters: the amount by which the correlation peak of the prompt PRN code sign 29A (FIG. 3A) droops below one and the offset of the zero-crossing time of the early-minus-late PRN code 35 (FIG. 3A). The former metric characterizes the power loss of the new approach, and the latter metric characterizes the net timing error. The droop of the prompt peak is only 0.3% of the nominal amplitude, which translates into a 0.03 dB loss. The timing distortion of the new code is less than 4 l0"4 code chips. This distortion is very small; it translates into about 0.1 m of GPS range measurement error.
It may seem paradoxical that the code timing error is only 4x10"** code chips when the code timing granularity of the x(i) tables is ΔtJ(mΔtc) = 7xl0"3 code chips. The resolution of this paradox lies in the averaging effect of the accumulations. The length of a code chip equals 143.099269 code offset time grid intervals for the example shown in FIGs. 9A and 9B. The non-integer nature of this number causes the code offset errors of the x(i) tables to get dithered as the accumulation works its way through successive data words. This dithering tends to average out the table granularity errors, and this averaging can reduce the net timing error by an order of magnitude or more, as shown in FIG. 9B. Although the invention has been described with respect to various embodiments, it should be realized this invention is also capable of a wide variety of further and other embodiments.