WO1995032467A1 - Processor core which provides a linear extension of an addressable memory space - Google Patents

Processor core which provides a linear extension of an addressable memory space Download PDF

Info

Publication number
WO1995032467A1
WO1995032467A1 PCT/US1995/004668 US9504668W WO9532467A1 WO 1995032467 A1 WO1995032467 A1 WO 1995032467A1 US 9504668 W US9504668 W US 9504668W WO 9532467 A1 WO9532467 A1 WO 9532467A1
Authority
WO
WIPO (PCT)
Prior art keywords
bit
address
processor core
memory
execution unit
Prior art date
Application number
PCT/US1995/004668
Other languages
French (fr)
Inventor
Chaim Bendelac
Dan Biran
Ohad Falik
Gadi Erlich
Jonathan Levy
Gideon Intrater
Original Assignee
National Semiconductor Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by National Semiconductor Corporation filed Critical National Semiconductor Corporation
Priority to EP95916414A priority Critical patent/EP0711435A1/en
Priority to KR1019960700389A priority patent/KR960704269A/en
Publication of WO1995032467A1 publication Critical patent/WO1995032467A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F12/00Accessing, addressing or allocating within memory systems or architectures
    • G06F12/02Addressing or allocation; Relocation
    • G06F12/06Addressing a physical block of locations, e.g. base addressing, module addressing, memory dedication
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/30Arrangements for executing machine instructions, e.g. instruction decode
    • G06F9/32Address formation of the next instruction, e.g. by incrementing the instruction counter
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/30Arrangements for executing machine instructions, e.g. instruction decode
    • G06F9/32Address formation of the next instruction, e.g. by incrementing the instruction counter
    • G06F9/321Program or instruction counter, e.g. incrementing

Definitions

  • the present invention relates to microprocessor architectures and, in particular to a processor core which provides a linear extension of an addressable memory space.
  • the number of memory space locations which a particular processor can address is limited by a number of factors.
  • One limiting factor is the number of bits available (typically 16 bits) within the processor s internal registers where memory space addresses are stored. Since the processor's stack is used for temporary storage of memory space location addresses the bit-width (typically 16 bits) of the processor s stack is also a limiting factor.
  • Available memory once allocated to either data or code, can not be reallocated. Furthermore, a particular data structure is limited in size to the amount of memory actually allocated to data, and no more.
  • a second prior art attempt to overcome the limiting factors makes use of software overlays, where code is stored on a secondary storage device, such as a disk drive, during execution of a program, and pieces of the program code are retrieved, as they are needed, from the secondary storage device into the memory. This method does not extend the linearity of the memory space.
  • An architecture which employs software overlays also has the disadvantage of requiring a secondary storage device.
  • the use of overlays adds significant program execution overhead to both manage the overlays and to retrieve the pieces of program code from the external storage device
  • a third prior art attempt to overcome the limiting factors employs memory segmentation.
  • the Intel 8086 microprocessor manufactured by Intel Corporation of Santa Clara, California, employs memory segmentation.
  • the Intel 8086 has 16-bit segment registers which provide base addresses for addressing memory segments. For each memory access, a 20-bit address is calculated by adding the contents of the appropriate segment register, multiplied by 16, to an offset specified in the instruction or in another register.
  • the memory segmentation approach has the disadvantage that it increases both software and hardware complexity. It also has the disadvantage that data structures are limited in size and/or must be located near the beginning of a segment, or additional software complexity is required to traverse segment boundaries.
  • a fourth prior art attempt to overcome the limiting factors uses bank switching
  • bits from a microprocessor s output port are used to enable separate memory banks or are decoded as the most significant address bits.
  • a problem with the use of bank switching to overcome memory space limiting factors is that bank switching adds significant hardware and software complexity.
  • the bank switching implementations also limit the size of a particular data structure to the size of a memory bank.
  • the present invention is a processor core which provides a linear extension of an addressable memory space of a microprocessor with minimal addmonal hardware and software complexity.
  • an N + x bit pomter register (e.g. a program counter) means of the processor core holds an N + x bit address.
  • the N + x bit address is a pomter to an information entry in the memory which is to be processed by an execution unit of the processor core.
  • An encoder encodes the N + x bit address into an N bit encoding of die N + x bit address.
  • processor core general register width and stack width are overcome, such that the processor core can address 2 X times more memory space locations than 2 N .
  • a processor core comprises at least two register means.
  • Each register means is for holding a portion of an address, where the address is used to form a pointer to a datum in memory to be operated on.
  • An address forming means concatenates the portions of the address in the register means to form the address. Therefore, the address is formed from portions of the address stored in multiple registers without performing any arithmetic on the portions.
  • Fig. 1 is a block diagram lllustrating a processor core embodying the present invention.
  • Fig. 2 illustrates the program counter register of the processor core of Fig 1.
  • Fig 3 is a table that provides sample assembly language instructions utilizable in accordance with the present invention.
  • Figs 4A-4E illustrate an embodiment of the present invention in the context of the sample assembly language lnstructions in Fig. 3.
  • Fig. 5 illustrates the format for encoding a BAL instruction to be executed by the processor core of Fig. 1.
  • Fig 6 illustrates a pair of registers for holding a pomter to a datum in memory.
  • Fig. 7 illustrates the format for encoding a LOAD/STORE instruction to be executed by the processor core of Fig 1.
  • Fig 8 illustrates the general purpose registers of the register file of the processor core of Fig. 1.
  • Fig. 1 shows a processor core 28 in accordance with the present invention, interfaced to an external memory 30.
  • Many of the elements shown in Fig. 1 are conventional microprocessor components arranged in a convenuonal manner. Those conventional components are not described in detail since both their structure and opera tion are well-known to those skilled in the art.
  • the processor core 28 further includes a 17 bit (i.e. N+x bit, where N is 16 and x is 1) program counter register 32, which contams a 17 bit (i.e. N+x bit) address in the memory 30 of the next instruction to be executed.
  • program counter register 32 To fetch an instruction to be executed from the memory 30, the contents of the program counter register 32 are padded with a constant zero as a most significant bit and placed onto the 18 bit address bus 34, and the instruction is loaded into an instruction loader 36 via a 16 bit data bus 35.
  • the program counter register 32 is shown in detail in Fig 2.
  • the N bit portion of the register is bits 2-17, with bit 2 being the least significant bit of the N bit portion and bit 17 being the most significant bit of the N bit pomon.
  • the x bit portion of the register is bit 1.
  • the arithmetic logic unit (“ALU") 38 computes, in N + x bit (i.e 17 bit) precision, the target address at which instruction execution is to continue and writes the target address to the program counter register 32 via a MUX 42 and a RESULT bus 44.
  • a shifter 40 encodes a copy of the 17 bit (i. e N + x bit) address of the next sequenual instruction (i.e the "return address", which is the contents of the program counter) into a 16 bit (i.
  • N bit (i. e N bit) encoded address by shifting off the x bit portion of the N + x bit address such that the N bit portion remains.
  • the N bit (i. e. 16 bit) encoded return address is saved to an N bit (i. e .16 bit) wide stack space, which may be. for example, either in the external memory 30 or internal to the processor core 28 Altern atively, the encoded return address may be saved to one of the general registers in the register file 38, which are also N bits (i.e. 16 bits) wide.
  • the processor core 28 in accordance with the present invention can address 2 times (i.e 2 X times) as many memory locations (i.e 2 X nmes 2 N ) -but the components of the processor core 28 which are used to store addresses (i.e general purpose registers in the register file 31, stack areas in memory 30 or m the register file 31, etc) need to be only 16 bits (i.e. N bits).
  • the address of any instruction to be executed by the processor core 28 has least significant 1 bit (i.e. x bits) of zero.
  • the lnstructions can be aligned on double-word boundaries.
  • the address is encoded by saving only the most significant 16 bits (i.e. N bits) of the 18 bit (i.e. N+x bit) address That is. since the least significant 2 bits (i.e x bits) of the 18 bit (i.e. N +x bit) address are always zero, the least significant 2 bits (i.e x bits) need not be represented in the encoded address.
  • instructions can be aligned on addresses that, although not divisible by 2 X , have identical remainders when divided by 2 X , thus ensuring that the address of any instruction to be executed has identical least significant x bits.
  • a return address is encoded by saving only the most sig nificant N bit of the N+x bit address.
  • the return address written to the program counter register 32 is assumed to have least sigmficant bits equal to the aforementioned identical remainders.
  • the instruction address store/restore mechanism in accordance with the above-described preferred embodiment, is illustrated in the context of several jump/branch with linkage instructions, shown in assembly language form in Fig. 3.
  • the foregoing illustrations assume that the instruction stored in the memory 30 are all aligned on address boundaries divisible by 2 (i.e. by 2 X ) such that the address of any instriction to be executed by the processor core 28 has least significant 1 bit (i.e. x bits) of zero.
  • a branch and link instruction 50 which is of the form "BAL link, dest.” provides an illustration of the address store mechanism.
  • the 17 bit (i.e. N+x bit) return address is stored, from the ALU 38, in encoded form, into the 16 bit (i.e. N bit) register specified by the link field in the instruc tion 50.
  • the displacement to the instruction at which program execution is to continue may be up to 17 bits (i.e. N+x bits).
  • Fig. 5 shows the encoding of a BAL instruction. As shown in Fig. 5, only the most sig nificant 16 bits (i.e. N bits) of the displacement are encoded directly into the lnstr uction.
  • the actual displacement is obtained by shifting the encoded 16 bit (i.e. N bit) portion of the displacement left by 1 bit (i.e. x bits).
  • the address at which program execu tion is to continue is obtained by adding the actual displacement to the address contained in the program counter register.
  • Fig. 4A illustrates an execution of the BAL instruction.
  • the return address is 2909C 16 , which is the address of the current (bal) instruction, 29098 16 . plus four, the number of bytes in the current instruction.
  • Bits 2-17 of the return address i.e. the most sig nificant N bits
  • execunon control is passed to the dest instruction, labelled L, by adding 00F6C 16 , which is the offset to L from the address of the BAL instruction 50, to the current PC.
  • EXCP instruction 52 which is of the form "EXCP vector.” also provides an illustration of the address store mechanism. Before the EXCP instruction activates the trap specified by the vector operand, the return address, in encoded form, is pushed onto the 16 bit (i.e. N bit) wide interrupt stack.
  • the Jcond instruction 54 which is of the form " Jcond dest," provides an lllustration of the address restore mechanism
  • a two character condition code that describes a flag or flags in a processor status register whose state is to be tested is specified in the cond field of the instruction 54 If the condition specified bv the cond field is true, program execution continues at a destination address specified in encoded form in the 16 bit (i e N bit) dest register by resto ring the encoded address from the dest register into the most significant 16 bits (i e N bits) of the program counter register. The least significant bit (i. e. x bits) of the program counter register are cleared to zero .
  • Fig 4B illustrates an execution of the Jcond instruction Refernng to the "jlo r3" (jump if lower than) instruction shown in Fig 4B. which checks the state of the processor status register flags PSR.Z and PSR. L. If the processor status register flags PSR Z and PSR. L are zero, this instruction restores the destination address held in R3 (19004 16 , encoded as OC802 16 ) into bits 2-17 of the program counter register, clearing bit 1 of the program counter register Program execution then continues at the destination address.
  • the JAL instruction 56 which is of the form "JAL link, dest.' provides illustrations of both the store and restore mechanisms Program execution continues at the destination address encoded in the 16-bit dest register, by restoring the encoded desstination address from the dest register into the program counter register and cleanng the least significant bit of the program counter register The return address is stored in encoded form in the link register.
  • Fig.4C illustrates an example of the JAL instruction Referring to the "jal r15, r3" instruction shown in Fig 4C, this example loads the destination address held in register R3 (19004 16 , encoded as C802 16 ) into bits 2-17 of the program counter register and clears bit 1 of the program counter register
  • the return address is 0909A 16 , which is the address of the current instruction (JAL), 09098 16 , plus two, the number of bytes in the current lnstruction Bits 2-17 of the return address are saved in register R15.
  • execution control is passed to the instruction at 19004 16 .
  • the JUMP instruction 58 which is of the form "JUMP dest,” provides an illustration of the restore mechanism. Execution contmues at the encoded destination address held in the dest register by restoring the encoded address from the dest register into the program counter register and cleanng the least sigmficant bit of the program counter register.
  • Fig.4D shows an example of the JUMP instruction. Referring to the jump r3 instruction shown in Fig 4D, this example loads the destination address, held in encoded form in register R3. into bits 2-17 of the program counter register, cleanng bit 1 of the program counter register Program execution then continues at the destination address.
  • the RETX instruction 60 provides an example of the restore mechanism.
  • the RETX instruction returns control from a trap service procedure by continuing execution continues at the return address held in encoded form at the top of the interrupt stack
  • the encoded return address is popped from the interrupt stack and loaded it into the 16 most significant bits of the program counter register The least significant 1 bit of the program counter register are cleared Execution continues at the return address.
  • Fig. 4E shows an example of the RETX instruction Refernng to the retx instruction shown in Fig 4E, this example loads the return address (19004 16 , encoded as C802 16 ), from its encoded form on the interrupt stack into bits 2-17 of the program counter register, cleanng bit 1 of the program counter register Program execution then continues at 19004 16 , the return address
  • two general registers within the register file 31 are concatenated to form a po inter to a datum in memory from the concatenated contents of the two registers That is.
  • RO holds 16 bits (i.e N bits, bits 1,
  • R1 holds 2 bits (i. e. y bits, bits N + 1 , N+y)
  • a programmer writes bits 1, 2,.... N (i e N bits) of the 18 bit (i.e N+y bit) address to R0 and bits N+ 1, N+y (i.e. y bits) of the 18 bit (i.e N+y bit) address to R1.
  • the 16 bits (i.e N bits) from RO are put onto the least significant 16 bits of the ADDRESS bus 34
  • the 2 bits (i.e y bits) from R1 are put onto the most significant 2 bits of the ADDRESS bus 34 That is.
  • Fig 7 illustrates the format or an instruction (LOAD/STORE) which utilizes an 18 bit (i e N+y bit) data reference Bit 14 of the instruction indicates whether the instruction is a "Load or a "Store instruction Bits 5-8 indicate a register which holds the 16 bit (i e N bit) portion of the base address of a pointer to a data reference the next consecutively numbered register holds the 2 bit (i. e. y bit) portion of the base address of the pointer to the data reference Bits 9-10 and 16-31 hold the 18 bits (i. e. N+y bits) of the displacement of the pointer
  • the complete address of the pointer is formed by adding the base address to the displacement.
  • Fig.8 shows a preferred embodiment of the present invention register concatenation embodiment of Fig 6
  • the registers R0 and R1 are each 16 bits (i. e. N bits)
  • register R0 actually has 18 bits, the two most significant bits are inaccessible to the programmer directly
  • the 16 bit (i. e. N bit) portion of the address is written to R0 (i. e. the least significant 16 bits of R0) via the write bus 50.
  • the 2 bit i. e.
  • R0 contains the complete 18 bit (i. e. N+y bit) pointer value
  • the 18-bit pointer value is read directly out of the 18-b ⁇ t register R0 on an 18- bit read bus 52. There is no need to access R1 at all, nor is there need to combine the contents of multiple registers to form the 18-bit pointer value.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Executing Machine-Instructions (AREA)

Abstract

A processor core for providing a linear extension of addressable memory space of a microprocessor with minimal additional hardware and software complexity. An N+x bit pointer register (e.g. program counter) holds an N+x bit instruction address. The N+x bit instruction address provides to an execution unit a pointer to an instruction in the memory to be processed by the execution unit. An encoder encodes the N+x bit address into an N bit encoding of the N+x bit address. The processor core can thereby address 2x times more memory locations than 2N. Two other registers each hold a portion of a data address (i.e. a pointer to a datum in memory to be operated on). An address former concatenates the portions of the address in the two registers to form the data address. Therefore, the address is formed from portions of the data address stored in multiple registers without performing any arithmetic on the portions.

Description

PROCESSOR CORE WHICH PROVIDES A LINEAR EXTENSION OF
AN ADDRESSABLE MEMORY SPACE
COPYRIGHT NOTICE
A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner National Semiconductor Corporation has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever. TECHNICAL FIELD OF THE INVENTION
The present invention relates to microprocessor architectures and, in particular to a processor core which provides a linear extension of an addressable memory space.
BACKGROUND OF THE INVENTION
With microprocessor architectures, the number of memory space locations which a particular processor can address is limited by a number of factors. One limiting factor is the number of bits available (typically 16 bits) within the processor s internal registers where memory space addresses are stored. Since the processor's stack is used for temporary storage of memory space location addresses the bit-width (typically 16 bits) of the processor s stack is also a limiting factor. Another limiting factor is the width of the processor s Arithmetic Logic Unit ("ALU"), which calculates memory space addresses. Where storage of addresses is to be in 16 bits, the processor has a limited addressable linear memory space of 2**16 = 256k locations.
A first prior an attempt to overcome these limiting factors is known as the Harvard Architecture, which employs separate code and data memory spaces, effectively doubling the number of addressable memory locations. This architecture has the disadvantage that it makes inefficient use of memory.
Available memory, once allocated to either data or code, can not be reallocated. Furthermore, a particular data structure is limited in size to the amount of memory actually allocated to data, and no more.
A second prior art attempt to overcome the limiting factors makes use of software overlays, where code is stored on a secondary storage device, such as a disk drive, during execution of a program, and pieces of the program code are retrieved, as they are needed, from the secondary storage device into the memory. This method does not extend the linearity of the memory space. An architecture which employs software overlays also has the disadvantage of requiring a secondary storage device. Furthermore, the use of overlays adds significant program execution overhead to both manage the overlays and to retrieve the pieces of program code from the external storage device
A third prior art attempt to overcome the limiting factors employs memory segmentation. For example, the Intel 8086 microprocessor, manufactured by Intel Corporation of Santa Clara, California, employs memory segmentation. The Intel 8086 has 16-bit segment registers which provide base addresses for addressing memory segments. For each memory access, a 20-bit address is calculated by adding the contents of the appropriate segment register, multiplied by 16, to an offset specified in the instruction or in another register. The memory segmentation approach has the disadvantage that it increases both software and hardware complexity. It also has the disadvantage that data structures are limited in size and/or must be located near the beginning of a segment, or additional software complexity is required to traverse segment boundaries.
Furthermore, the use of more than one register, where it is necessary to perform arithmetic on the contents of the registers to form an address, is relatively slow.
A fourth prior art attempt to overcome the limiting factors uses bank switching In one implementation of bank switching, bits from a microprocessor s output port are used to enable separate memory banks or are decoded as the most significant address bits. A problem with the use of bank switching to overcome memory space limiting factors is that bank switching adds significant hardware and software complexity. The bank switching implementations also limit the size of a particular data structure to the size of a memory bank. SUMMARY OF THE INVENTION
The present invention is a processor core which provides a linear extension of an addressable memory space of a microprocessor with minimal addmonal hardware and software complexity.
In accordance with a first embodiment of the present invention, an N + x bit pomter register (e.g. a program counter) means of the processor core holds an N + x bit address. The N + x bit address is a pomter to an information entry in the memory which is to be processed by an execution unit of the processor core. An encoder encodes the N + x bit address into an N bit encoding of die N + x bit address.
As a result of the address encoding, the limiting factors of processor core general register width and stack width are overcome, such that the processor core can address 2X times more memory space locations than 2N.
In accordance with a further embodiment of the present invention, a processor core comprises at least two register means. Each register means is for holding a portion of an address, where the address is used to form a pointer to a datum in memory to be operated on. An address forming means concatenates the portions of the address in the register means to form the address. Therefore, the address is formed from portions of the address stored in multiple registers without performing any arithmetic on the portions.
A better understanding of the features and advantages of the invention will be obtained by reference to the following detailed description and accompanying drawings which set forth an illustrative embodiment in which the principles of the invention are utilized.
BRIEF DESCRIPTION OF THE FIGURES
Fig. 1 is a block diagram lllustrating a processor core embodying the present invention.
Fig. 2 illustrates the program counter register of the processor core of Fig 1.
Fig 3 is a table that provides sample assembly language instructions utilizable in accordance with the present invention.
Figs 4A-4E illustrate an embodiment of the present invention in the context of the sample assembly language lnstructions in Fig. 3.
Fig. 5 illustrates the format for encoding a BAL instruction to be executed by the processor core of Fig. 1.
Fig 6 illustrates a pair of registers for holding a pomter to a datum in memory.
Fig. 7 illustrates the format for encoding a LOAD/STORE instruction to be executed by the processor core of Fig 1.
Fig 8 illustrates the general purpose registers of the register file of the processor core of Fig. 1.
DETAILED DESCRIPTION OF THE INVENTION
Fig. 1 shows a processor core 28 in accordance with the present invention, interfaced to an external memory 30. Many of the elements shown in Fig. 1 are conventional microprocessor components arranged in a convenuonal manner. Those conventional components are not described in detail since both their structure and opera tion are well-known to those skilled in the art.
Linear Extension of Instruction Space
The processor core 28 includes an 18 bit address bus 34, which allows addressing of 218 = 256k memory locations. The processor core 28 further includes a 17 bit (i.e. N+x bit, where N is 16 and x is 1) program counter register 32, which contams a 17 bit (i.e. N+x bit) address in the memory 30 of the next instruction to be executed. To fetch an instruction to be executed from the memory 30, the contents of the program counter register 32 are padded with a constant zero as a most significant bit and placed onto the 18 bit address bus 34, and the instruction is loaded into an instruction loader 36 via a 16 bit data bus 35.
The program counter register 32 is shown in detail in Fig 2. The N bit portion of the register is bits 2-17, with bit 2 being the least significant bit of the N bit portion and bit 17 being the most significant bit of the N bit pomon. The x bit portion of the register is bit 1.
When the to be executed in struction is a jump/branch instruction, the arithmetic logic unit ("ALU") 38 computes, in N + x bit (i.e 17 bit) precision, the target address at which instruction execution is to continue and writes the target address to the program counter register 32 via a MUX 42 and a RESULT bus 44. In cases of executing jump/branch instructions with linkage (i.e for function or subroutine calls), a shifter 40 encodes a copy of the 17 bit (i. e N + x bit) address of the next sequenual instruction (i.e the "return address", which is the contents of the program counter) into a 16 bit (i. e N bit) encoded address by shifting off the x bit portion of the N + x bit address such that the N bit portion remains. The N bit (i. e. 16 bit) encoded return address is saved to an N bit (i. e .16 bit) wide stack space, which may be. for example, either in the external memory 30 or internal to the processor core 28 Altern atively, the encoded return address may be saved to one of the general registers in the register file 38, which are also N bits (i.e. 16 bits) wide.
As compared to a conventional processor core which has a 16 bit (i.e. N bit) program counter register and a 16 bit (i.e. N bit) address bus, the processor core 28 in accordance with the present invention can address 2 times (i.e 2X times) as many memory locations (i.e 2X nmes 2N) -but the components of the processor core 28 which are used to store addresses (i.e general purpose registers in the register file 31, stack areas in memory 30 or m the register file 31, etc) need to be only 16 bits (i.e. N bits).
By aligning all instructions stored in the memory 30 on addresses divisible by 2 (i.e. 2X), it can be ensured that the address of any instruction to be executed by the processor core 28 has least significant 1 bit (i.e. x bits) of zero. As another, hypothetical, example, in a byte-addressable architecture, the lnstructions can be aligned on double-word boundaries. Thus, in such an operation of the present invention, whenever a copy of a return address is saved, the address is encoded by saving only the most significant 16 bits (i.e. N bits) of the 18 bit (i.e. N+x bit) address That is. since the least significant 2 bits (i.e x bits) of the 18 bit (i.e. N +x bit) address are always zero, the least significant 2 bits (i.e x bits) need not be represented in the encoded address.
In a complementary fashion, when the 17 bit (i.e N + x bit) program counter register 32 contents is restored from a 16 bit (i.e. N bit) encoded representation, the instruction address written to the program counter register 32 is shifted to the left, resulting in a least sig nificant 1 bit (i.e. x bits) of zero.
Alternatively, instructions can be aligned on addresses that, although not divisible by 2X, have identical remainders when divided by 2X, thus ensuring that the address of any instruction to be executed has identical least significant x bits. In this case, also, a return address is encoded by saving only the most sig nificant N bit of the N+x bit address. However, when the N + x bit program counter register 32 contents is restored from an N bit encoded representation, the return address written to the program counter register 32 is assumed to have least sigmficant bits equal to the aforementioned identical remainders.
With reference to Figs. 4A-4E, the instruction address store/restore mechanism, in accordance with the above-described preferred embodiment, is illustrated in the context of several jump/branch with linkage instructions, shown in assembly language form in Fig. 3. The foregoing illustrations assume that the instruction stored in the memory 30 are all aligned on address boundaries divisible by 2 (i.e. by 2X) such that the address of any instriction to be executed by the processor core 28 has least significant 1 bit (i.e. x bits) of zero.
First, a branch and link instruction 50, which is of the form "BAL link, dest." provides an illustration of the address store mechanism. The 17 bit (i.e. N+x bit) return address is stored, from the ALU 38, in encoded form, into the 16 bit (i.e. N bit) register specified by the link field in the instruc tion 50. The displacement to the instruction at which program execution is to continue may be up to 17 bits (i.e. N+x bits). Fig. 5 shows the encoding of a BAL instruction. As shown in Fig. 5, only the most sig nificant 16 bits (i.e. N bits) of the displacement are encoded directly into the lnstr uction. The actual displacement is obtained by shifting the encoded 16 bit (i.e. N bit) portion of the displacement left by 1 bit (i.e. x bits). The address at which program execu tion is to continue is obtained by adding the actual displacement to the address contained in the program counter register.
Fig. 4A illustrates an execution of the BAL instruction. Referring to the "bal r15, L" instruction 50 shown in Fig. 4A, the return address is 2909C16, which is the address of the current (bal) instruction, 2909816. plus four, the number of bytes in the current instruction. Bits 2-17 of the return address (i.e. the most sig nificant N bits) are saved in the 16 bit (i.e. N bit) link register R15. and execunon control is passed to the dest instruction, labelled L, by adding 00F6C16, which is the offset to L from the address of the BAL instruction 50, to the current PC.
An EXCP instruction 52. which is of the form "EXCP vector." also provides an illustration of the address store mechanism. Before the EXCP instruction activates the trap specified by the vector operand, the return address, in encoded form, is pushed onto the 16 bit (i.e. N bit) wide interrupt stack. The Jcond instruction 54 which is of the form " Jcond dest," provides an lllustration of the address restore mechanism A two character condition code that describes a flag or flags in a processor status register whose state is to be tested is specified in the cond field of the instruction 54 If the condition specified bv the cond field is true, program execution continues at a destination address specified in encoded form in the 16 bit (i e N bit) dest register by resto ring the encoded address from the dest register into the most significant 16 bits (i e N bits) of the program counter register. The least significant bit (i. e. x bits) of the program counter register are cleared to zero .
Fig 4B illustrates an execution of the Jcond instruction Refernng to the "jlo r3" (jump if lower than) instruction shown in Fig 4B. which checks the state of the processor status register flags PSR.Z and PSR. L. If the processor status register flags PSR Z and PSR. L are zero, this instruction restores the destination address held in R3 (1900416, encoded as OC80216) into bits 2-17 of the program counter register, clearing bit 1 of the program counter register Program execution then continues at the destination address.
The JAL instruction 56 which is of the form "JAL link, dest.' provides illustrations of both the store and restore mechanisms Program execution continues at the destination address encoded in the 16-bit dest register, by restoring the encoded desstination address from the dest register into the program counter register and cleanng the least significant bit of the program counter register The return address is stored in encoded form in the link register.
Fig.4C illustrates an example of the JAL instruction Referring to the "jal r15, r3" instruction shown in Fig 4C, this example loads the destination address held in register R3 (1900416, encoded as C80216) into bits 2-17 of the program counter register and clears bit 1 of the program counter register The return address is 0909A16, which is the address of the current instruction (JAL), 0909816, plus two, the number of bytes in the current lnstruction Bits 2-17 of the return address are saved in register R15. Finally, execution control is passed to the instruction at 1900416.
The JUMP instruction 58, which is of the form "JUMP dest," provides an illustration of the restore mechanism. Execution contmues at the encoded destination address held in the dest register by restoring the encoded address from the dest register into the program counter register and cleanng the least sigmficant bit of the program counter register.
Fig.4D shows an example of the JUMP instruction. Referring to the jump r3 instruction shown in Fig 4D, this example loads the destination address, held in encoded form in register R3. into bits 2-17 of the program counter register, cleanng bit 1 of the program counter register Program execution then continues at the destination address.
The RETX instruction 60 provides an example of the restore mechanism. The RETX instruction returns control from a trap service procedure by continuing execution continues at the return address held in encoded form at the top of the interrupt stack When the RETX instruction is executed, the encoded return address is popped from the interrupt stack and loaded it into the 16 most significant bits of the program counter register The least significant 1 bit of the program counter register are cleared Execution continues at the return address.
Fig. 4E shows an example of the RETX instruction Refernng to the retx instruction shown in Fig 4E, this example loads the return address (1900416, encoded as C80216), from its encoded form on the interrupt stack into bits 2-17 of the program counter register, cleanng bit 1 of the program counter register Program execution then continues at 1900416, the return address
Linear Extension of Data Space
In accordance with a further embodiment of the present invention, two general registers within the register file 31 (two are shown in Fig 6, R0 and R1) are concatenated to form a po inter to a datum in memory from the concatenated contents of the two registers That is. RO holds 16 bits (i.e N bits, bits 1,
2 N) and R1 holds 2 bits (i. e. y bits, bits N + 1 , N+y) To form an 18 bit (i.e N+y bit) pointer to a data item, a programmer writes bits 1, 2,.... N (i e N bits) of the 18 bit (i.e N+y bit) address to R0 and bits N+ 1, N+y (i.e. y bits) of the 18 bit (i.e N+y bit) address to R1. Then, to form an 18 bit address, the 16 bits (i.e N bits) from RO are put onto the least significant 16 bits of the ADDRESS bus 34, and the 2 bits (i.e y bits) from R1 are put onto the most significant 2 bits of the ADDRESS bus 34 That is. the 18 bit address is formed by concatenating the contents of R0 and R1 Fig 7 illustrates the format or an instruction (LOAD/STORE) which utilizes an 18 bit (i e N+y bit) data reference Bit 14 of the instruction indicates whether the instruction is a "Load or a "Store instruction Bits 5-8 indicate a register which holds the 16 bit (i e N bit) portion of the base address of a pointer to a data reference the next consecutively numbered register holds the 2 bit (i. e. y bit) portion of the base address of the pointer to the data reference Bits 9-10 and 16-31 hold the 18 bits (i. e. N+y bits) of the displacement of the pointer The complete address of the pointer is formed by adding the base address to the displacement.
Fig.8 shows a preferred embodiment of the present invention register concatenation embodiment of Fig 6 In accordance with the preferred embodiment, while to a programmer it appears that the registers R0 and R1 are each 16 bits (i. e. N bits), register R0 actually has 18 bits, the two most significant bits are inaccessible to the programmer directly As with the previously described embodiment, the 16 bit (i. e. N bit) portion of the address is written to R0 (i. e. the least significant 16 bits of R0) via the write bus 50. When the 2 bit (i. e. y bit) portion of the address is written into R1 via the write bus 50, the 2 bits are also written into the most sigmficant 2 bits of R0 via a 2-bit portion 54 of the write bus 50 Thus. R0 contains the complete 18 bit (i. e. N+y bit) pointer value
When an indirect memory reference is made by the programmer designating the R0/R1 pair as containing the pointer value, the 18-bit pointer value is read directly out of the 18-bιt register R0 on an 18- bit read bus 52. There is no need to access R1 at all, nor is there need to combine the contents of multiple registers to form the 18-bit pointer value.
It should be understood that various alternatives to the embodiments of the invention described herein may be employed in practicing the invention It is intended that the following claims define the scope of the invention and that methods and apparatus within the scope of these claims and their equivalents be covered thereby .

Claims

WHAT IS CLAIMED IS
1. A processor core that retrieves information from a memory and processes the information.
comprising:
(a) an execution unit that processes information:
(b) an N + x bit pointer register means for holding one of a plurality of N + x addresses. wherein said N + x bit address provides to the execution unit a pointer to an information entry in the memory to be processed by the execution unit: and
(c) an encoding means for encoding said N + x bit address into an N bit encoding of said N +x bit address
whereby the processor core can address 2X times more memory locations than 2N.
2. A processor core as in claim 1, further comprising:
(d) a linkage register means for holding said N bit encoding of said N +x bit address.
3. A processor core as in claim 2, further comprising:
(e) a decoding means for decoding said N + x bit address from said N bit encoding of said N + x bit address.
4. A processor core that processes information retrieved from a memory, comprising:
(a) an execution unit diat processes information:
(b) a decoding means for decoding an N bit encoding of one of a plurality of N + x bit addresses into said N + x bit address;
(c) an N + x bit pointer register means for holding said N + x bit addresses wherein said N + x bit address provides to the execution unit a pointer to an information entry in the memory to be processed by the execution unit
whereby the processor core can address 2x times more memory locations than 2N.
5. A processor core that executes lnstrυctions retrieved from a memory, comprising:
(a) an execution unit that executes instructions,
(b) an N + x bit program counter register for holding one of a plurality of N + x bit addresses, wherein said N + x bit address provides to said execution unit a pointer to an instruction in the memory to be executed by the execution unit, and
(c) an encoding means for encoding said N + x bit address into an N bit encoding of said N + x bit address
whereby the processor core can address 2X times more memory locations man 2N.
6. A processor core as in claim 5, further comprising:
(d) a linkage register means for holding said N bit encoding of said N + x bit address.
7. A processor core as in claim 6, further comprising:
(e) a decoding means for decoding said N+x bit address from said N bit encoding of said N+x bit address.
8. A processor core as in claim 7, wherein said N bit encod ing of said N+x bit address is the N most significant bits of said N+x bit address.
9. A processor core as in claim 8 wherein the least significant x bits of said N-bit address is a priori determined to be zero.
10. A processor core that executes instructions retrieved from a memory comprising :
(a) an execution unit that processes lnstructions:
(b) a decoding means for decoding an N bit encoding of one of a plurality of N+x bit addresses into said N+x bit address; (c) an N + x bit pointer register means for holding said N + x bit address wherein said N+x bit address provides to die execution unit a pointer to an instruction in the memory to be retrieved from memory and executed by the execution unit
whereby the processor core can address 2X times more memory locations than 2N.
11. A method of retrieving information from a memory to a processor core, to be processed by an execution unit of the processor core, comprising:
(a) holding one of a plurality of N + x bit addresses in an N+x bit pointer register, wherein said N + x bit address provides to the execution unit a pointer to an information entry in the memory to be processed by die execution unit; and
(b) encoding said N + x bit address into an N bit encoding of said N+x bit address
whereby the processor core can address 2x times more memory locations than 2N.
12. The method of claim 11, further comprising:
(c) holding said N bit encoding of said N+x bit address.
13. The method of claim 12, further comprising:
(d) decoding said N + x bit address from said N bit encoding of said N+x bit address.
14. A method of retrieving information from a memory to a processor core, to be processed by an execution unit of the processor core, compnsing:
(a) decoding an N bit encoding of one of a plurality of N+x bit addresses into said N+x bit address, wherein said N + x bit address provides to the execution unit a pointer to an information entry in the memory to be processed by the execution unit, and
(b) holdmg said N+x bit address
whereby die processor core can address 2X times more memory locations than 2N.
15. A processor core that operates on data retrieved from a memory, comprising:
a) at least two register means, each register means for holding a portion of an address, the address being used to form a pointer to a datum in memory to be operated on; and
b) an address forming means for concatenating the portions of the address m the register means to form the address
whereby the address is formed from portions of the address stored tn muluple registers without performing any arithmetic on the portions.
16. A processor core as in claim 15, further comprising :
c) wnting means for writing the portions of the address to the register means.
17. A processor core that operates on data retrieved from a memory, comprising:
a) a first register means for holding an N bit portion of an N+y bit address, the N+y bit address being used to form a pointer to a datum in memory to be operated on;
b) a second register means for holding a y bit portion of the N+y bit address; and
c) a writing means for writing the N bit portion of the N+y bit address to the first register means and for writing the y bit portion of the N+y bit address to the second register means; and
d) an address forming means for concatenating the N bit portion in the first register means and the y bit portion in the second register means to form the N+y bit address
whereby the N+y bit address is formed from portions of the N+y bit address stored in multiple registers widiout performing any arithmetic on the portions and die processor core can address 2y times more memory locations than 2N .
18 A processor core that operates on data retrieved from a memory, comprising:
a) a first register means for holding an N+y bit address, the address being used to form a pointer to a datum in memory to be operated on; b) a second register means for holding a copy of a y-bit portion of the address; and
c) a writing means for writing an N bit portion. other than the y bit portion, of the address to the first register means, for writing the y bit portion to the second register means, and for also writing the y bit portion to the first register means
whereby the complete address can be accessed from the first register means and the processor core can address 2y times more memory locations than 2N.
PCT/US1995/004668 1994-05-25 1995-04-14 Processor core which provides a linear extension of an addressable memory space WO1995032467A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP95916414A EP0711435A1 (en) 1994-05-25 1995-04-14 Processor core which provides a linear extension of an addressable memory space
KR1019960700389A KR960704269A (en) 1994-05-25 1995-04-14 PROCESSOR CORE WHICH PROVIDES A LINEAR EXTENSTION OF AN ADDRESSABLE MEMORY SPACE Provides Linear Expansion of Addressable Memory Space

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US24876894A 1994-05-25 1994-05-25
US08/248,768 1994-05-25

Publications (1)

Publication Number Publication Date
WO1995032467A1 true WO1995032467A1 (en) 1995-11-30

Family

ID=22940602

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1995/004668 WO1995032467A1 (en) 1994-05-25 1995-04-14 Processor core which provides a linear extension of an addressable memory space

Country Status (3)

Country Link
EP (1) EP0711435A1 (en)
KR (1) KR960704269A (en)
WO (1) WO1995032467A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3602889A (en) * 1969-02-05 1971-08-31 Honeywell Inc Extended addressing for programmed data processor having improved register loading means
US4090237A (en) * 1976-09-03 1978-05-16 Bell Telephone Laboratories, Incorporated Processor circuit
EP0510635A2 (en) * 1991-04-25 1992-10-28 Kabushiki Kaisha Toshiba Address generation in a data processing unit

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3602889A (en) * 1969-02-05 1971-08-31 Honeywell Inc Extended addressing for programmed data processor having improved register loading means
US4090237A (en) * 1976-09-03 1978-05-16 Bell Telephone Laboratories, Incorporated Processor circuit
EP0510635A2 (en) * 1991-04-25 1992-10-28 Kabushiki Kaisha Toshiba Address generation in a data processing unit

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
"128k-byte processor instruction address space for each level status block", IBM TECHNICAL DISCLOSURE BULLETIN, vol. 30, no. 7, ARMONK,US, pages 237 *
"Copy Carry to Bank Address Register Instruction", IBM TECHNICAL DISCLOSURE BULLETIN, vol. 28, no. 12, NEW YORK, US, pages 5370 - 5371 *
DEUTSCH ET AL.: "Dual-mode language programming", IBM TECHNICAL DISCLOSURE BULLETIN, vol. 11, no. 6, ARMONK,US, pages 609 - 610 *
PARSONS: "Flexible storage addressing", IBM TECHNICAL DISCLOSURE BULLETIN, vol. 6, no. 12, ARMONK,US, pages 48 - 49 *

Also Published As

Publication number Publication date
EP0711435A1 (en) 1996-05-15
KR960704269A (en) 1996-08-31

Similar Documents

Publication Publication Date Title
USRE43248E1 (en) Interoperability with multiple instruction sets
US5638525A (en) Processor capable of executing programs that contain RISC and CISC instructions
EP0213843A2 (en) Digital processor control
EP0730226B1 (en) Hardware support for fast software emulation of unimplemented instructions
US6412066B2 (en) Microprocessor employing branch instruction to set compression mode
EP0138419B1 (en) Central processing unit for a digital computer
WO2001025900A1 (en) Risc processor using register codes for expanded instruction set
US7865699B2 (en) Method and apparatus to extend the number of instruction bits in processors with fixed length instructions, in a manner compatible with existing code
US7546442B1 (en) Fixed length memory to memory arithmetic and architecture for direct memory access using fixed length instructions
EP0729605B1 (en) Processing system with word-aligned branch target
EP0772819B1 (en) Apparatus and method for efficiently determining addresses for misaligned data stored in memory
EP1680735B1 (en) Apparatus and method that accomodate multiple instruction sets and multiple decode modes
EP0984358B1 (en) Data processing apparatus
US6978358B2 (en) Executing stack-based instructions within a data processing apparatus arranged to apply operations to data items stored in registers
EP0908812B1 (en) Processor architecture scheme for implementing various addressing modes and method therefor
US5566308A (en) Processor core which provides a linear extension of an addressable memory space
US6499099B1 (en) Central processing unit method and apparatus for extending general instructions with extension data of an extension register
US6055623A (en) Specialized millicode instruction for editing functions
US6067617A (en) Specialized millicode instructions for packed decimal division
US4893235A (en) Central processing unit for a digital computer
US5915266A (en) Processor core which provides a linear extension of an addressable memory space
EP1019829B1 (en) Method and apparatus for transferring data between a register stack and a memory resource
EP0206653A2 (en) Method and means for loading and storing data in a reduced instruction set computer
WO1995032467A1 (en) Processor core which provides a linear extension of an addressable memory space
KR100322725B1 (en) Millicode flags with specialized update and branch instruction

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): DE KR

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH DE DK ES FR GB GR IE IT LU MC NL PT SE

WWE Wipo information: entry into national phase

Ref document number: 1995916414

Country of ref document: EP

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWP Wipo information: published in national office

Ref document number: 1995916414

Country of ref document: EP

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

WWW Wipo information: withdrawn in national office

Ref document number: 1995916414

Country of ref document: EP