US20170126249A1 - Temperature dependent multiple mode error correction - Google Patents

Temperature dependent multiple mode error correction Download PDF

Info

Publication number
US20170126249A1
US20170126249A1 US14/929,163 US201514929163A US2017126249A1 US 20170126249 A1 US20170126249 A1 US 20170126249A1 US 201514929163 A US201514929163 A US 201514929163A US 2017126249 A1 US2017126249 A1 US 2017126249A1
Authority
US
United States
Prior art keywords
error correction
array
level
memory cells
mode
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US14/929,163
Inventor
Wei Wu
Charles Augustine
Shigeki Tomishima
Shih-Lien L. Lu
James W. Tschanz
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Intel Corp
Original Assignee
Intel Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Intel Corp filed Critical Intel Corp
Priority to US14/929,163 priority Critical patent/US20170126249A1/en
Assigned to INTEL CORPORATION reassignment INTEL CORPORATION ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: AUGUSTINE, CHARLES, LU, SHIH-LIEN L., TOMISHIMA, SHIGEKI, TSCHANZ, JAMES W., WU, WEI
Priority to PCT/US2016/055033 priority patent/WO2017074648A1/en
Priority to CN201680057316.1A priority patent/CN108463807B/en
Priority to EP16860481.7A priority patent/EP3368984B1/en
Publication of US20170126249A1 publication Critical patent/US20170126249A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/08Error detection or correction by redundancy in data representation, e.g. by using checking codes
    • G06F11/10Adding special bits or symbols to the coded information, e.g. parity check, casting out 9's or 11's
    • G06F11/1008Adding special bits or symbols to the coded information, e.g. parity check, casting out 9's or 11's in individual solid state devices
    • G06F11/1048Adding special bits or symbols to the coded information, e.g. parity check, casting out 9's or 11's in individual solid state devices using arrangements adapted for a specific error detection or correction feature
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M13/00Coding, decoding or code conversion, for error detection or error correction; Coding theory basic assumptions; Coding bounds; Error probability evaluation methods; Channel models; Simulation or testing of codes
    • H03M13/03Error detection or forward error correction by redundancy in data representation, i.e. code words containing more digits than the source words
    • H03M13/05Error detection or forward error correction by redundancy in data representation, i.e. code words containing more digits than the source words using block codes, i.e. a predetermined number of check bits joined to a predetermined number of information bits
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M13/00Coding, decoding or code conversion, for error detection or error correction; Coding theory basic assumptions; Coding bounds; Error probability evaluation methods; Channel models; Simulation or testing of codes
    • H03M13/35Unequal or adaptive error protection, e.g. by providing a different level of protection according to significance of source information or by adapting the coding according to the change of transmission channel characteristics
    • H03M13/353Adaptation to the channel
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M13/00Coding, decoding or code conversion, for error detection or error correction; Coding theory basic assumptions; Coding bounds; Error probability evaluation methods; Channel models; Simulation or testing of codes
    • H03M13/61Aspects and characteristics of methods and arrangements for error correction or error detection, not provided for otherwise
    • H03M13/611Specific encoding aspects, e.g. encoding by means of decoding
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M13/00Coding, decoding or code conversion, for error detection or error correction; Coding theory basic assumptions; Coding bounds; Error probability evaluation methods; Channel models; Simulation or testing of codes
    • H03M13/65Purpose and implementation aspects
    • H03M13/6502Reduction of hardware complexity or efficient processing
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M13/00Coding, decoding or code conversion, for error detection or error correction; Coding theory basic assumptions; Coding bounds; Error probability evaluation methods; Channel models; Simulation or testing of codes
    • H03M13/65Purpose and implementation aspects
    • H03M13/6508Flexibility, adaptability, parametrability and configurability of the implementation
    • H03M13/6516Support of multiple code parameters, e.g. generalized Reed-Solomon decoder for a variety of generator polynomials or Galois fields
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M13/00Coding, decoding or code conversion, for error detection or error correction; Coding theory basic assumptions; Coding bounds; Error probability evaluation methods; Channel models; Simulation or testing of codes
    • H03M13/03Error detection or forward error correction by redundancy in data representation, i.e. code words containing more digits than the source words
    • H03M13/05Error detection or forward error correction by redundancy in data representation, i.e. code words containing more digits than the source words using block codes, i.e. a predetermined number of check bits joined to a predetermined number of information bits
    • H03M13/13Linear codes
    • H03M13/15Cyclic codes, i.e. cyclic shifts of codewords produce other codewords, e.g. codes defined by a generator polynomial, Bose-Chaudhuri-Hocquenghem [BCH] codes
    • H03M13/151Cyclic codes, i.e. cyclic shifts of codewords produce other codewords, e.g. codes defined by a generator polynomial, Bose-Chaudhuri-Hocquenghem [BCH] codes using error location or error correction polynomials
    • H03M13/1515Reed-Solomon codes

Definitions

  • Certain embodiments of the present description relate generally to devices such as memory employing error correction.
  • errors may be encountered such that the data read from memory may not match the original data stored in memory.
  • errors may be introduced such that the data stored in memory may not match the original data intended to be written.
  • data may become corrupted while stored in a memory.
  • a memory controller may have Error Correction (or Correcting) Code (ECC) encoder logic for purposes of encoding the data before it is written into the memory cells of the memory.
  • ECC Error Correction
  • Such encoding frequently includes adding redundant data such as check bits, for example, to the original write data.
  • the encoded data when subsequently read from the memory, may be decoded by a decoder of the ECC logic of the memory controller to restore the original write data if errors are detected.
  • the ECC decoder logic can frequently detect errors which may have occurred in transmission to or from the memory or while the data was stored in memory, and correcting them if not too severe.
  • the more redundant data added to the write data by the ECC encoder logic the more robust the error detection and correcting capabilities of the ECC decoder logic of the memory controller. For example, by adding additional check bits to the encoded data, the number of errors which may be detected and corrected may be increased.
  • a memory typically has an associated Raw Bit Error Rate (RBER).
  • RBER Raw Bit Error Rate
  • the error correction scheme selected for a memory has typically been devised to provide an error detection and correction capability at a sufficiently high level to bring the error rate down to an acceptable level. For example, in some applications, an error rate of less than one error per 100,000 bits (often expressed as “1E-6”) may be acceptable.
  • ECC techniques for detecting and correcting data errors including for example, block codes such as Reed-Solomon error correction codes, for example, which process data on a block-by-block basis, and convolutional codes, for example, which process data on a bit-by-bit basis.
  • block codes such as Reed-Solomon error correction codes, for example, which process data on a block-by-block basis
  • convolutional codes for example, which process data on a bit-by-bit basis.
  • FIG. 1 depicts a high-level block diagram illustrating selected aspects of a system, employing temperature dependent, multiple mode error correction in accordance with an embodiment of the present disclosure.
  • FIGS. 2 a , 2 b depict examples of basic architectures of a memory and memory controller, employing temperature dependent, multiple mode error correction in accordance with an embodiment of the present disclosure.
  • FIGS. 3 a -3 e provide examples of operations for temperature dependent, multiple mode error correction in accordance with an embodiment of the present disclosure.
  • FIGS. 4 a -4 c illustrate in greater detail an example of an architecture of a memory and memory controller, employing temperature dependent, multiple mode error correction in accordance with an embodiment of the present disclosure.
  • FIGS. 5 a -5 c illustrate examples of various formats of data storage in a memory employing temperature dependent, multiple mode error correction in accordance with an embodiment of the present disclosure.
  • Temperature dependent, multiple mode error correction in accordance with one aspect of this disclosure, is employed for a memory circuit containing arrays of memory cells. In one aspect, it is believed that temperature dependent, multiple mode error correction in accordance with the present disclosure can reduce power consumption of the memory in one or more modes of operation. It is appreciated that other aspects and advantages may be achieved, depending upon the particular application.
  • a temperature sensor coupled to an array is configured to provide an output signal which is a function of the temperature of the array of memory cells.
  • Multiple mode error correction code (ECC) logic having an input coupled to an output of the temperature sensor, is configured to encode write data for the array of memory cells in an error correction code in one of a plurality of error correction modes as a function of the temperature of the array of memory cells.
  • ECC encoder logic in accordance with one embodiment, includes a first mode ECC encoder logic and a second mode ECC encoder logic.
  • the first mode ECC encoder logic that is, a high temperature mode (HTM) ECC encoder logic in one example, encodes write data at a first ECC level which is a relatively high ECC level to provide a relatively high degree of error detection and correction for higher temperature operating levels of the memory at which higher RBER is expected.
  • the HTM encoder logic encodes write data in a relatively long ECC code word encoding to achieve the relatively high degree of error detection and correction for higher temperature operating levels of the memory.
  • An example of a relatively high degree of error detection and correction is a double detection, double correction scheme capable of both detecting and correcting double errors in encoded data.
  • Other examples of suitable high temperature error correction include five bit error correction.
  • Still other examples include seven bit error correction which is believed to be able to lower the error rate to as low as 1E-9 at 100 degrees Celsius. It is appreciated that the level of error correction in the various temperature modes may vary, depending upon the particular application.
  • the second mode ECC encoder logic that is, a low temperature mode (LTM) ECC encoder logic in one example, encodes write data at a second ECC level which is a relatively low ECC level to provide a relatively lower degree of error detection and correction for lower level operating temperatures of the memory at which lower RBER is expected.
  • the LTM ECC encoder logic encodes write data in a relatively short ECC code word encoding to achieve the relatively lower degree of error detection and correction for lower level operating temperatures of the memory.
  • An example of a relatively low degree of error detection and correction is a double detection, single correction scheme capable of detecting double errors but correcting only single errors in encoded data. It is believed that power consumption of the memory may be reduced when operating in a lower level ECC correction mode such as the LTM mode, for example at which lower RBER is expected.
  • ECC decoder logic in accordance with one embodiment, includes a first mode ECC decoder logic and a second mode ECC decoder logic.
  • the first mode ECC decoder logic that is, a high temperature mode (HTM) ECC decoder logic in one example, is configured to decode read data from the memory which was encoded in a relatively long ECC code word encoding in the relatively high ECC level to provide a relatively high degree of error detection and correction for higher temperature operating levels of the memory at which higher RBER is expected.
  • HTM high temperature mode
  • the second mode ECC decoder logic that is, a low temperature mode (LTM) ECC decoder logic in one example, is configured to decode read data from the memory which was encoded in a relatively short ECC code word encoding in the relatively low ECC level to provide a relatively lower degree of error detection and correction for lower temperature operating levels of the memory at which higher RBER is expected. It is believed that power consumption may be reduced at lower operating temperatures by utilizing the LTM ECC decoder logic to decode read data encoded in the relatively short ECC code word.
  • LTM low temperature mode
  • the ECC encoder logic is configured to set an encoder status flag to indicate whether the write data associated with the encoder status flag was encoded in a relatively long ECC code word encoding in the relatively high ECC level or was encoded in a relatively short ECC word encoding at the second error correction level lower than the first error correction level.
  • the encoder status flag may be used, in one embodiment, to facilitate a transition from one mode to another such as, for example, from the low temperature mode (LTM) to the high temperature mode (HTM).
  • the temperature dependent, multiple mode ECC logic may be further configured to enter the first high temperature ECC mode in response to a temperature sensor output signal indicating that the temperature of the array of memory cells has risen to a temperature level that exceeds a threshold level, and to detect whether the encoder status flag has been set for a line of write data. If so, the multiple mode ECC logic may re-encode write data at the first error correction level in a relatively long ECC code word encoding in response to a detection of the encoder status flag being set for the associated write data, indicating that the write data had previously been encoded at the second error correction level in a relatively short ECC code word encoding.
  • write data may be encoded and read data decoded at the first error correction level in the high temperature (HTM) ECC mode at which higher RBER is expected.
  • the ECC encoder is logic is configured to set a decoder status flag to indicate whether the read data associated with the decoder status flag may be decoded using the relatively long ECC code word decoding in the relatively high ECC level or may be decoded using the relatively short ECC code word decoding at the second error correction level lower than the first error correction level.
  • the decoder status flag may be used, in one embodiment, to facilitate a transition from one mode to another such as, for example, from the high temperature mode (HTM) to the low temperature mode (LTM).
  • the multiple mode ECC logic may be further configured to enter the second error correction mode (LTM) in response to the temperature sensor output signal indicating that the temperature level of the array of memory cells has fallen below a threshold level, and to detect whether the decoder status flag has been set for read data to indicate that the read data associated with the decoder status flag may be decoded using the relatively short ECC code word decoding in the relatively low ECC level. If so, the multiple mode ECC logic may read and decode write data at the second error correction level in response to a detection of the decode status flag being set for the associated read data to indicate that the read data may be decoded at the second error correction level in the LTM ECC mode.
  • LTM second error correction mode
  • the multiple mode ECC logic detects that the decoder status flag has been set to indicate that the read data associated with the decoder status flag may be decoded using the relatively long ECC code word decoding in the relatively high ECC level
  • the multiple mode ECC logic reads and decodes the write data at the first error correction level in response to a detection of the decode status flag being set for the associated read data to indicate that the read data may not be decoded at the second error correction level, and resets the decode status flag to indicate that the read data may subsequently be decoded at the second error correction level.
  • temperature dependent, multiple mode error correction may be applied to embodiments having more than two temperature modes.
  • a low temperature mode may have a relatively low level of error correction for low temperature operation, such as double error detection and single error correction, for example.
  • An intermediate temperature mode may have a relatively higher level of error correction, such as five bit error correction for an intermediate temperature range such as 60-100 degrees C.
  • a third and higher temperature mode may have a still higher level of error correction such as seven bit error correction, for example, for a still higher temperature range in excess of 100 degrees Celsius, for example.
  • Encoding and decoding operations are conducted in accordance with the applicable level of the error correction capability of the associated temperature dependent mode. It is appreciated that a greater or fewer number of temperature modes may be utilized in temperature dependent, multiple mode error correction in accordance with the present description, depending upon the particular application.
  • temperature dependent, multiple mode error correction in accordance with one aspect of this disclosure may be applied to systems employing a variety of types of memory devices including non-volatile memory such as spin torque transfer (STT) Random Access Memory (RAM), three dimensional (3D) crosspoint, phase change memory, magnetic RAM, a resistive memory, nanowire memory, ferro-electric transistor random access memory (FeTRAM), flash memory such as NAND or NOR, and volatile memory such as 2D RAM, for example.
  • non-volatile memory such as spin torque transfer (STT) Random Access Memory (RAM), three dimensional (3D) crosspoint, phase change memory, magnetic RAM, a resistive memory, nanowire memory, ferro-electric transistor random access memory (FeTRAM), flash memory such as NAND or NOR, and volatile memory such as 2D RAM, for example.
  • STT spin torque transfer
  • 3D three dimensional crosspoint
  • phase change memory magnetic RAM
  • resistive memory nanowire memory
  • FeTRAM ferro-electric transistor random access memory
  • flash memory such as NAND or
  • Temperature dependent, multiple mode error correction in accordance with embodiments described herein may be used either in stand-alone memory controllers and memory circuits or logic arrays, or can be embedded in microprocessors, digital signal processors (DSPs) or other circuits transmitting or receiving data.
  • DSPs digital signal processors
  • systems and processes are described herein primarily with reference to microprocessor based systems in the illustrative examples, it will be appreciated that in view of the disclosure herein, certain aspects, architectures, and principles of the disclosure are equally applicable to other types of devices, memory and logic devices.
  • FIG. 1 is a high-level block diagram illustrating selected aspects of a system implementing temperature dependent, multiple mode error correction according to an embodiment of the present disclosure.
  • System 10 may represent any of a number of electronic and/or computing devices, that may include a memory device.
  • Such electronic and/or computing devices may include computing devices such as a mainframe, server, personal computer, workstation, telephony device, network appliance, virtualization device, storage controller, portable or mobile devices (e.g., laptops, netbooks, tablet computers, personal digital assistant (PDAs), portable media players, portable gaming devices, digital cameras, mobile phones, smartphones, feature phones, etc.) or component (e.g. system on a chip, processor, bridge, memory controller, memory, etc.).
  • PDAs personal digital assistant
  • component e.g. system on a chip, processor, bridge, memory controller, memory, etc.
  • system 10 may include more elements, fewer elements, and/or different elements.
  • system 10 may be depicted as comprising separate elements, it will be appreciated that such elements may be integrated on to one platform, such as systems on a chip (SoCs).
  • SoCs systems on a chip
  • system 10 comprises a microprocessor 20 , a memory controller 30 , a memory 40 and peripheral components 50 which may include, for example, video controller, input device, output device, storage, network adapter, etc.
  • the microprocessor 20 includes a cache 25 that may be part of a memory hierarchy to store instructions and data, and the system memory 40 may also be part of the memory hierarchy. Communication between the microprocessor 20 and the memory 40 may be facilitated by the memory controller (or chipset) 30 , which may also facilitate in communicating with the peripheral components 50 .
  • Storage of the peripheral components 50 may be, for example, non-volatile storage, such as STT RAM, solid-state drives, magnetic disk drives, optical disk drives, a tape drive, flash memory, etc.
  • the storage may comprise an internal storage device or an attached or network accessible storage.
  • the microprocessor 20 is configured to write data in and read data from the memory 40 . Programs in the storage are loaded into the memory and executed by the processor.
  • a network controller or adapter enables communication with a network, such as an Ethernet, a Fiber Channel Arbitrated Loop, etc.
  • the architecture may, in certain embodiments, include a video controller configured to render information on a display monitor, where the video controller may be embodied on a video card or integrated on integrated circuit components mounted on a motherboard or other substrate.
  • An input device is used to provide user input to the processor, and may include a keyboard, mouse, pen-stylus, microphone, touch sensitive display screen, input pins, sockets, or any other activation or input mechanism known in the art.
  • An output device is capable of rendering information transmitted from the processor, or other component, such as a display monitor, printer, storage, output pins, sockets, etc.
  • the network adapter may embodied on a network card, such as a Peripheral Component Interconnect (PCI) card, PCI-express, or some other I/O card, or on integrated circuit components mounted on a motherboard or other substrate.
  • PCI Peripheral Component Interconnect
  • One or more of the components of the device 10 may be omitted, depending upon the particular application.
  • a network router may lack a video controller, for example.
  • Any one or more of the memory devices 25 , 40 , and the other devices 10 , 30 , 50 may include temperature dependent, multiple mode error correction in accordance with the present description.
  • FIG. 2 a shows an example of an array 60 of rows and columns of bitcells or memory cells 64 of an STT RAM memory circuit 40 in accordance with one embodiment of the present description.
  • the memory circuit may be a volatile memory such as a DRAM memory, for example. It is appreciated that the memory circuit 40 may be other types of memory as well.
  • the memory 40 is a cache memory such as the cache 25 ( FIG. 1 ). It is appreciated that temperature dependent, multiple mode error correction in accordance with the present description may be applied to other types of memory, depending upon the particular application.
  • Each memory cell 64 is capable of storing a bit of data representing a logical one or logical zero value.
  • the memory controller 30 performs read operations and write operations to the memory cells 64 .
  • the STT RAM memory circuit 40 may also include a row decoder, a timer device and I/O devices (or I/O outputs). Bits of the same memory word may be separated from each other for efficient I/O design.
  • the array 60 of memory cells 64 may be subdivided, either logically or physically into a main array 60 a and one or more subarrays 60 b of memory cells as shown in FIG. 2 b.
  • a multiplexer may be used to connect each column to the required circuitry during a READ operation.
  • Another MUX may be used to connect each column to a write driver during a WRITE operation.
  • the memory of this embodiment includes multiple mode error correction code (ECC) logic 70 which may be disposed on the same integrated circuit as that of the array 60 or may be disposed on other integrated circuits such as the memory controller 40 or may be distributed over more than one integrated circuit.
  • ECC error correction code
  • a temperature sensor 80 is disposed on the same integrated circuit as the array 60 to sense the temperature of the array 60 of memory cells.
  • the temperature sensor 80 may be disposed on other integrated circuit but positioned in a suitable manner to be thermally coupled to the array 60 to facilitate sensing the temperature of the array 60 .
  • the multiple mode ECC logic 70 is configured to perform described operations using appropriate hardware, software or firmware, or various combinations thereof.
  • FIG. 3 a is directed to mode switching operations of temperature dependent, multiple mode error correction logic such as the multiple mode ECC logic 70 depicted in greater detail in FIG. 4 a .
  • the ECC logic is disposed on an integrated circuit die 400 on which the memory controller 40 is disposed. It is appreciated that in other embodiments, the ECC logic 70 may be disposed on an integrated circuit die 401 of the memory circuit 40 or on both dies 400 , 401 or on other dies, depending upon the particular application.
  • an ECC mode controller logic such as the ECC mode controller logic 402 ( FIG. 4 a ) of the multiple mode ECC logic 70 determines (block 302 , FIG. 3 a ) the current operating mode of the multiple mode ECC logic 70 .
  • the ECC mode controller logic 402 maintains an ECC mode flag 406 ( FIG. 4 a ) which indicates whether the multiple mode ECC logic 70 is operating in a high temperature mode (HTM) or a low temperature mode (LTM). If the ECC mode flag 406 indicates that the multiple mode ECC logic 70 is operating in a low temperature mode (LTM), for example, a determination (block 306 , FIG.
  • the ECC mode controller logic 402 has an input coupled to an output of the memory temperature sensor 80 which is thermally coupled to the memory array 60 to monitor the temperature of the memory.
  • the temperature threshold of the determination 306 may be a predetermined value or may be defined by a user input, for example.
  • One such temperature threshold may be a value of 60 degrees Celsius, for example, or may be within a range of 55-65 degrees, for example. It is appreciated that the particular value of the temperature threshold of the determine 306 may vary, depending upon the particular application.
  • the ECC mode controller logic 402 ( FIG. 4 a ) of the multiple mode ECC logic 70 continues to operate in the low temperature ECC mode and to monitor the memory temperature and compare it to a threshold. If it is determined (block 306 ) that the temperature of the memory array 60 being monitored has exceeded the threshold, the ECC mode controller logic 402 ( FIG. 4 a ) of the multiple mode ECC logic 70 switches (block 310 ) the mode of operation of the multiple mode ECC logic 70 to the high temperature ECC mode and sets the ECC mode flag 406 to indicate that the multiple mode ECC logic 70 is operating in a high temperature mode (HTM).
  • HTM high temperature mode
  • write data to be stored in the memory array is encoded in a relatively long ECC code word encoding to provide a relatively high ECC level for a more robust error detection and correction capability at the higher temperatures in which the RBER may increase.
  • a determination is made as to whether the temperature of the memory array being monitored, such as the memory array 60 , has dropped below a threshold.
  • the temperature threshold of the determination 314 may be the same as the threshold of the determination of block 306 .
  • the threshold of the determination of block 314 may be displaced somewhat from the threshold of the determination of block 306 . Such a displacement such as 5 degrees Celsius, for example, between thresholds may, in some applications, reduce mode switching due to small fluctuations in the temperature of the memory array.
  • a threshold for a high temperature mode to low temperature mode transition is set for 60 degrees Celsius, for example, a threshold for a low temperature mode to high temperature mode transition may be set at a displaced value such as 55 degrees Celsius, for example.
  • some processing tasks may be undertaken to prepare for a transition to another mode.
  • a displacement in thresholds may facilitate completing such preparations prior to reaching temperatures which may significantly increase RBER, for example.
  • the temperature threshold of the determination of block 314 may be a predetermined value or may be defined by a user input, for example. It is appreciated that the particular value of the temperature threshold of the determine 314 may vary, depending upon the particular application.
  • the ECC mode controller logic 402 ( FIG. 4 a ) of the multiple mode ECC logic 70 continues to operate in the high temperature ECC mode and to monitor the memory temperature and compare it to a threshold. If it is determined (block 314 ) that the temperature of the memory array 60 being monitored has dropped below the threshold, the ECC mode controller logic 402 ( FIG. 4 a ) of the multiple mode ECC logic 70 switches (block 318 ) the mode of operation of the multiple mode ECC logic 70 to the low temperature ECC mode and sets the ECC mode flag 406 to indicate that the multiple mode ECC logic 70 is operating in a low temperature mode (LTM).
  • LTM low temperature mode
  • write data to be stored in the memory array is encoded in a relatively short ECC code word encoding to provide a relatively low ECC level which provides a somewhat less robust error detection and correction capability at the lower temperatures in which the RBER may be smaller. As a consequence, power consumption may decrease accordingly.
  • FIG. 3 b is directed to operations of the multiple mode ECC logic 70 in a high temperature ECC mode.
  • the ECC mode controller logic 402 ( FIG. 4 a ) of the multiple mode ECC logic 70 sets (block 326 ) the ECC mode flag 406 to indicate that the multiple mode ECC logic 70 will be operating in the high temperature ECC mode.
  • preparations (block 330 ) may be undertaken for the high temperature ECC mode. As explained in greater detail below in connection with FIG.
  • such preparations may include scanning the memory array 60 to determine if there are lines of memory which were previously encoded in accordance with a low temperature ECC mode prior to the multiple mode ECC logic entering the high temperature ECC mode. If so, those lines of memory which were encoded in accordance with the low temperature ECC mode may be re-encoded in accordance with the high temperature mode so as to better withstand the higher RBER expected in high temperature operation of the memory.
  • memory I/O operations may be processed in a high temperature ECC mode.
  • a memory controller such as the memory controller 30 ( FIG. 4 a ) may receive write data to be written to a memory such as the memory circuit 40 .
  • the write data is encoded (block 338 ) in a relatively long ECC code word encoding by high temperature mode (HTM) ECC encoder logic 410 ( FIG. 4 a ) of an ECC Encoder Logic 412 of the multiple mode ECC Logic 70 .
  • HTM high temperature mode
  • ECC encoder logic 410 FIG. 4 a
  • ECC Encoder Logic 412 of the multiple mode ECC Logic 70 .
  • Encoding the write data in a relatively long ECC code word encoding provides a relatively high ECC level of error detection and correction capability to compensate for the relatively higher RBER expected during high temperature memory operations.
  • a suitable error correction code is a Reed-Solomon error correction code which is a block code which encodes blocks of data in symbols to facilitate error detection and correction. It is appreciated that other types of error correction codes may be employed in multiple mode error correction in accordance with the present description. For example, it is believed that both block and convolutional codes may be suitable as a long code for multiple mode error correction in accordance with the present description. Also, it is believed that both systematic and nonsystematic codes may be suitable as a long code for multiple mode error correction in accordance with the present description. In a systematic scheme, an encoder attaches to the original write data a number of check bits (such as parity data, for example), which are derived from the data bits by an appropriate deterministic algorithm.
  • check bits such as parity data, for example
  • the original message is transformed into an encoded message that typically has at least as many bits as the original message.
  • Additional examples of codes which may be suitable as long codes are concatenated codes, recursive codes, non-recursive codes, repetition codes, Hamming codes, multidimensional parity-check codes, turbo codes, low-density parity-check codes (LDPC), etc.
  • the high temperature mode (HTM) ECC encoder logic 410 is configured to encode write data for the memory circuit in a long word error correction code (such as a Reed-Solomon error correction code, for example) encoding of the temperature dependent, multiple mode error correction in accordance with one embodiment of the present description.
  • the high temperature mode (HTM) ECC encoder logic 410 encodes the write data in a long error correction code word encoding which adds redundant data such as check bits for example.
  • redundant check bit data which may be added for purposes of error detection and correction include parity bits, repetition codes, checksums and cyclic redundancy checks (CRCs).
  • the write data encoded with the long error correction code word encoding may be stored in a memory such as the memory array 60 ( FIG. 5 a ).
  • the write data encoded in the long error correction code word encoding may subsequently be decoded and checked for errors. In many cases, depending upon the severity of the detected errors, the decoded data may be corrected and restored to its original error-free form before it was written to the memory.
  • the memory array 60 is divided either logically or physically, into a main array 60 a and a sub-array 60 b such that each line of data 500 a , 500 b , . . . 500 n stored in the memory array 60 may, depending upon the ECC mode of operation, span over both the main array 60 a and the sub-array 60 b such as in the high temperature error correction mode, or may be limited to the main array 60 a such as in the low temperature error correction mode. Accordingly by not utilizing the sub-array 60 b in a mode such as the low temperature error correction mode, to store a line of ECC encoded data, energy savings may be realized, in some applications.
  • the main array 60 a includes a field 502 in which the bits of the original write data of each line of encoded data may be stored.
  • the field 502 of a cache memory may be 512 bits long, for example. It is appreciated that in other embodiments, the field 502 may be longer or shorter, depending upon the particular application.
  • ECC data field 504 stores the error correction bits of the ECC encoding for each line of data stored in the memory array 60 .
  • the ECC data field 504 spans over both the main array 60 a and the sub-array 60 b and includes a subfield 504 a which likewise, spans over both the main array 60 a and the sub-array 60 b , and is utilized to store the check bits of the long ECC word encoding in the high temperature error correction mode.
  • the long ECC word check bits field 504 a for the long ECC word encoding occupies the entire ECC data field 504 of each line of data encoded with the long ECC word encoding.
  • the size of the long ECC word encoding may vary, depending upon the particular application.
  • the long ECC word check bits field 504 a in turn includes a first subfield 504 a 1 utilized to store a first portion of the check bits of the ECC encoding in the high temperature error correction mode, and a second subfield 504 a 2 utilized to store the remaining check bits of the ECC encoding in the high temperature error correction mode.
  • the check bits of the subfield 504 a 1 may be utilized as both the check bits of the short ECC word encoding in the low temperature mode, and may also be used as the first portion of the check bits of the long ECC word encoding in the high temperature mode.
  • the check bits of the field 504 a 1 are referred to as the short word ECC check bits but may be utilized as both the short word ECC check bits and also as the first portion of the long word ECC check bits.
  • the field 504 a 1 of a cache memory may be 10 bits long, for example. It is appreciated that in other embodiments, the field 504 a 1 may be longer or shorter, depending upon the particular application.
  • the check bits of the subfield 504 a 2 are the second portion of the check bits of the long word ECC encoding in the high temperature mode and thus the check bits of the field 504 a 2 are referred to herein as the long word extension check bits or ECC extension data.
  • the field 504 a 2 of a cache memory may be 61 bits long, for example. It is appreciated that in other embodiments, the field 504 a 2 may be longer or shorter, depending upon the particular application.
  • the check bits of the long ECC word encoding of the high temperature mode includes both the short word ECC check bits of the subfield 504 a 1 and the long word extension ECC check bits of the subfield 504 a 2 , of the long word ECC check bits subfield 504 a .
  • the check bits of the short ECC word encoding of the low temperature mode may include just the short word ECC check bits of the subfield 504 a 1 of the ECC check bits field 504 . Accordingly, in the low temperature error correction mode, the active check bits are stored in the main subarray 60 a and do not extend into the subarray 60 b , that is are not stored in the subarray 60 b .
  • the check bits stored in the subarray 60 b are inactive and the subarray 60 b may be permitted to be dormant in which operation is suspended or otherwise operated at reduced or no power, to provide for reduced power consumption of the memory array 60 in the low temperature mode.
  • the LTM encoder and decoder may have reduced complexity as compared the counterpart HTM encoder and decoder, such that power consumption by the LTM encoder and decoder may be reduced as compared to that of the HTM encoder and decoder, in some applications.
  • the write data encoded with the long error correction code word encoding may subsequently be read, decoded and checked for errors.
  • a memory controller such as the memory controller 30 ( FIG. 4 a ) may perform memory read operations in the high temperature error correction mode.
  • a memory controller such as the memory controller 30 ( FIG. 2 ) causes data previously encoded in a relatively long ECC code word encoding in the relatively high ECC level of the high temperature ECC mode, to be read from a memory such as the memory circuit 40 .
  • the long ECC code word encoded read data including the long ECC word check bits stored in the field 504 a is read and decoded (block 342 ) by a high temperature mode (HTM) ECC decoder logic 416 ( FIG. 4 a ) of an ECC decoder logic 420 of the multiple mode ECC logic 70 .
  • HTM high temperature mode
  • the encoded read data is decoded from the high temperature ECC code encoding of the high temperature ECC encoder logic.
  • the high temperature, long word ECC code may be a Reed-Solomon error correction code, for example, but it is appreciated that other error correction codes may be employed in a high temperature, long word error correction code of multiple mode error correction in accordance with the present description.
  • the ECC decoder logic 416 is configured to decode read data from the memory circuit, which has been encoded in accordance with the high temperature, long word error correction code encoding. As shown in FIG. 4 a , ECC decoder logic 420 is disposed on a die which in this embodiment, is the memory controller die 400 . It is appreciated that the ECC decoder logic 420 of multiple mode error correction in accordance with the present description, may be distributed over other types of dies such as the dies of processors, I/O controllers, etc.
  • the encoded read data being processed by the high temperature mode (HTM) ECC decoder logic 416 may be checked (block 342 , FIG. 3 b ) by an error detection logic of the high temperature mode (HTM) ECC decoder logic, to determine if the decoded read data contains erroneous data.
  • Such erroneous data may have been caused by corruption of the encoded write data which may have occurred at various stages such as when the encoded write data was transmitted to the memory cell array 60 or while the encoded write data was being stored in the memory cell array 60 , or being read from the memory cell array 60 or processed by the ECC logic, for example.
  • the high temperature mode (HTM) ECC decoder logic 416 of FIG. 4 a includes a high temperature mode error detection logic 422 which utilizes the redundant data which was added by the high temperature, long word code encoding as described above, to detect whether the read data being decoded contains errors as indicated by the redundant data stored in the long word ECC check data field 504 a ( FIG. 5 a ) of the line of memory being read. If error detection logic 422 detects errors, the errors may be corrected (block 342 , FIG. 3 b ) by suitable correction logic such as the high temperature mode error correction logic 424 ( FIG. 4 b ) of the high temperature mode (HTM) ECC decoder logic 416 . Upon completion of the decoding by the high temperature mode (HTM) ECC decoder logic 416 , the encoded read data has been fully decoded (and corrected if errors detected) and may be forwarded to the entity requesting the read data.
  • HTM high temperature mode
  • the long word ECC code of one embodiment may be a Reed-Solomon code, for example.
  • Numerous algorithms may be suitable for decoding Reed-Solomon ECC codes. Examples may include Peterson decoders, Berlekamp-Massey decoders, Euclidean decoders, time domain decoders, frequency domain decoders, etc.
  • An outer ECC decoder logic may be implemented in one or more of hardware, software and firmware.
  • a determination may be made as to whether to continue in the high temperature ECC mode. Such a determination may be made as a function of the output of the memory temperature sensor 80 ( FIG. 4 a ). For example, if the output of the memory temperature sensor 80 indicates that the temperature of the memory remains above the threshold, the multiple mode error correction may remain in the high temperature mode and await the next I/O operation (block 334 ). Conversely, if the output of the memory temperature sensor 80 indicates that the temperature of the memory has fallen below the threshold, the multiple mode error correction may transition to the low temperature error correction mode described below in connection with FIG. 3 d.
  • FIG. 3 c is directed to one example of operations which may be undertaken in preparation (block 330 , FIG. 3 b ) for the high temperature ECC mode.
  • preparations may include scanning (block 350 , FIG. 3 c ) each line of the memory array 60 to determine if there are lines of memory which were previously encoded in accordance with a low temperature, short word ECC mode prior to the multiple mode ECC logic entering the high temperature ECC mode.
  • the memory array 60 is part of a cache memory. Accordingly, the main array 60 a includes a field 510 of tag data bits for each line of data stored in the cache memory.
  • the tag data field 510 includes a subfield 510 a to store bits used in connection with cache operations.
  • the tag data field 510 further includes, in this embodiment, subfields 510 b , 510 c to store ECC status bits in connection with error correction encoding and decoding operations of the associated line stored in the cache memory.
  • the status bits of the subfields may be as few as one bit each, depending upon the particular application.
  • status bits stored in the subfield 510 c may be indicate whether the associated line of data stored in the cache memory was encoded in a high temperature, long word ECC encoding, or was encoded in a low temperature, short word ECC encoding.
  • the multiple mode encoder logic 70 may read (block 350 , FIG. 3 c ) each line of the cache memory in turn and examine the associated encode status bit or bits of the encode status field 510 c to determined (block 352 , FIG. 3 c ) whether the associated line had been encoded in a low temperature, short word ECC encoding. If so, the high temperature ECC encoder logic 410 can re-encode (block 354 ) the line of data in a high temperature, long word ECC encoding to prepare that line of data for high temperature operations. For example, lines of cache memory which were encoded in accordance with the low temperature ECC mode may be re-encoded in accordance with the high temperature mode so as to better withstand the higher RBER expected in high temperature operation of the memory.
  • the associated encode status bit or bits of the encode status field 510 c may be set (block 356 ) to indicate that the associated line had been encoded in a high temperature, long word ECC encoding.
  • an associated decode status bit or bits of the decode status field 510 b may be set (block 356 ) to indicate that the associated line should be decoded in a high temperature, long word ECC decoding.
  • the scanning and re-encoding from low temperature, short word ECC encoding to high temperature, long word ECC encoding may be repeated for each line of cache memory as appropriate until all the lines of cache memory have been scanned and re-encoded as needed.
  • the multiple mode error correction logic may proceed to handle I/O operations (block 334 , FIG. 3 b ) as described above in accordance with the high temperature mode.
  • FIG. 3 d is directed to operations of the multiple mode ECC logic 70 in a low temperature ECC mode.
  • the ECC mode controller logic 402 ( FIG. 4 a ) of the multiple mode ECC logic 70 sets (block 362 ) the ECC mode flag 406 ( FIG. 4 a ) to indicate that the multiple mode ECC logic 70 will be operating in the low temperature ECC mode.
  • preparations (block 364 ) may optionally be undertaken for the low temperature ECC mode.
  • preparations for a low temperature mode may be bypassed in embodiments in which a portion of a high temperature, long word ECC encoding, may be utilized for a low temperature, short word encoding, for example.
  • the high temperature, long word encoding may be fully independent of the low temperature, short word encoding.
  • preparations for low temperature mode may include scanning the memory array 60 to determine if there are lines of memory which were previously encoded in accordance with a high temperature ECC mode prior to the multiple mode ECC logic entering the low temperature ECC mode. If so, those lines of memory which were encoded in accordance with the high temperature ECC mode may be re-encoded in accordance with the low temperature mode to accommodate the lower RBER expected in low temperature operation of the memory.
  • memory I/O operations may be processed in a low temperature ECC mode.
  • a memory controller such as the memory controller 30 ( FIG. 4 a ) may receive write data to be written to a memory such as the memory circuit 40 .
  • the write data is encoded (block 368 ) by low temperature mode (LTM) ECC encoder logic 430 ( FIG. 4 a ) of an ECC Encoder Logic 412 of the multiple mode ECC Logic 70 .
  • LTM low temperature mode
  • the LTM ECC encoder logic 430 is configured to encode write data in a relatively short ECC code word encoding which provides a relatively low ECC level of error detection and correction capability in accordance with the relatively lower RBER expected during low temperature memory operations. Because the LTM ECC encoder logic 430 employs short word encoding, the complexity of the LTM ECC encoder logic 430 may be reduced as compared to that of the HTM ECC encoder logic 410 . As a consequence, the power consumption of the LTM ECC encoder logic 430 in the low temperature mode, may be reduced as compared to that of the HTM ECC encoder logic 410 in the high temperature mode.
  • a suitable low temperature, short word error correction code is a Reed-Solomon error correction code which is a block code which encodes blocks of data in symbols to facilitate error detection and correction. It is appreciated that other types of error correction codes may be employed in low temperature mode, multiple mode error correction in accordance with the present description. For example, it is believed that both block and convolutional codes may be suitable as a short word code for multiple mode error correction in accordance with the present description. Also, it is believed that both systematic and nonsystematic codes may be suitable as a short word code for multiple mode error correction in accordance with the present description.
  • the low temperature mode (LTM) ECC encoder logic 430 is configured to encode write data for the memory circuit in a short word error correction code (such as a Reed-Solomon error correction code, for example) of the temperature dependent, multiple mode error correction in accordance with one embodiment of the present description.
  • a short word error correction code such as a Reed-Solomon error correction code, for example
  • the low temperature mode (LTM) ECC encoder logic 430 encodes the write data in a short error correction code word which adds redundant data such as check bits for example.
  • redundant check bit data which may be added for purposes of error detection and correction include parity bits, repetition codes, checksums and cyclic redundancy checks (CRCs).
  • the write data encoded with the check bits of the short error correction code word encoding may be stored (block 370 , FIG. 3 d ) in a memory such as the memory array 60 ( FIG. 5 a ).
  • the write data encoded in the short error correction code word encoding may subsequently be decoded and checked for errors. In many cases, depending upon the severity of the detected errors, the decoded data may be corrected and restored to its original error-free form before it was written to the memory.
  • the long ECC word check bits field 504 a includes a first subfield 504 a 1 utilized to store a first portion of the check bits of the ECC encoding in the high temperature error correction mode.
  • the check bits generated may be stored in the same field 504 a 1 which is designated the short word ECC data field 504 a 1 in FIG. 5 a .
  • check bits of the short ECC word encoding of the low temperature mode may include just the short word ECC check bits of the subfield 504 a 1 of the ECC check bits field 504 .
  • the subarray 60 b containing the field 504 a 2 utilized in high temperature, long word ECC encoding may be dormant or otherwise operated at reduced or no power, to provide for reduced power consumption of the memory array 60 a in the low temperature mode.
  • the encode status field 510 c for the line of data written to memory may be set (block 372 ) to indicate that the line of data was encoded in a low temperature, short word ECC encoding.
  • the decode status field 510 b may be set (block 372 ) for the line of data written to memory to indicate that the line of data is to be decoded in a low temperature, short word ECC decoding.
  • the write data encoded in a multiple mode ECC encoding may subsequently be read, decoded and checked for errors.
  • a memory controller such as the memory controller 30 ( FIG. 4 a ) may perform memory read operations in the low temperature error correction mode.
  • a memory controller such as the memory controller 30 ( FIG. 2 ) causes previously encoded data to be read and the decode status for that line of data checked (block 374 ).
  • the low temperature, short word encoded read data may be decoded (block 376 ) by a low temperature mode (LTM) ECC decoder logic 436 ( FIG. 4 a ) of the ECC decoder logic 420 of the multiple mode ECC logic 70 .
  • LTM low temperature mode
  • the low temperature, short word decoding may be a less robust error detection and correction level in keeping with the lower RBER expected in low temperature operations.
  • the complexity of the LTM ECC decoder logic 436 may be reduced as compared to that of the HTM ECC decoder logic 416 .
  • the power consumption of the LTM ECC decoder logic 436 in the low temperature mode may be reduced as compared to that of the HTM ECC decoder logic 416 in the high temperature mode.
  • the encoded read data is decoded from the low temperature, short word ECC encoding provided by the low temperature ECC encoder logic.
  • the low temperature, short word ECC code may be a Reed-Solomon error correction code, for example, but it is appreciated that other error correction codes may be employed in a low temperature, short word error correction code decoding of multiple mode error correction in accordance with the present description.
  • FIG. 4 a shows an embodiment of the memory controller die 400 of the memory controller 30 which includes the low temperature mode (LTM) ECC decoder logic 436 of an ECC decoder logic 420 of the multiple mode ECC logic 70 .
  • the ECC decoder logic 436 is configured to decode in accordance with the low temperature, short word error correction code, encoded read data from the memory circuit.
  • the encoded read data being processed by the low temperature mode (LTM) ECC decoder logic 436 may be checked (block 376 , FIG. 3 d ) by an error detection logic of the low temperature mode (LTM) ECC decoder logic, to determine if the decoded read data contains erroneous data.
  • Such erroneous data may have been caused by corruption of the encoded write data which may have occurred at various stages such as when the encoded write data was transmitted to the memory cell array 60 or while the encoded write data was being stored in the memory cell array 60 , or being read from the memory cell array 60 or processed by the ECC logic, for example.
  • the low temperature mode (LTM) ECC decoder logic 436 of FIG. 4 a includes a low temperature mode error detection logic 442 which utilizes the redundant data which was added by the low temperature, short word code encoding as described above, to detect whether the read data being decoded contains errors as indicated by the redundant data stored in the short word ECC data field 504 a 1 ( FIG. 5 a ) of the line of memory being read. If error detection logic 442 detects errors, the errors may be corrected (block 376 , FIG. 3 d ) by suitable correction logic such as the low temperature mode error correction logic 444 ( FIG. 4 c ) of the low temperature mode (LTM) ECC decoder logic 436 . Upon completion of the decoding by the low temperature mode (LTM) ECC decoder logic 436 , the encoded read data has been fully decoded (and corrected if errors detected) and may be forwarded to the entity requesting the read data.
  • LTM low temperature mode
  • the low temperature, short word ECC code of one embodiment may be a Reed-Solomon code, for example.
  • Numerous algorithms may be suitable for decoding Reed-Solomon ECC codes. Examples may include Peterson decoders, Berlekamp-Massey decoders, Euclidean decoders, time domain decoders, frequency domain decoders, etc.
  • An outer ECC decoder logic may be implemented in one or more of hardware, software and firmware.
  • a memory controller such as the memory controller 30 ( FIG. 2 ) causes previously encoded data to be read and the decode status for that line of data checked (block 374 ). If the decode status field 510 b of the line of data read from the memory indicates that the line of data is to be decoded from the high temperature, long word ECC encoding, the high temperature, long word encoded read data may be decoded (block 378 ) by the high temperature mode (HTM) ECC decoder logic 416 ( FIG. 4 a ) as described above.
  • HTM high temperature mode
  • data which was encoded for high temperature operations using high temperature, long word encoding is decoded using the high temperature, long word encoding to provide the more robust level of error detection and correction to compensate for a higher RBER which may have been encountered while the line of data was stored in the memory.
  • the line of data may be decoded using the low temperature, short word ECC encoding in keeping with the mode of operation of the multiple mode error correction being in the low temperature mode of operation.
  • the decode status is set (block 380 ) for the line of data read from memory to indicate that the line of data in subsequent readings is to be decoded in a low temperature, short word ECC decoding.
  • the check bits encoded with the high temperature, long word ECC encoding and stored in the short word ECC data field 504 a 1 may be utilized for the low temperature, short word decoding described above.
  • the check bits of the field 504 a 1 are common bits to both a low temperature, short word ECC encoding and a high temperature, long word encoding.
  • the check bits of the field 504 a 1 need not be re-encoded when transitioning from a high temperature mode of operations to a low temperature mode of operations.
  • 8,533,572 describes error correction for caches which can utilize a common set of check bits.
  • a re-encoding (block 382 ) of the data to a low temperature, short word encoding may be appropriate.
  • a determination may be made as to whether to continue in the low temperature ECC mode. Such a determination may be made as a function of the output of the memory temperature sensor 80 ( FIG. 4 a ). For example, if the output of the memory temperature sensor 80 indicates that the temperature of the memory remains below the threshold, the multiple mode error correction may remain in the low temperature mode and await the next I/O operation (block 366 ). Conversely, if the output of the memory temperature sensor 80 indicates that the temperature of the memory has risen above the threshold, the multiple mode error correction may transition to the high temperature error correction mode described above in connection with FIG. 3 b.
  • FIG. 5 b is directed to an embodiment which differs from the embodiment of FIG. 5 a in that the high temperature, long word ECC encoding does not share check bits with the low temperature, short word ECC encoding. Accordingly, the check bits stored in a short word ECC data subfield 504 b of the ECC data field 504 are independent of check bits stored in the long word ECC data field 504 a of the ECC data field 504 . As a result, check bits generated by high temperature, long word ECC encoding are not used as check bits in a low temperature mode. Instead, high temperature, long word ECC encodings are re-encoded to a low temperature, short word ECC encoding when transitioning from a high temperature mode to a low temperature mode.
  • FIG. 5 c is directed to an embodiment which differs from the embodiment of FIG. 5 a in that the high temperature, long word ECC encoding does not share check bits with the low temperature, short word ECC encoding.
  • the ECC correction of FIG. 5 c utilizes a non-systematic code, such that the original write data is transformed into an encoded message that typically has at least as many bits as the original message. Accordingly, in the embodiment of FIG. 5 c , in the high temperature, long word ECC encoding, the original write data and the redundant check data are transformed into an encoded message stored in a long word ECC encoded write data field 520 which spans both the main array 60 a and the sub-array 60 b .
  • the original write data and the redundant check data are transformed into a shorter encoded message stored in a short word ECC encoded write data field 524 within the main array 60 a and does not extend into sub-array 60 b.
  • the low temperature, short word encoding does not extend into the sub-array 60 b .
  • the sub-array 60 b may be dormant to reduce power consumption in each of the embodiments of FIG. 5 a - 5 c.
  • the high temperature encodings of the high temperature mode may be re-encoded to low temperature encodings when entering the low temperature mode from the high temperature mode.
  • FIG. 3 e is directed to one example of operations which may be undertaken in preparation (block 330 , FIG. 3 b ) for operating in the low temperature ECC mode.
  • preparations may include scanning (block 386 , FIG. 3 e ) each line of the memory array 60 .
  • the multiple mode encoder logic 70 FIG. 4 a
  • the low temperature ECC encoder logic 430 can re-encode (block 390 ) the line of data in a low temperature, short word ECC encoding to prepare that line of data for low temperature operations so as to take advantage of the lower RBER expected in low temperature operation of the memory.
  • the associated encode status bit or bits of the encode status field 510 c may be set (block 392 ) to indicate that the associated line had been encoded in a low temperature, short word ECC encoding.
  • associated decode status bit or bits of the decode status field 510 b may be set (block 392 ) to indicate that the associated line should be decoded in a low temperature, short word ECC decoding.
  • the scanning and re-encoding from high temperature, long word ECC encoding to low temperature, short word ECC encoding may be repeated for each line of cache memory as appropriate until all the lines of cache memory have been scanned and re-encoded. Once all the lines of cache memory have been scanned and re-encoded, the multiple mode error correction logic may proceed to handle I/O operations as described above in accordance with the low temperature mode.
  • temperature dependent, multiple mode error correction in accordance with the present description, may reduce power consumption in a low temperature mode.
  • Other aspects and advantages may be achieved, depending upon the particular application.
  • Example 1 is apparatus, comprising: an array of memory cells; a temperature sensor coupled to the array and having an output, said temperature sensor configured to provide an output signal at said output wherein the output signal is a function of the temperature of the array of memory cells; a memory controller electrically coupled to the array of cells of the memory circuit and configured to control the array of memory cells; and multiple mode error correction code logic having an input coupled to the temperature sensor output, said multiple mode error correction code logic configured to encode write data for the array of memory cells in an error correction code in one of a plurality of error correction modes as a function of the temperature of the array of memory cells.
  • Example 2 the subject matter of Examples 1-8 (excluding the present example) can optionally include wherein said plurality of error correction modes includes a first error correction mode at a first error correction level at a first memory cell array temperature level, and further includes a second error correction mode at a second error correction level lower than the first error correction level at a second memory cell array temperature level lower than the first memory cell array temperature level.
  • Example 3 the subject matter of Examples 1-8 (excluding the present example) can optionally include wherein the array of memory cells includes a subarray, and wherein the multiple mode error correction code logic is configured to encode write data for the array of memory cells in the first error correction mode at the first error correction level to provide an error correction code word having a first length, at least a portion of which is stored in the subarray of memory cells, and is configured to encode write data for the array of memory cells in the second error correction mode at the second error correction level to provide an error correction code word having a second length shorter than the first length and is not stored within the subarray, is configured to decode read data from the array of memory cells encoded in an error correction code word having the first length, and is configured to decode read data from the array of memory cells encoded in an error correction code word having a second length shorter than the first length.
  • the multiple mode error correction code logic is configured to encode write data for the array of memory cells in the first error correction mode at the first error correction level to provide an error correction code word having a first length
  • Example 4 the subject matter of Examples 1-8 (excluding the present example) can optionally include wherein said multiple mode error correction code logic is further configured to set an encode status flag to indicate whether the write data associated with the encode status flag was encoded at the first error correction level or at the second error correction level lower than the first error correction level, and wherein said multiple mode error correction code logic is further configured to enter the first error correction mode in response to the temperature sensor output signal indicating that the temperature of the array of memory cells has exceeded a threshold level, and to scan the array of memory cells to detect whether the encode status flag has been set for write data and to re-encode write data at the first error correction level in response to a detection of the encode status flag being set for the associated write data indicating that the associated write data had been encoded at the second error correction level.
  • Example 5 the subject matter of Examples 1-8 (excluding the present example) can optionally include wherein said multiple mode error correction code logic is further configured to reset an encode status flag for write data re-encoded at the first error correction level to indicate that the write data associated with the reset encode status flag was re-encoded at the first error correction level higher than the second error correction level.
  • Example 6 the subject matter of Examples 1-8 (excluding the present example) can optionally include wherein the multiple mode error correction code logic is further configured to set a decode status flag for read data to indicate the error correction level at which the associated read data may be decoded.
  • Example 7 the subject matter of Examples 1-8 (excluding the present example) can optionally include wherein said multiple mode error correction code logic is further configured to: enter the second error correction mode in response to the temperature sensor output signal indicating that the temperature of the array of memory cells has fallen below a threshold level; detect the decode status flag; read and decode data at the second error correction level in response to a detection of the decode status flag being set for the associated read data to indicate that the read data may be decoded at the second error correction level, and read and decode write data at the first error correction level in response to a detection of the decode status flag being set for the associated read data to indicate that the read data is to be decoded at the first error correction level, and reset the decode status flag to indicate that the read data may be decoded at the second error correction level.
  • Example 8 the subject matter of Examples 1-8 (excluding the present example) can optionally include wherein the array of memory cells is a spin-transfer-torque (STT) random-access-memory (RAM) cache.
  • STT spin-transfer-torque
  • RAM random-access-memory
  • Example 9 is directed to a computing system for use with a display, comprising: a memory wherein the memory includes a memory circuit having an array of memory cells and a memory controller configured to control the array of memory cells; a processor configured to write data in and read data from the memory; a video controller configured to display information represented by data in the memory; a temperature sensor coupled to the array and having an output, said temperature sensor configured to provide an output signal at said output wherein the output signal is a function of the temperature of the array of memory cells; and multiple mode error correction code logic having an input coupled to the temperature sensor output, said multiple mode error correction code logic configured to encode write data for the array of memory cells in an error correction code in one of a plurality of error correction modes as a function of the temperature of the array of memory cells.
  • Example 10 the subject matter of Examples 9-16 (excluding the present example) can optionally include wherein said plurality of error correction modes includes a first error correction mode at a first error correction level at a first memory cell array temperature level, and further includes a second error correction mode at a second error correction level lower than the first error correction level at a second memory cell array temperature level lower than the first memory cell array temperature level.
  • Example 11 the subject matter of Examples 9-16 (excluding the present example) can optionally include wherein the array of memory cells includes a subarray, and wherein the multiple mode error correction code logic is configured to encode write data for the array of memory cells in the first error correction mode at the first error correction level to provide an error correction code word having a first length, at least a portion of which is stored in the subarray of memory cells, and is configured to encode write data for the array of memory cells in the second error correction mode at the second error correction level to provide an error correction code word having a second length shorter than the first length and is not stored within the subarray, is configured to decode read data from the array of memory cells encoded in an error correction code word having the first length, and is configured to decode read data from the array of memory cells encoded in an error correction code word having a second length shorter than the first length.
  • Example 12 the subject matter of Examples 9-16 (excluding the present example) can optionally include wherein said multiple mode error correction code logic is further configured to set an encode status flag to indicate whether the write data associated with the encode status flag was encoded at the first error correction level or at the second error correction level lower than the first error correction level, and wherein said multiple mode error correction code logic is further configured to enter the first error correction mode in response to the temperature sensor output signal indicating that the temperature of the array of memory cells has exceeded a threshold level, and to scan the array of memory cells to detect whether the encode status flag has been set for write data and to re-encode write data at the first error correction level in response to a detection of the encode status flag being set for the associated write data indicating that the associated write data had been encoded at the second error correction level.
  • Example 13 the subject matter of Examples 9-16 (excluding the present example) can optionally include wherein said multiple mode error correction code logic is further configured to reset an encode status flag for write data re-encoded at the first error correction level to indicate that the write data associated with the reset encode status flag was re-encoded at the first error correction level higher than the second error correction level.
  • Example 14 the subject matter of Examples 9-16 (excluding the present example) can optionally include wherein the multiple mode error correction code logic is further configured to set a decode status flag for read data to indicate the error correction level at which the associated read data may be decoded.
  • Example 15 the subject matter of Examples 9-16 (excluding the present example) can optionally include wherein said multiple mode error correction code logic is further configured to: enter the second error correction mode in response to the temperature sensor output signal indicating that the temperature of the array of memory cells has fallen below a threshold level; detect the decode status flag; read and decode data at the second error correction level in response to a detection of the decode status flag being set for the associated read data to indicate that the read data may be decoded at the second error correction level, and read and decode write data at the first error correction level in response to a detection of the decode status flag being set for the associated read data to indicate that the read data is to be decoded at the first error correction level, and reset the decode status flag to indicate that the read data may be decoded at the second error correction level.
  • Example 16 the subject matter of Examples 9-16 (excluding the present example) can optionally include wherein the array of memory cells is a spin-transfer-torque (STT) random-access-memory (RAM) cache.
  • STT spin-transfer-torque
  • RAM random-access-memory
  • Example 17 is a method, comprising: sensing the temperature of an array of memory cells using a temperature sensor coupled to the array; and performing error correction processing of data using multiple mode error correction code logic, said processing including encoding write data for the array of memory cells in an error correction code in one of a plurality of error correction modes as a function of the temperature of the array of memory cells.
  • Example 18 the subject matter of Examples 17-24 (excluding the present example) can optionally include wherein said encoding write data for the array of memory cells in an error correction code in one of a plurality of error correction modes includes encoding write data in a first error correction mode at a first error correction level at a first memory cell array temperature level, and further includes encoding write data in a second error correction mode at a second error correction level lower than the first error correction level at a second memory cell array temperature level lower than the first memory cell array temperature level.
  • Example 19 the subject matter of Examples 17-24 (excluding the present example) can optionally include wherein the array of memory cells includes a subarray, and wherein the encoding write data in a first error correction mode includes providing an error correction code word having a first length, and storing at least a portion of the error correction code word having the first length in the subarray of memory cells, and encoding write data in a second error correction mode includes providing an error correction code word having a second length shorter than the first length, storing the error correction having the second length outside the subarray and suspending operation of the subarray, said processing further including decoding read data from the array of memory cells encoded in an error correction code word having the first length, and decoding read data from the array of memory cells encoded in an error correction code word having a second length shorter than the first length.
  • Example 20 the subject matter of Examples 17-24 (excluding the present example) can optionally include wherein said processing further includes setting an encode status flag to indicate whether the write data associated with the encode status flag was encoded at the first error correction level or at the second error correction level lower than the first error correction level, and further includes entering the first error correction mode in response to the temperature sensor indicating that the temperature of the array of memory cells has exceeded a threshold level, and scanning the array of memory cells to detect the encode status flag and re-encoding write data at the first error correction level in response to a detection of the encode status flag for the associated write data indicating that the associated write data had been encoded at the second error correction level.
  • Example 21 the subject matter of Examples 17-24 (excluding the present example) can optionally include wherein the processing further includes resetting an encode status flag for write data re-encoded at the first error correction level to indicate that the write data associated with the reset encode status flag was re-encoded at the first error correction level higher than the second error correction level.
  • Example 22 the subject matter of Examples 17-24 (excluding the present example) can optionally include wherein the processing further includes setting a decode status flag for read data to indicate the error correction level at which the associated read data may be decoded.
  • Example 23 the subject matter of Examples 17-24 (excluding the present example) can optionally include wherein the processing further includes: entering the second error correction mode in response to the temperature sensor indicating that the temperature of the array of memory cells has fallen below a threshold level; detecting the decode status flag; reading and decoding read data at the second error correction level in response to a detection of the decode status flag indicating that the read data may be decoded at the second error correction level, and reading and decoding read data at the first error correction level in response to a detection of the decode status flag indicating that the read data is to be decoded at the first error correction level, and resetting the decode status flag to indicate that the read data may be decoded at the second error correction level.
  • Example 24 the subject matter of Examples 17-24 (excluding the present example) can optionally include wherein the array of memory cells is a spin-transfer-torque (STT) random-access-memory (RAM) cache.
  • STT spin-transfer-torque
  • RAM random-access-memory
  • Example 26 is an apparatus comprising means to perform a method as claimed in any preceding example.
  • the described operations may be implemented as a method, apparatus or computer program product using standard programming and/or engineering techniques to produce software, firmware, hardware, or any combination thereof.
  • the described operations may be implemented as computer program code maintained in a “computer readable storage medium”, where a processor may read and execute the code from the computer storage readable medium.
  • the computer readable storage medium includes at least one of electronic circuitry, storage materials, inorganic materials, organic materials, biological materials, a casing, a housing, a coating, and hardware.
  • a computer readable storage medium may comprise, but is not limited to, a magnetic storage medium (e.g., hard disk drives, floppy disks, tape, etc.), optical storage (CD-ROMs, DVDs, optical disks, etc.), volatile and non-volatile memory devices (e.g., EEPROMs, ROMs, PROMs, RAMs, DRAMs, SRAMs, Flash Memory, firmware, programmable logic, etc.), Solid State Devices (SSD), etc.
  • the code implementing the described operations may further be implemented in hardware logic implemented in a hardware device (e.g., an integrated circuit chip, Programmable Gate Array (PGA), Application Specific Integrated Circuit (ASIC), etc.).
  • the code implementing the described operations may be implemented in “transmission signals”, where transmission signals may propagate through space or through a transmission media, such as an optical fiber, copper wire, etc.
  • the transmission signals in which the code or logic is encoded may further comprise a wireless signal, satellite transmission, radio waves, infrared signals, Bluetooth, etc.
  • the program code embedded on a computer readable storage medium may be transmitted as transmission signals from a transmitting station or computer to a receiving station or computer.
  • a computer readable storage medium is not comprised solely of transmissions signals.
  • a device in accordance with the present description may be embodied in a computer system including a video controller to render information to display on a monitor or other display coupled to the computer system, a device driver and a network controller, such as a computer system comprising a desktop, workstation, server, mainframe, laptop, handheld computer, etc.
  • the device embodiments may be embodied in a computing device that does not include, for example, a video controller, such as a switch, router, etc., or does not include a network controller, for example.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Probability & Statistics with Applications (AREA)
  • Quality & Reliability (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Techniques For Improving Reliability Of Storages (AREA)
  • Read Only Memory (AREA)

Abstract

In one embodiment, temperature dependent, multiple mode error correction in accordance with one aspect of this disclosure, is employed for a memory circuit containing arrays of memory cells. In one embodiment, a temperature sensor coupled to an array is configured to provide an output signal which is a function of the temperature of the array of memory cells. Multiple mode error correction code (ECC) logic having an input coupled to an output of the temperature sensor, is configured to encode write data and decode read data for the array of memory cells in an error correction code in one of a plurality of error correction modes as a function of the temperature of the array of memory cells. Other aspects are described herein.

Description

    TECHNICAL FIELD
  • Certain embodiments of the present description relate generally to devices such as memory employing error correction.
  • BACKGROUND
  • In the course of reading data from a memory, errors may be encountered such that the data read from memory may not match the original data stored in memory. Similarly, as data is written to the memory, errors may be introduced such that the data stored in memory may not match the original data intended to be written. Also, data may become corrupted while stored in a memory.
  • Various techniques are frequently employed for purposes of detecting such errors and correcting them if possible. For example, a memory controller may have Error Correction (or Correcting) Code (ECC) encoder logic for purposes of encoding the data before it is written into the memory cells of the memory. Such encoding frequently includes adding redundant data such as check bits, for example, to the original write data. The encoded data when subsequently read from the memory, may be decoded by a decoder of the ECC logic of the memory controller to restore the original write data if errors are detected. The ECC decoder logic can frequently detect errors which may have occurred in transmission to or from the memory or while the data was stored in memory, and correcting them if not too severe. Generally, the more redundant data added to the write data by the ECC encoder logic, the more robust the error detection and correcting capabilities of the ECC decoder logic of the memory controller. For example, by adding additional check bits to the encoded data, the number of errors which may be detected and corrected may be increased.
  • A memory typically has an associated Raw Bit Error Rate (RBER). Thus, the error correction scheme selected for a memory has typically been devised to provide an error detection and correction capability at a sufficiently high level to bring the error rate down to an acceptable level. For example, in some applications, an error rate of less than one error per 100,000 bits (often expressed as “1E-6”) may be acceptable.
  • There are various known ECC techniques for detecting and correcting data errors including for example, block codes such as Reed-Solomon error correction codes, for example, which process data on a block-by-block basis, and convolutional codes, for example, which process data on a bit-by-bit basis. In some applications such as deep space transmission of data between Earth and space probes, and compact disk recording and playback devices, for example, it is known to encode data twice using two different ECC schemes which are concatenated together as an outer ECC scheme and an inner ECC scheme, to operate in sequence on the data. By concatenating the ECC techniques, error detection and correction can be enhanced in some applications, as compared to employing just one of the ECC techniques alone.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • Embodiments of the present disclosure are illustrated by way of example, and not by way of limitation, in the figures of the accompanying drawings in which like reference numerals refer to similar elements.
  • FIG. 1 depicts a high-level block diagram illustrating selected aspects of a system, employing temperature dependent, multiple mode error correction in accordance with an embodiment of the present disclosure.
  • FIGS. 2a, 2b depict examples of basic architectures of a memory and memory controller, employing temperature dependent, multiple mode error correction in accordance with an embodiment of the present disclosure.
  • FIGS. 3a-3e provide examples of operations for temperature dependent, multiple mode error correction in accordance with an embodiment of the present disclosure.
  • FIGS. 4a-4c illustrate in greater detail an example of an architecture of a memory and memory controller, employing temperature dependent, multiple mode error correction in accordance with an embodiment of the present disclosure.
  • FIGS. 5a-5c illustrate examples of various formats of data storage in a memory employing temperature dependent, multiple mode error correction in accordance with an embodiment of the present disclosure.
  • DESCRIPTION OF EMBODIMENTS
  • In the description that follows, like components have been given the same reference numerals, regardless of whether they are shown in different embodiments. To illustrate an embodiment(s) of the present disclosure in a clear and concise manner, the drawings may not necessarily be to scale and certain features may be shown in somewhat schematic form. Features that are described and/or illustrated with respect to one embodiment may be used in the same way or in a similar way in one or more other embodiments and/or in combination with or instead of the features of the other embodiments.
  • Temperature dependent, multiple mode error correction in accordance with one aspect of this disclosure, is employed for a memory circuit containing arrays of memory cells. In one aspect, it is believed that temperature dependent, multiple mode error correction in accordance with the present disclosure can reduce power consumption of the memory in one or more modes of operation. It is appreciated that other aspects and advantages may be achieved, depending upon the particular application.
  • In one embodiment, a temperature sensor coupled to an array is configured to provide an output signal which is a function of the temperature of the array of memory cells. Multiple mode error correction code (ECC) logic having an input coupled to an output of the temperature sensor, is configured to encode write data for the array of memory cells in an error correction code in one of a plurality of error correction modes as a function of the temperature of the array of memory cells.
  • For example, ECC encoder logic in accordance with one embodiment, includes a first mode ECC encoder logic and a second mode ECC encoder logic. The first mode ECC encoder logic, that is, a high temperature mode (HTM) ECC encoder logic in one example, encodes write data at a first ECC level which is a relatively high ECC level to provide a relatively high degree of error detection and correction for higher temperature operating levels of the memory at which higher RBER is expected. In one embodiment, the HTM encoder logic encodes write data in a relatively long ECC code word encoding to achieve the relatively high degree of error detection and correction for higher temperature operating levels of the memory. An example of a relatively high degree of error detection and correction is a double detection, double correction scheme capable of both detecting and correcting double errors in encoded data. Other examples of suitable high temperature error correction include five bit error correction. Still other examples include seven bit error correction which is believed to be able to lower the error rate to as low as 1E-9 at 100 degrees Celsius. It is appreciated that the level of error correction in the various temperature modes may vary, depending upon the particular application.
  • Conversely, the second mode ECC encoder logic, that is, a low temperature mode (LTM) ECC encoder logic in one example, encodes write data at a second ECC level which is a relatively low ECC level to provide a relatively lower degree of error detection and correction for lower level operating temperatures of the memory at which lower RBER is expected. In one embodiment, the LTM ECC encoder logic encodes write data in a relatively short ECC code word encoding to achieve the relatively lower degree of error detection and correction for lower level operating temperatures of the memory. An example of a relatively low degree of error detection and correction is a double detection, single correction scheme capable of detecting double errors but correcting only single errors in encoded data. It is believed that power consumption of the memory may be reduced when operating in a lower level ECC correction mode such as the LTM mode, for example at which lower RBER is expected.
  • In another aspect, ECC decoder logic in accordance with one embodiment, includes a first mode ECC decoder logic and a second mode ECC decoder logic. The first mode ECC decoder logic, that is, a high temperature mode (HTM) ECC decoder logic in one example, is configured to decode read data from the memory which was encoded in a relatively long ECC code word encoding in the relatively high ECC level to provide a relatively high degree of error detection and correction for higher temperature operating levels of the memory at which higher RBER is expected. Similarly, the second mode ECC decoder logic, that is, a low temperature mode (LTM) ECC decoder logic in one example, is configured to decode read data from the memory which was encoded in a relatively short ECC code word encoding in the relatively low ECC level to provide a relatively lower degree of error detection and correction for lower temperature operating levels of the memory at which higher RBER is expected. It is believed that power consumption may be reduced at lower operating temperatures by utilizing the LTM ECC decoder logic to decode read data encoded in the relatively short ECC code word.
  • In still another aspect, the ECC encoder logic is configured to set an encoder status flag to indicate whether the write data associated with the encoder status flag was encoded in a relatively long ECC code word encoding in the relatively high ECC level or was encoded in a relatively short ECC word encoding at the second error correction level lower than the first error correction level. The encoder status flag may be used, in one embodiment, to facilitate a transition from one mode to another such as, for example, from the low temperature mode (LTM) to the high temperature mode (HTM).
  • For example, the temperature dependent, multiple mode ECC logic may be further configured to enter the first high temperature ECC mode in response to a temperature sensor output signal indicating that the temperature of the array of memory cells has risen to a temperature level that exceeds a threshold level, and to detect whether the encoder status flag has been set for a line of write data. If so, the multiple mode ECC logic may re-encode write data at the first error correction level in a relatively long ECC code word encoding in response to a detection of the encoder status flag being set for the associated write data, indicating that the write data had previously been encoded at the second error correction level in a relatively short ECC code word encoding. Upon completion of this transition to the high temperature (HTM) ECC mode, write data may be encoded and read data decoded at the first error correction level in the high temperature (HTM) ECC mode at which higher RBER is expected.
  • In still another aspect, the ECC encoder is logic is configured to set a decoder status flag to indicate whether the read data associated with the decoder status flag may be decoded using the relatively long ECC code word decoding in the relatively high ECC level or may be decoded using the relatively short ECC code word decoding at the second error correction level lower than the first error correction level. The decoder status flag may be used, in one embodiment, to facilitate a transition from one mode to another such as, for example, from the high temperature mode (HTM) to the low temperature mode (LTM).
  • For example, the multiple mode ECC logic may be further configured to enter the second error correction mode (LTM) in response to the temperature sensor output signal indicating that the temperature level of the array of memory cells has fallen below a threshold level, and to detect whether the decoder status flag has been set for read data to indicate that the read data associated with the decoder status flag may be decoded using the relatively short ECC code word decoding in the relatively low ECC level. If so, the multiple mode ECC logic may read and decode write data at the second error correction level in response to a detection of the decode status flag being set for the associated read data to indicate that the read data may be decoded at the second error correction level in the LTM ECC mode.
  • Alternatively, if the multiple mode ECC logic detects that the decoder status flag has been set to indicate that the read data associated with the decoder status flag may be decoded using the relatively long ECC code word decoding in the relatively high ECC level, the multiple mode ECC logic reads and decodes the write data at the first error correction level in response to a detection of the decode status flag being set for the associated read data to indicate that the read data may not be decoded at the second error correction level, and resets the decode status flag to indicate that the read data may subsequently be decoded at the second error correction level.
  • Although certain embodiments are described herein in connection with two temperature dependent modes, it is appreciated that temperature dependent, multiple mode error correction may be applied to embodiments having more than two temperature modes. For example, a low temperature mode may have a relatively low level of error correction for low temperature operation, such as double error detection and single error correction, for example. An intermediate temperature mode may have a relatively higher level of error correction, such as five bit error correction for an intermediate temperature range such as 60-100 degrees C. A third and higher temperature mode may have a still higher level of error correction such as seven bit error correction, for example, for a still higher temperature range in excess of 100 degrees Celsius, for example. Encoding and decoding operations are conducted in accordance with the applicable level of the error correction capability of the associated temperature dependent mode. It is appreciated that a greater or fewer number of temperature modes may be utilized in temperature dependent, multiple mode error correction in accordance with the present description, depending upon the particular application.
  • It is believed that temperature dependent, multiple mode error correction in accordance with one aspect of this disclosure may be applied to systems employing a variety of types of memory devices including non-volatile memory such as spin torque transfer (STT) Random Access Memory (RAM), three dimensional (3D) crosspoint, phase change memory, magnetic RAM, a resistive memory, nanowire memory, ferro-electric transistor random access memory (FeTRAM), flash memory such as NAND or NOR, and volatile memory such as 2D RAM, for example. Other types of memory may be suitable as well. Temperature dependent, multiple mode error correction in accordance with embodiments described herein may be used either in stand-alone memory controllers and memory circuits or logic arrays, or can be embedded in microprocessors, digital signal processors (DSPs) or other circuits transmitting or receiving data. Additionally, it is noted that although systems and processes are described herein primarily with reference to microprocessor based systems in the illustrative examples, it will be appreciated that in view of the disclosure herein, certain aspects, architectures, and principles of the disclosure are equally applicable to other types of devices, memory and logic devices.
  • Turning to the figures, FIG. 1 is a high-level block diagram illustrating selected aspects of a system implementing temperature dependent, multiple mode error correction according to an embodiment of the present disclosure. System 10 may represent any of a number of electronic and/or computing devices, that may include a memory device. Such electronic and/or computing devices may include computing devices such as a mainframe, server, personal computer, workstation, telephony device, network appliance, virtualization device, storage controller, portable or mobile devices (e.g., laptops, netbooks, tablet computers, personal digital assistant (PDAs), portable media players, portable gaming devices, digital cameras, mobile phones, smartphones, feature phones, etc.) or component (e.g. system on a chip, processor, bridge, memory controller, memory, etc.). In alternative embodiments, system 10 may include more elements, fewer elements, and/or different elements. Moreover, although system 10 may be depicted as comprising separate elements, it will be appreciated that such elements may be integrated on to one platform, such as systems on a chip (SoCs). In the illustrative example, system 10 comprises a microprocessor 20, a memory controller 30, a memory 40 and peripheral components 50 which may include, for example, video controller, input device, output device, storage, network adapter, etc. The microprocessor 20 includes a cache 25 that may be part of a memory hierarchy to store instructions and data, and the system memory 40 may also be part of the memory hierarchy. Communication between the microprocessor 20 and the memory 40 may be facilitated by the memory controller (or chipset) 30, which may also facilitate in communicating with the peripheral components 50.
  • Storage of the peripheral components 50 may be, for example, non-volatile storage, such as STT RAM, solid-state drives, magnetic disk drives, optical disk drives, a tape drive, flash memory, etc. The storage may comprise an internal storage device or an attached or network accessible storage. The microprocessor 20 is configured to write data in and read data from the memory 40. Programs in the storage are loaded into the memory and executed by the processor. A network controller or adapter enables communication with a network, such as an Ethernet, a Fiber Channel Arbitrated Loop, etc. Further, the architecture may, in certain embodiments, include a video controller configured to render information on a display monitor, where the video controller may be embodied on a video card or integrated on integrated circuit components mounted on a motherboard or other substrate. An input device is used to provide user input to the processor, and may include a keyboard, mouse, pen-stylus, microphone, touch sensitive display screen, input pins, sockets, or any other activation or input mechanism known in the art. An output device is capable of rendering information transmitted from the processor, or other component, such as a display monitor, printer, storage, output pins, sockets, etc. The network adapter may embodied on a network card, such as a Peripheral Component Interconnect (PCI) card, PCI-express, or some other I/O card, or on integrated circuit components mounted on a motherboard or other substrate.
  • One or more of the components of the device 10 may be omitted, depending upon the particular application. For example, a network router may lack a video controller, for example. Any one or more of the memory devices 25, 40, and the other devices 10, 30, 50 may include temperature dependent, multiple mode error correction in accordance with the present description.
  • FIG. 2a shows an example of an array 60 of rows and columns of bitcells or memory cells 64 of an STT RAM memory circuit 40 in accordance with one embodiment of the present description. In another embodiment, the memory circuit may be a volatile memory such as a DRAM memory, for example. It is appreciated that the memory circuit 40 may be other types of memory as well.
  • In the embodiment of FIG. 2a , the memory 40 is a cache memory such as the cache 25 (FIG. 1). It is appreciated that temperature dependent, multiple mode error correction in accordance with the present description may be applied to other types of memory, depending upon the particular application.
  • Each memory cell 64 is capable of storing a bit of data representing a logical one or logical zero value. The memory controller 30 performs read operations and write operations to the memory cells 64. The STT RAM memory circuit 40 may also include a row decoder, a timer device and I/O devices (or I/O outputs). Bits of the same memory word may be separated from each other for efficient I/O design. Thus, the array 60 of memory cells 64 may be subdivided, either logically or physically into a main array 60 a and one or more subarrays 60 b of memory cells as shown in FIG. 2 b.
  • A multiplexer (MUX) may be used to connect each column to the required circuitry during a READ operation. Another MUX may be used to connect each column to a write driver during a WRITE operation.
  • As explained in greater detail below, the memory of this embodiment, includes multiple mode error correction code (ECC) logic 70 which may be disposed on the same integrated circuit as that of the array 60 or may be disposed on other integrated circuits such as the memory controller 40 or may be distributed over more than one integrated circuit. In this embodiment, a temperature sensor 80 is disposed on the same integrated circuit as the array 60 to sense the temperature of the array 60 of memory cells. However, it is appreciated that in other embodiments, the temperature sensor 80 may be disposed on other integrated circuit but positioned in a suitable manner to be thermally coupled to the array 60 to facilitate sensing the temperature of the array 60.
  • The multiple mode ECC logic 70 is configured to perform described operations using appropriate hardware, software or firmware, or various combinations thereof. FIGS. 3a-3e depict examples of temperature dependent, multiple mode error correction memory operations in accordance with one aspect the present description. FIG. 3a is directed to mode switching operations of temperature dependent, multiple mode error correction logic such as the multiple mode ECC logic 70 depicted in greater detail in FIG. 4a . In the embodiment of FIG. 4a , the ECC logic is disposed on an integrated circuit die 400 on which the memory controller 40 is disposed. It is appreciated that in other embodiments, the ECC logic 70 may be disposed on an integrated circuit die 401 of the memory circuit 40 or on both dies 400, 401 or on other dies, depending upon the particular application.
  • In one operation, an ECC mode controller logic such as the ECC mode controller logic 402 (FIG. 4a ) of the multiple mode ECC logic 70 determines (block 302, FIG. 3a ) the current operating mode of the multiple mode ECC logic 70. In one embodiment, the ECC mode controller logic 402 maintains an ECC mode flag 406 (FIG. 4a ) which indicates whether the multiple mode ECC logic 70 is operating in a high temperature mode (HTM) or a low temperature mode (LTM). If the ECC mode flag 406 indicates that the multiple mode ECC logic 70 is operating in a low temperature mode (LTM), for example, a determination (block 306, FIG. 3a ) is made as to whether the temperature of the memory array being monitored, such as the memory array 60, has exceeded a threshold. In this embodiment, the ECC mode controller logic 402 has an input coupled to an output of the memory temperature sensor 80 which is thermally coupled to the memory array 60 to monitor the temperature of the memory. In one example, the temperature threshold of the determination 306 may be a predetermined value or may be defined by a user input, for example. One such temperature threshold may be a value of 60 degrees Celsius, for example, or may be within a range of 55-65 degrees, for example. It is appreciated that the particular value of the temperature threshold of the determine 306 may vary, depending upon the particular application.
  • If it is determined (block 306) that the temperature of the memory array 60 being monitored has not exceeded the threshold, the ECC mode controller logic 402 (FIG. 4a ) of the multiple mode ECC logic 70 continues to operate in the low temperature ECC mode and to monitor the memory temperature and compare it to a threshold. If it is determined (block 306) that the temperature of the memory array 60 being monitored has exceeded the threshold, the ECC mode controller logic 402 (FIG. 4a ) of the multiple mode ECC logic 70 switches (block 310) the mode of operation of the multiple mode ECC logic 70 to the high temperature ECC mode and sets the ECC mode flag 406 to indicate that the multiple mode ECC logic 70 is operating in a high temperature mode (HTM). As explained in greater detail below, in the high temperature ECC mode, write data to be stored in the memory array is encoded in a relatively long ECC code word encoding to provide a relatively high ECC level for a more robust error detection and correction capability at the higher temperatures in which the RBER may increase.
  • In the high temperature ECC mode, a determination (block 314, FIG. 3a ) is made as to whether the temperature of the memory array being monitored, such as the memory array 60, has dropped below a threshold. In one example, the temperature threshold of the determination 314 may be the same as the threshold of the determination of block 306. In another example, the threshold of the determination of block 314 may be displaced somewhat from the threshold of the determination of block 306. Such a displacement such as 5 degrees Celsius, for example, between thresholds may, in some applications, reduce mode switching due to small fluctuations in the temperature of the memory array. Thus, if the threshold for a high temperature mode to low temperature mode transition is set for 60 degrees Celsius, for example, a threshold for a low temperature mode to high temperature mode transition may be set at a displaced value such as 55 degrees Celsius, for example.
  • Also, as explained in greater detail below, in some embodiments, some processing tasks may be undertaken to prepare for a transition to another mode. As such, a displacement in thresholds may facilitate completing such preparations prior to reaching temperatures which may significantly increase RBER, for example. The temperature threshold of the determination of block 314 may be a predetermined value or may be defined by a user input, for example. It is appreciated that the particular value of the temperature threshold of the determine 314 may vary, depending upon the particular application.
  • If it is determined (block 314) that the temperature of the memory array 60 being monitored has not dropped below the applicable threshold, the ECC mode controller logic 402 (FIG. 4a ) of the multiple mode ECC logic 70 continues to operate in the high temperature ECC mode and to monitor the memory temperature and compare it to a threshold. If it is determined (block 314) that the temperature of the memory array 60 being monitored has dropped below the threshold, the ECC mode controller logic 402 (FIG. 4a ) of the multiple mode ECC logic 70 switches (block 318) the mode of operation of the multiple mode ECC logic 70 to the low temperature ECC mode and sets the ECC mode flag 406 to indicate that the multiple mode ECC logic 70 is operating in a low temperature mode (LTM). As explained in greater detail below, in the low temperature ECC mode, write data to be stored in the memory array is encoded in a relatively short ECC code word encoding to provide a relatively low ECC level which provides a somewhat less robust error detection and correction capability at the lower temperatures in which the RBER may be smaller. As a consequence, power consumption may decrease accordingly.
  • FIG. 3b is directed to operations of the multiple mode ECC logic 70 in a high temperature ECC mode. In connection with entering (block 322, FIG. 3b ) the high temperature ECC mode, the ECC mode controller logic 402 (FIG. 4a ) of the multiple mode ECC logic 70 sets (block 326) the ECC mode flag 406 to indicate that the multiple mode ECC logic 70 will be operating in the high temperature ECC mode. In accordance with a transition from a low temperature ECC mode to a high temperature ECC mode, preparations (block 330) may be undertaken for the high temperature ECC mode. As explained in greater detail below in connection with FIG. 3c , in one embodiment, such preparations may include scanning the memory array 60 to determine if there are lines of memory which were previously encoded in accordance with a low temperature ECC mode prior to the multiple mode ECC logic entering the high temperature ECC mode. If so, those lines of memory which were encoded in accordance with the low temperature ECC mode may be re-encoded in accordance with the high temperature mode so as to better withstand the higher RBER expected in high temperature operation of the memory.
  • Upon completion of the preparation of the memory for the high temperature operations, memory I/O operations (block 334) may be processed in a high temperature ECC mode. For example, a memory controller such as the memory controller 30 (FIG. 4a ) may receive write data to be written to a memory such as the memory circuit 40. Before the write data is written to the memory, the write data is encoded (block 338) in a relatively long ECC code word encoding by high temperature mode (HTM) ECC encoder logic 410 (FIG. 4a ) of an ECC Encoder Logic 412 of the multiple mode ECC Logic 70. Encoding the write data in a relatively long ECC code word encoding provides a relatively high ECC level of error detection and correction capability to compensate for the relatively higher RBER expected during high temperature memory operations.
  • One example of a suitable error correction code is a Reed-Solomon error correction code which is a block code which encodes blocks of data in symbols to facilitate error detection and correction. It is appreciated that other types of error correction codes may be employed in multiple mode error correction in accordance with the present description. For example, it is believed that both block and convolutional codes may be suitable as a long code for multiple mode error correction in accordance with the present description. Also, it is believed that both systematic and nonsystematic codes may be suitable as a long code for multiple mode error correction in accordance with the present description. In a systematic scheme, an encoder attaches to the original write data a number of check bits (such as parity data, for example), which are derived from the data bits by an appropriate deterministic algorithm. In a system that uses a non-systematic code, the original message is transformed into an encoded message that typically has at least as many bits as the original message. Additional examples of codes which may be suitable as long codes are concatenated codes, recursive codes, non-recursive codes, repetition codes, Hamming codes, multidimensional parity-check codes, turbo codes, low-density parity-check codes (LDPC), etc. The high temperature mode (HTM) ECC encoder logic 410 is configured to encode write data for the memory circuit in a long word error correction code (such as a Reed-Solomon error correction code, for example) encoding of the temperature dependent, multiple mode error correction in accordance with one embodiment of the present description.
  • In one embodiment, the high temperature mode (HTM) ECC encoder logic 410 encodes the write data in a long error correction code word encoding which adds redundant data such as check bits for example. Examples of redundant check bit data which may be added for purposes of error detection and correction include parity bits, repetition codes, checksums and cyclic redundancy checks (CRCs). The write data encoded with the long error correction code word encoding may be stored in a memory such as the memory array 60 (FIG. 5a ). As explained in greater detail below, the write data encoded in the long error correction code word encoding may subsequently be decoded and checked for errors. In many cases, depending upon the severity of the detected errors, the decoded data may be corrected and restored to its original error-free form before it was written to the memory.
  • In the illustrated embodiment of FIG. 5a , the memory array 60 is divided either logically or physically, into a main array 60 a and a sub-array 60 b such that each line of data 500 a, 500 b, . . . 500 n stored in the memory array 60 may, depending upon the ECC mode of operation, span over both the main array 60 a and the sub-array 60 b such as in the high temperature error correction mode, or may be limited to the main array 60 a such as in the low temperature error correction mode. Accordingly by not utilizing the sub-array 60 b in a mode such as the low temperature error correction mode, to store a line of ECC encoded data, energy savings may be realized, in some applications.
  • In this embodiment, the main array 60 a includes a field 502 in which the bits of the original write data of each line of encoded data may be stored. In one example, the field 502 of a cache memory may be 512 bits long, for example. It is appreciated that in other embodiments, the field 502 may be longer or shorter, depending upon the particular application.
  • Another field, designated the ECC data field 504, stores the error correction bits of the ECC encoding for each line of data stored in the memory array 60. In this embodiment, the ECC data field 504 spans over both the main array 60 a and the sub-array 60 b and includes a subfield 504 a which likewise, spans over both the main array 60 a and the sub-array 60 b, and is utilized to store the check bits of the long ECC word encoding in the high temperature error correction mode. In the embodiment of FIG. 5a , the long ECC word check bits field 504 a for the long ECC word encoding occupies the entire ECC data field 504 of each line of data encoded with the long ECC word encoding. However, it is appreciated that in other embodiments, the size of the long ECC word encoding may vary, depending upon the particular application.
  • The long ECC word check bits field 504 a in turn includes a first subfield 504 a 1 utilized to store a first portion of the check bits of the ECC encoding in the high temperature error correction mode, and a second subfield 504 a 2 utilized to store the remaining check bits of the ECC encoding in the high temperature error correction mode. In one embodiment, the check bits of the subfield 504 a 1 may be utilized as both the check bits of the short ECC word encoding in the low temperature mode, and may also be used as the first portion of the check bits of the long ECC word encoding in the high temperature mode. Hence, the check bits of the field 504 a 1 are referred to as the short word ECC check bits but may be utilized as both the short word ECC check bits and also as the first portion of the long word ECC check bits. In one example, the field 504 a 1 of a cache memory may be 10 bits long, for example. It is appreciated that in other embodiments, the field 504 a 1 may be longer or shorter, depending upon the particular application.
  • The check bits of the subfield 504 a 2 are the second portion of the check bits of the long word ECC encoding in the high temperature mode and thus the check bits of the field 504 a 2 are referred to herein as the long word extension check bits or ECC extension data. In one example, the field 504 a 2 of a cache memory may be 61 bits long, for example. It is appreciated that in other embodiments, the field 504 a 2 may be longer or shorter, depending upon the particular application.
  • Accordingly, the check bits of the long ECC word encoding of the high temperature mode includes both the short word ECC check bits of the subfield 504 a 1 and the long word extension ECC check bits of the subfield 504 a 2, of the long word ECC check bits subfield 504 a. Conversely, the check bits of the short ECC word encoding of the low temperature mode may include just the short word ECC check bits of the subfield 504 a 1 of the ECC check bits field 504. Accordingly, in the low temperature error correction mode, the active check bits are stored in the main subarray 60 a and do not extend into the subarray 60 b, that is are not stored in the subarray 60 b. Thus, the check bits stored in the subarray 60 b are inactive and the subarray 60 b may be permitted to be dormant in which operation is suspended or otherwise operated at reduced or no power, to provide for reduced power consumption of the memory array 60 in the low temperature mode. In addition, because of the short word encoding and decoding, the LTM encoder and decoder may have reduced complexity as compared the counterpart HTM encoder and decoder, such that power consumption by the LTM encoder and decoder may be reduced as compared to that of the HTM encoder and decoder, in some applications.
  • The write data encoded with the long error correction code word encoding may subsequently be read, decoded and checked for errors. Thus, as another example of an I/O operation, a memory controller such as the memory controller 30 (FIG. 4a ) may perform memory read operations in the high temperature error correction mode. In one operation of a memory read operation (FIG. 3b ), a memory controller such as the memory controller 30 (FIG. 2) causes data previously encoded in a relatively long ECC code word encoding in the relatively high ECC level of the high temperature ECC mode, to be read from a memory such as the memory circuit 40. The long ECC code word encoded read data including the long ECC word check bits stored in the field 504 a, is read and decoded (block 342) by a high temperature mode (HTM) ECC decoder logic 416 (FIG. 4a ) of an ECC decoder logic 420 of the multiple mode ECC logic 70.
  • More specifically, the encoded read data is decoded from the high temperature ECC code encoding of the high temperature ECC encoder logic. As previously mentioned, the high temperature, long word ECC code may be a Reed-Solomon error correction code, for example, but it is appreciated that other error correction codes may be employed in a high temperature, long word error correction code of multiple mode error correction in accordance with the present description.
  • The ECC decoder logic 416 is configured to decode read data from the memory circuit, which has been encoded in accordance with the high temperature, long word error correction code encoding. As shown in FIG. 4a , ECC decoder logic 420 is disposed on a die which in this embodiment, is the memory controller die 400. It is appreciated that the ECC decoder logic 420 of multiple mode error correction in accordance with the present description, may be distributed over other types of dies such as the dies of processors, I/O controllers, etc.
  • In another operation of FIG. 3b , the encoded read data being processed by the high temperature mode (HTM) ECC decoder logic 416 may be checked (block 342, FIG. 3b ) by an error detection logic of the high temperature mode (HTM) ECC decoder logic, to determine if the decoded read data contains erroneous data. Such erroneous data may have been caused by corruption of the encoded write data which may have occurred at various stages such as when the encoded write data was transmitted to the memory cell array 60 or while the encoded write data was being stored in the memory cell array 60, or being read from the memory cell array 60 or processed by the ECC logic, for example.
  • As shown in FIG. 4b , the high temperature mode (HTM) ECC decoder logic 416 of FIG. 4a includes a high temperature mode error detection logic 422 which utilizes the redundant data which was added by the high temperature, long word code encoding as described above, to detect whether the read data being decoded contains errors as indicated by the redundant data stored in the long word ECC check data field 504 a (FIG. 5a ) of the line of memory being read. If error detection logic 422 detects errors, the errors may be corrected (block 342, FIG. 3b ) by suitable correction logic such as the high temperature mode error correction logic 424 (FIG. 4b ) of the high temperature mode (HTM) ECC decoder logic 416. Upon completion of the decoding by the high temperature mode (HTM) ECC decoder logic 416, the encoded read data has been fully decoded (and corrected if errors detected) and may be forwarded to the entity requesting the read data.
  • As previously mentioned, the long word ECC code of one embodiment may be a Reed-Solomon code, for example. Numerous algorithms may be suitable for decoding Reed-Solomon ECC codes. Examples may include Peterson decoders, Berlekamp-Massey decoders, Euclidean decoders, time domain decoders, frequency domain decoders, etc. An outer ECC decoder logic may be implemented in one or more of hardware, software and firmware.
  • In another operation, a determination (block 346, FIG. 3b ) may be made as to whether to continue in the high temperature ECC mode. Such a determination may be made as a function of the output of the memory temperature sensor 80 (FIG. 4a ). For example, if the output of the memory temperature sensor 80 indicates that the temperature of the memory remains above the threshold, the multiple mode error correction may remain in the high temperature mode and await the next I/O operation (block 334). Conversely, if the output of the memory temperature sensor 80 indicates that the temperature of the memory has fallen below the threshold, the multiple mode error correction may transition to the low temperature error correction mode described below in connection with FIG. 3 d.
  • FIG. 3c is directed to one example of operations which may be undertaken in preparation (block 330, FIG. 3b ) for the high temperature ECC mode. In one embodiment, such preparations may include scanning (block 350, FIG. 3c ) each line of the memory array 60 to determine if there are lines of memory which were previously encoded in accordance with a low temperature, short word ECC mode prior to the multiple mode ECC logic entering the high temperature ECC mode.
  • In the example of FIG. 5a , the memory array 60 is part of a cache memory. Accordingly, the main array 60 a includes a field 510 of tag data bits for each line of data stored in the cache memory. The tag data field 510 includes a subfield 510 a to store bits used in connection with cache operations. The tag data field 510 further includes, in this embodiment, subfields 510 b, 510 c to store ECC status bits in connection with error correction encoding and decoding operations of the associated line stored in the cache memory. In one embodiment, the status bits of the subfields may be as few as one bit each, depending upon the particular application. Thus, status bits stored in the subfield 510 c, for example, may be indicate whether the associated line of data stored in the cache memory was encoded in a high temperature, long word ECC encoding, or was encoded in a low temperature, short word ECC encoding.
  • Accordingly, the multiple mode encoder logic 70 (FIG. 4a ) may read (block 350, FIG. 3c ) each line of the cache memory in turn and examine the associated encode status bit or bits of the encode status field 510 c to determined (block 352, FIG. 3c ) whether the associated line had been encoded in a low temperature, short word ECC encoding. If so, the high temperature ECC encoder logic 410 can re-encode (block 354) the line of data in a high temperature, long word ECC encoding to prepare that line of data for high temperature operations. For example, lines of cache memory which were encoded in accordance with the low temperature ECC mode may be re-encoded in accordance with the high temperature mode so as to better withstand the higher RBER expected in high temperature operation of the memory.
  • Once the line of data has been re-encoded in a high temperature, long word ECC encoding, the associated encode status bit or bits of the encode status field 510 c may be set (block 356) to indicate that the associated line had been encoded in a high temperature, long word ECC encoding. In addition, an associated decode status bit or bits of the decode status field 510 b may be set (block 356) to indicate that the associated line should be decoded in a high temperature, long word ECC decoding. Conversely, if it is determined (block 352, FIG. 3c ) that associated line of cache memory has already been encoded in a high temperature, long word ECC encoding as indicated by the encode status field 510 c, the high temperature, long word ECC re-encoding of that line of memory may be bypassed. In addition, the setting of encode and decode status fields may be skipped for the associated line of memory in that the status bits will have already been set to indicate encoding and decoding in the high temperature, long word ECC mode.
  • If it is determined (block 358) that additional lines of the cache memory remain to be scanned, the scanning and re-encoding from low temperature, short word ECC encoding to high temperature, long word ECC encoding may be repeated for each line of cache memory as appropriate until all the lines of cache memory have been scanned and re-encoded as needed. Once all the lines of cache memory have been scanned and re-encoded as needed, the multiple mode error correction logic may proceed to handle I/O operations (block 334, FIG. 3b ) as described above in accordance with the high temperature mode.
  • FIG. 3d is directed to operations of the multiple mode ECC logic 70 in a low temperature ECC mode. In connection with entering (block 360, FIG. 3d ) the low temperature ECC mode, the ECC mode controller logic 402 (FIG. 4a ) of the multiple mode ECC logic 70 sets (block 362) the ECC mode flag 406 (FIG. 4a ) to indicate that the multiple mode ECC logic 70 will be operating in the low temperature ECC mode. In accordance with a transition from a high temperature ECC mode to a low temperature ECC mode, preparations (block 364) may optionally be undertaken for the low temperature ECC mode. For example, it is appreciated that such preparations for a low temperature mode may be bypassed in embodiments in which a portion of a high temperature, long word ECC encoding, may be utilized for a low temperature, short word encoding, for example. Conversely, in other embodiments, the high temperature, long word encoding may be fully independent of the low temperature, short word encoding. As such, and as explained in greater detail below in connection with FIG. 3e , in one embodiment, preparations for low temperature mode may include scanning the memory array 60 to determine if there are lines of memory which were previously encoded in accordance with a high temperature ECC mode prior to the multiple mode ECC logic entering the low temperature ECC mode. If so, those lines of memory which were encoded in accordance with the high temperature ECC mode may be re-encoded in accordance with the low temperature mode to accommodate the lower RBER expected in low temperature operation of the memory.
  • Upon completion of the preparation (if any) of the memory for the low temperature operations, memory I/O operations (block 366) may be processed in a low temperature ECC mode. For example, a memory controller such as the memory controller 30 (FIG. 4a ) may receive write data to be written to a memory such as the memory circuit 40. Before the write data is written to the memory, the write data is encoded (block 368) by low temperature mode (LTM) ECC encoder logic 430 (FIG. 4a ) of an ECC Encoder Logic 412 of the multiple mode ECC Logic 70. The LTM ECC encoder logic 430 is configured to encode write data in a relatively short ECC code word encoding which provides a relatively low ECC level of error detection and correction capability in accordance with the relatively lower RBER expected during low temperature memory operations. Because the LTM ECC encoder logic 430 employs short word encoding, the complexity of the LTM ECC encoder logic 430 may be reduced as compared to that of the HTM ECC encoder logic 410. As a consequence, the power consumption of the LTM ECC encoder logic 430 in the low temperature mode, may be reduced as compared to that of the HTM ECC encoder logic 410 in the high temperature mode.
  • One example of a suitable low temperature, short word error correction code is a Reed-Solomon error correction code which is a block code which encodes blocks of data in symbols to facilitate error detection and correction. It is appreciated that other types of error correction codes may be employed in low temperature mode, multiple mode error correction in accordance with the present description. For example, it is believed that both block and convolutional codes may be suitable as a short word code for multiple mode error correction in accordance with the present description. Also, it is believed that both systematic and nonsystematic codes may be suitable as a short word code for multiple mode error correction in accordance with the present description. Additional examples of codes which may be suitable as short word codes are recursive codes, non-recursive codes, repetition codes, Hamming codes, multidimensional parity-check codes, turbo codes, low-density parity-check codes (LDPC), etc. The low temperature mode (LTM) ECC encoder logic 430 is configured to encode write data for the memory circuit in a short word error correction code (such as a Reed-Solomon error correction code, for example) of the temperature dependent, multiple mode error correction in accordance with one embodiment of the present description.
  • In one embodiment, the low temperature mode (LTM) ECC encoder logic 430 encodes the write data in a short error correction code word which adds redundant data such as check bits for example. Examples of redundant check bit data which may be added for purposes of error detection and correction include parity bits, repetition codes, checksums and cyclic redundancy checks (CRCs). The write data encoded with the check bits of the short error correction code word encoding may be stored (block 370, FIG. 3d ) in a memory such as the memory array 60 (FIG. 5a ). As explained in greater detail below, the write data encoded in the short error correction code word encoding may subsequently be decoded and checked for errors. In many cases, depending upon the severity of the detected errors, the decoded data may be corrected and restored to its original error-free form before it was written to the memory.
  • As previously mentioned, the long ECC word check bits field 504 a includes a first subfield 504 a 1 utilized to store a first portion of the check bits of the ECC encoding in the high temperature error correction mode. In the low temperature, short word ECC encoding, the check bits generated may be stored in the same field 504 a 1 which is designated the short word ECC data field 504 a 1 in FIG. 5a . Thus, check bits of the short ECC word encoding of the low temperature mode may include just the short word ECC check bits of the subfield 504 a 1 of the ECC check bits field 504. Accordingly, in the low temperature error correction mode, the subarray 60 b containing the field 504 a 2 utilized in high temperature, long word ECC encoding, may be dormant or otherwise operated at reduced or no power, to provide for reduced power consumption of the memory array 60 a in the low temperature mode.
  • In another operation, the encode status field 510 c for the line of data written to memory may be set (block 372) to indicate that the line of data was encoded in a low temperature, short word ECC encoding. Similarly, the decode status field 510 b may be set (block 372) for the line of data written to memory to indicate that the line of data is to be decoded in a low temperature, short word ECC decoding.
  • The write data encoded in a multiple mode ECC encoding may subsequently be read, decoded and checked for errors. Thus, as another example of an I/O operation, a memory controller such as the memory controller 30 (FIG. 4a ) may perform memory read operations in the low temperature error correction mode. In one operation of a memory read operation (FIG. 3d ), a memory controller such as the memory controller 30 (FIG. 2) causes previously encoded data to be read and the decode status for that line of data checked (block 374). If the decode status field 510 b of the line of data read from the memory indicates that the line of data was previously encoded in the low temperature, short word ECC encoding, the low temperature, short word encoded read data may be decoded (block 376) by a low temperature mode (LTM) ECC decoder logic 436 (FIG. 4a ) of the ECC decoder logic 420 of the multiple mode ECC logic 70. In one embodiment, the low temperature, short word decoding may be a less robust error detection and correction level in keeping with the lower RBER expected in low temperature operations. Because the LTM ECC decoder logic 436 employs short word decoding, the complexity of the LTM ECC decoder logic 436 may be reduced as compared to that of the HTM ECC decoder logic 416. As a consequence, the power consumption of the LTM ECC decoder logic 436 in the low temperature mode, may be reduced as compared to that of the HTM ECC decoder logic 416 in the high temperature mode.
  • Thus, the encoded read data is decoded from the low temperature, short word ECC encoding provided by the low temperature ECC encoder logic. As previously mentioned, the low temperature, short word ECC code may be a Reed-Solomon error correction code, for example, but it is appreciated that other error correction codes may be employed in a low temperature, short word error correction code decoding of multiple mode error correction in accordance with the present description.
  • FIG. 4a shows an embodiment of the memory controller die 400 of the memory controller 30 which includes the low temperature mode (LTM) ECC decoder logic 436 of an ECC decoder logic 420 of the multiple mode ECC logic 70. The ECC decoder logic 436 is configured to decode in accordance with the low temperature, short word error correction code, encoded read data from the memory circuit.
  • In another operation of FIG. 3b , the encoded read data being processed by the low temperature mode (LTM) ECC decoder logic 436 may be checked (block 376, FIG. 3d ) by an error detection logic of the low temperature mode (LTM) ECC decoder logic, to determine if the decoded read data contains erroneous data. Such erroneous data may have been caused by corruption of the encoded write data which may have occurred at various stages such as when the encoded write data was transmitted to the memory cell array 60 or while the encoded write data was being stored in the memory cell array 60, or being read from the memory cell array 60 or processed by the ECC logic, for example.
  • As shown in FIG. 4c , the low temperature mode (LTM) ECC decoder logic 436 of FIG. 4a includes a low temperature mode error detection logic 442 which utilizes the redundant data which was added by the low temperature, short word code encoding as described above, to detect whether the read data being decoded contains errors as indicated by the redundant data stored in the short word ECC data field 504 a 1 (FIG. 5a ) of the line of memory being read. If error detection logic 442 detects errors, the errors may be corrected (block 376, FIG. 3d ) by suitable correction logic such as the low temperature mode error correction logic 444 (FIG. 4c ) of the low temperature mode (LTM) ECC decoder logic 436. Upon completion of the decoding by the low temperature mode (LTM) ECC decoder logic 436, the encoded read data has been fully decoded (and corrected if errors detected) and may be forwarded to the entity requesting the read data.
  • As previously mentioned, the low temperature, short word ECC code of one embodiment may be a Reed-Solomon code, for example. Numerous algorithms may be suitable for decoding Reed-Solomon ECC codes. Examples may include Peterson decoders, Berlekamp-Massey decoders, Euclidean decoders, time domain decoders, frequency domain decoders, etc. An outer ECC decoder logic may be implemented in one or more of hardware, software and firmware.
  • As previously mentioned, a memory controller such as the memory controller 30 (FIG. 2) causes previously encoded data to be read and the decode status for that line of data checked (block 374). If the decode status field 510 b of the line of data read from the memory indicates that the line of data is to be decoded from the high temperature, long word ECC encoding, the high temperature, long word encoded read data may be decoded (block 378) by the high temperature mode (HTM) ECC decoder logic 416 (FIG. 4a ) as described above. Accordingly, data which was encoded for high temperature operations using high temperature, long word encoding, is decoded using the high temperature, long word encoding to provide the more robust level of error detection and correction to compensate for a higher RBER which may have been encountered while the line of data was stored in the memory. In subsequent readings of the line of data, the line of data may be decoded using the low temperature, short word ECC encoding in keeping with the mode of operation of the multiple mode error correction being in the low temperature mode of operation. Accordingly, the decode status is set (block 380) for the line of data read from memory to indicate that the line of data in subsequent readings is to be decoded in a low temperature, short word ECC decoding.
  • In the embodiment of FIG. 5a , the check bits encoded with the high temperature, long word ECC encoding and stored in the short word ECC data field 504 a 1 may be utilized for the low temperature, short word decoding described above. Hence, the check bits of the field 504 a 1 are common bits to both a low temperature, short word ECC encoding and a high temperature, long word encoding. Hence, where the check bits of the field 504 a 1 may be utilized for both low temperature, short word decoding and high temperature, long word decoding, the check bits of the field 504 a 1 need not be re-encoded when transitioning from a high temperature mode of operations to a low temperature mode of operations. U.S. Pat. No. 8,533,572 describes error correction for caches which can utilize a common set of check bits. However, in other embodiments, such as where the low temperature, short word encoding and the high temperature, long word encoding may not share check bits, a re-encoding (block 382) of the data to a low temperature, short word encoding may be appropriate.
  • In another operation, a determination (block 384, FIG. 3b ) may be made as to whether to continue in the low temperature ECC mode. Such a determination may be made as a function of the output of the memory temperature sensor 80 (FIG. 4a ). For example, if the output of the memory temperature sensor 80 indicates that the temperature of the memory remains below the threshold, the multiple mode error correction may remain in the low temperature mode and await the next I/O operation (block 366). Conversely, if the output of the memory temperature sensor 80 indicates that the temperature of the memory has risen above the threshold, the multiple mode error correction may transition to the high temperature error correction mode described above in connection with FIG. 3 b.
  • FIG. 5b is directed to an embodiment which differs from the embodiment of FIG. 5a in that the high temperature, long word ECC encoding does not share check bits with the low temperature, short word ECC encoding. Accordingly, the check bits stored in a short word ECC data subfield 504 b of the ECC data field 504 are independent of check bits stored in the long word ECC data field 504 a of the ECC data field 504. As a result, check bits generated by high temperature, long word ECC encoding are not used as check bits in a low temperature mode. Instead, high temperature, long word ECC encodings are re-encoded to a low temperature, short word ECC encoding when transitioning from a high temperature mode to a low temperature mode.
  • Similarly, FIG. 5c is directed to an embodiment which differs from the embodiment of FIG. 5a in that the high temperature, long word ECC encoding does not share check bits with the low temperature, short word ECC encoding. In addition, the ECC correction of FIG. 5c utilizes a non-systematic code, such that the original write data is transformed into an encoded message that typically has at least as many bits as the original message. Accordingly, in the embodiment of FIG. 5c , in the high temperature, long word ECC encoding, the original write data and the redundant check data are transformed into an encoded message stored in a long word ECC encoded write data field 520 which spans both the main array 60 a and the sub-array 60 b. In the low temperature, short word ECC encoding, the original write data and the redundant check data are transformed into a shorter encoded message stored in a short word ECC encoded write data field 524 within the main array 60 a and does not extend into sub-array 60 b.
  • Thus, in both of the embodiments of FIGS. 5b and 5c , as well as the embodiment of FIG. 5a , the low temperature, short word encoding does not extend into the sub-array 60 b. Hence, in the low temperature mode, the sub-array 60 b may be dormant to reduce power consumption in each of the embodiments of FIG. 5a -5 c.
  • However, because the low temperature, short word ECC encoding of the embodiments of FIGS. 5b, 5c are independent of the high temperature, long word ECC encodings such that check bits are not shared between the two modes of encoding, the high temperature encodings of the high temperature mode may be re-encoded to low temperature encodings when entering the low temperature mode from the high temperature mode.
  • FIG. 3e is directed to one example of operations which may be undertaken in preparation (block 330, FIG. 3b ) for operating in the low temperature ECC mode. In one embodiment, such preparations may include scanning (block 386, FIG. 3e ) each line of the memory array 60. Accordingly, the multiple mode encoder logic 70 (FIG. 4a ) may read (block 386, FIG. 3e ) each line of the cache memory in turn and the low temperature ECC encoder logic 430 can re-encode (block 390) the line of data in a low temperature, short word ECC encoding to prepare that line of data for low temperature operations so as to take advantage of the lower RBER expected in low temperature operation of the memory.
  • Once the line of data has been re-encoded in a low temperature, short word ECC encoding, the associated encode status bit or bits of the encode status field 510 c may be set (block 392) to indicate that the associated line had been encoded in a low temperature, short word ECC encoding. In addition, associated decode status bit or bits of the decode status field 510 b may be set (block 392) to indicate that the associated line should be decoded in a low temperature, short word ECC decoding.
  • If it is determined (block 394) that additional lines of the cache memory remain to be scanned, the scanning and re-encoding from high temperature, long word ECC encoding to low temperature, short word ECC encoding may be repeated for each line of cache memory as appropriate until all the lines of cache memory have been scanned and re-encoded. Once all the lines of cache memory have been scanned and re-encoded, the multiple mode error correction logic may proceed to handle I/O operations as described above in accordance with the low temperature mode.
  • It is seen from the above, that temperature dependent, multiple mode error correction in accordance with the present description, may reduce power consumption in a low temperature mode. Other aspects and advantages may be achieved, depending upon the particular application.
  • EXAMPLES
  • The following examples pertain to further embodiments.
  • Example 1 is apparatus, comprising: an array of memory cells; a temperature sensor coupled to the array and having an output, said temperature sensor configured to provide an output signal at said output wherein the output signal is a function of the temperature of the array of memory cells; a memory controller electrically coupled to the array of cells of the memory circuit and configured to control the array of memory cells; and multiple mode error correction code logic having an input coupled to the temperature sensor output, said multiple mode error correction code logic configured to encode write data for the array of memory cells in an error correction code in one of a plurality of error correction modes as a function of the temperature of the array of memory cells.
  • In Example 2, the subject matter of Examples 1-8 (excluding the present example) can optionally include wherein said plurality of error correction modes includes a first error correction mode at a first error correction level at a first memory cell array temperature level, and further includes a second error correction mode at a second error correction level lower than the first error correction level at a second memory cell array temperature level lower than the first memory cell array temperature level.
  • In Example 3, the subject matter of Examples 1-8 (excluding the present example) can optionally include wherein the array of memory cells includes a subarray, and wherein the multiple mode error correction code logic is configured to encode write data for the array of memory cells in the first error correction mode at the first error correction level to provide an error correction code word having a first length, at least a portion of which is stored in the subarray of memory cells, and is configured to encode write data for the array of memory cells in the second error correction mode at the second error correction level to provide an error correction code word having a second length shorter than the first length and is not stored within the subarray, is configured to decode read data from the array of memory cells encoded in an error correction code word having the first length, and is configured to decode read data from the array of memory cells encoded in an error correction code word having a second length shorter than the first length.
  • In Example 4, the subject matter of Examples 1-8 (excluding the present example) can optionally include wherein said multiple mode error correction code logic is further configured to set an encode status flag to indicate whether the write data associated with the encode status flag was encoded at the first error correction level or at the second error correction level lower than the first error correction level, and wherein said multiple mode error correction code logic is further configured to enter the first error correction mode in response to the temperature sensor output signal indicating that the temperature of the array of memory cells has exceeded a threshold level, and to scan the array of memory cells to detect whether the encode status flag has been set for write data and to re-encode write data at the first error correction level in response to a detection of the encode status flag being set for the associated write data indicating that the associated write data had been encoded at the second error correction level.
  • In Example 5, the subject matter of Examples 1-8 (excluding the present example) can optionally include wherein said multiple mode error correction code logic is further configured to reset an encode status flag for write data re-encoded at the first error correction level to indicate that the write data associated with the reset encode status flag was re-encoded at the first error correction level higher than the second error correction level.
  • In Example 6, the subject matter of Examples 1-8 (excluding the present example) can optionally include wherein the multiple mode error correction code logic is further configured to set a decode status flag for read data to indicate the error correction level at which the associated read data may be decoded.
  • In Example 7, the subject matter of Examples 1-8 (excluding the present example) can optionally include wherein said multiple mode error correction code logic is further configured to: enter the second error correction mode in response to the temperature sensor output signal indicating that the temperature of the array of memory cells has fallen below a threshold level; detect the decode status flag; read and decode data at the second error correction level in response to a detection of the decode status flag being set for the associated read data to indicate that the read data may be decoded at the second error correction level, and read and decode write data at the first error correction level in response to a detection of the decode status flag being set for the associated read data to indicate that the read data is to be decoded at the first error correction level, and reset the decode status flag to indicate that the read data may be decoded at the second error correction level.
  • In Example 8, the subject matter of Examples 1-8 (excluding the present example) can optionally include wherein the array of memory cells is a spin-transfer-torque (STT) random-access-memory (RAM) cache.
  • Example 9 is directed to a computing system for use with a display, comprising: a memory wherein the memory includes a memory circuit having an array of memory cells and a memory controller configured to control the array of memory cells; a processor configured to write data in and read data from the memory; a video controller configured to display information represented by data in the memory; a temperature sensor coupled to the array and having an output, said temperature sensor configured to provide an output signal at said output wherein the output signal is a function of the temperature of the array of memory cells; and multiple mode error correction code logic having an input coupled to the temperature sensor output, said multiple mode error correction code logic configured to encode write data for the array of memory cells in an error correction code in one of a plurality of error correction modes as a function of the temperature of the array of memory cells.
  • In Example 10, the subject matter of Examples 9-16 (excluding the present example) can optionally include wherein said plurality of error correction modes includes a first error correction mode at a first error correction level at a first memory cell array temperature level, and further includes a second error correction mode at a second error correction level lower than the first error correction level at a second memory cell array temperature level lower than the first memory cell array temperature level.
  • In Example 11, the subject matter of Examples 9-16 (excluding the present example) can optionally include wherein the array of memory cells includes a subarray, and wherein the multiple mode error correction code logic is configured to encode write data for the array of memory cells in the first error correction mode at the first error correction level to provide an error correction code word having a first length, at least a portion of which is stored in the subarray of memory cells, and is configured to encode write data for the array of memory cells in the second error correction mode at the second error correction level to provide an error correction code word having a second length shorter than the first length and is not stored within the subarray, is configured to decode read data from the array of memory cells encoded in an error correction code word having the first length, and is configured to decode read data from the array of memory cells encoded in an error correction code word having a second length shorter than the first length.
  • In Example 12, the subject matter of Examples 9-16 (excluding the present example) can optionally include wherein said multiple mode error correction code logic is further configured to set an encode status flag to indicate whether the write data associated with the encode status flag was encoded at the first error correction level or at the second error correction level lower than the first error correction level, and wherein said multiple mode error correction code logic is further configured to enter the first error correction mode in response to the temperature sensor output signal indicating that the temperature of the array of memory cells has exceeded a threshold level, and to scan the array of memory cells to detect whether the encode status flag has been set for write data and to re-encode write data at the first error correction level in response to a detection of the encode status flag being set for the associated write data indicating that the associated write data had been encoded at the second error correction level.
  • In Example 13, the subject matter of Examples 9-16 (excluding the present example) can optionally include wherein said multiple mode error correction code logic is further configured to reset an encode status flag for write data re-encoded at the first error correction level to indicate that the write data associated with the reset encode status flag was re-encoded at the first error correction level higher than the second error correction level.
  • In Example 14, the subject matter of Examples 9-16 (excluding the present example) can optionally include wherein the multiple mode error correction code logic is further configured to set a decode status flag for read data to indicate the error correction level at which the associated read data may be decoded.
  • In Example 15, the subject matter of Examples 9-16 (excluding the present example) can optionally include wherein said multiple mode error correction code logic is further configured to: enter the second error correction mode in response to the temperature sensor output signal indicating that the temperature of the array of memory cells has fallen below a threshold level; detect the decode status flag; read and decode data at the second error correction level in response to a detection of the decode status flag being set for the associated read data to indicate that the read data may be decoded at the second error correction level, and read and decode write data at the first error correction level in response to a detection of the decode status flag being set for the associated read data to indicate that the read data is to be decoded at the first error correction level, and reset the decode status flag to indicate that the read data may be decoded at the second error correction level.
  • In Example 16, the subject matter of Examples 9-16 (excluding the present example) can optionally include wherein the array of memory cells is a spin-transfer-torque (STT) random-access-memory (RAM) cache.
  • Example 17 is a method, comprising: sensing the temperature of an array of memory cells using a temperature sensor coupled to the array; and performing error correction processing of data using multiple mode error correction code logic, said processing including encoding write data for the array of memory cells in an error correction code in one of a plurality of error correction modes as a function of the temperature of the array of memory cells.
  • In Example 18, the subject matter of Examples 17-24 (excluding the present example) can optionally include wherein said encoding write data for the array of memory cells in an error correction code in one of a plurality of error correction modes includes encoding write data in a first error correction mode at a first error correction level at a first memory cell array temperature level, and further includes encoding write data in a second error correction mode at a second error correction level lower than the first error correction level at a second memory cell array temperature level lower than the first memory cell array temperature level.
  • In Example 19, the subject matter of Examples 17-24 (excluding the present example) can optionally include wherein the array of memory cells includes a subarray, and wherein the encoding write data in a first error correction mode includes providing an error correction code word having a first length, and storing at least a portion of the error correction code word having the first length in the subarray of memory cells, and encoding write data in a second error correction mode includes providing an error correction code word having a second length shorter than the first length, storing the error correction having the second length outside the subarray and suspending operation of the subarray, said processing further including decoding read data from the array of memory cells encoded in an error correction code word having the first length, and decoding read data from the array of memory cells encoded in an error correction code word having a second length shorter than the first length.
  • In Example 20, the subject matter of Examples 17-24 (excluding the present example) can optionally include wherein said processing further includes setting an encode status flag to indicate whether the write data associated with the encode status flag was encoded at the first error correction level or at the second error correction level lower than the first error correction level, and further includes entering the first error correction mode in response to the temperature sensor indicating that the temperature of the array of memory cells has exceeded a threshold level, and scanning the array of memory cells to detect the encode status flag and re-encoding write data at the first error correction level in response to a detection of the encode status flag for the associated write data indicating that the associated write data had been encoded at the second error correction level.
  • In Example 21, the subject matter of Examples 17-24 (excluding the present example) can optionally include wherein the processing further includes resetting an encode status flag for write data re-encoded at the first error correction level to indicate that the write data associated with the reset encode status flag was re-encoded at the first error correction level higher than the second error correction level.
  • In Example 22, the subject matter of Examples 17-24 (excluding the present example) can optionally include wherein the processing further includes setting a decode status flag for read data to indicate the error correction level at which the associated read data may be decoded.
  • In Example 23, the subject matter of Examples 17-24 (excluding the present example) can optionally include wherein the processing further includes: entering the second error correction mode in response to the temperature sensor indicating that the temperature of the array of memory cells has fallen below a threshold level; detecting the decode status flag; reading and decoding read data at the second error correction level in response to a detection of the decode status flag indicating that the read data may be decoded at the second error correction level, and reading and decoding read data at the first error correction level in response to a detection of the decode status flag indicating that the read data is to be decoded at the first error correction level, and resetting the decode status flag to indicate that the read data may be decoded at the second error correction level.
  • In Example 24, the subject matter of Examples 17-24 (excluding the present example) can optionally include wherein the array of memory cells is a spin-transfer-torque (STT) random-access-memory (RAM) cache.
  • Example 26 is an apparatus comprising means to perform a method as claimed in any preceding example.
  • The described operations may be implemented as a method, apparatus or computer program product using standard programming and/or engineering techniques to produce software, firmware, hardware, or any combination thereof. The described operations may be implemented as computer program code maintained in a “computer readable storage medium”, where a processor may read and execute the code from the computer storage readable medium. The computer readable storage medium includes at least one of electronic circuitry, storage materials, inorganic materials, organic materials, biological materials, a casing, a housing, a coating, and hardware. A computer readable storage medium may comprise, but is not limited to, a magnetic storage medium (e.g., hard disk drives, floppy disks, tape, etc.), optical storage (CD-ROMs, DVDs, optical disks, etc.), volatile and non-volatile memory devices (e.g., EEPROMs, ROMs, PROMs, RAMs, DRAMs, SRAMs, Flash Memory, firmware, programmable logic, etc.), Solid State Devices (SSD), etc. The code implementing the described operations may further be implemented in hardware logic implemented in a hardware device (e.g., an integrated circuit chip, Programmable Gate Array (PGA), Application Specific Integrated Circuit (ASIC), etc.). Still further, the code implementing the described operations may be implemented in “transmission signals”, where transmission signals may propagate through space or through a transmission media, such as an optical fiber, copper wire, etc. The transmission signals in which the code or logic is encoded may further comprise a wireless signal, satellite transmission, radio waves, infrared signals, Bluetooth, etc. The program code embedded on a computer readable storage medium may be transmitted as transmission signals from a transmitting station or computer to a receiving station or computer. A computer readable storage medium is not comprised solely of transmissions signals. Those skilled in the art will recognize that many modifications may be made to this configuration without departing from the scope of the present description, and that the article of manufacture may comprise suitable information bearing medium known in the art. Of course, those skilled in the art will recognize that many modifications may be made to this configuration without departing from the scope of the present description, and that the article of manufacture may comprise any tangible information bearing medium known in the art.
  • In certain applications, a device in accordance with the present description, may be embodied in a computer system including a video controller to render information to display on a monitor or other display coupled to the computer system, a device driver and a network controller, such as a computer system comprising a desktop, workstation, server, mainframe, laptop, handheld computer, etc. Alternatively, the device embodiments may be embodied in a computing device that does not include, for example, a video controller, such as a switch, router, etc., or does not include a network controller, for example.
  • The illustrated logic of figures may show certain events occurring in a certain order. In alternative embodiments, certain operations may be performed in a different order, modified or removed. Moreover, operations may be added to the above described logic and still conform to the described embodiments. Further, operations described herein may occur sequentially or certain operations may be processed in parallel. Yet further, operations may be performed by a single processing unit or by distributed processing units.
  • The foregoing description of various embodiments has been presented for the purposes of illustration and description. It is not intended to be exhaustive or to limit to the precise form disclosed. Many modifications and variations are possible in light of the above teaching.

Claims (24)

What is claimed is:
1. An apparatus, comprising:
an array of memory cells;
a temperature sensor coupled to the array and having an output, said temperature sensor configured to provide an output signal at said output wherein the output signal is a function of the temperature of the array of memory cells;
a memory controller electrically coupled to the array of cells of the memory circuit and configured to control the array of memory cells; and
multiple mode error correction code logic having an input coupled to the temperature sensor output, said multiple mode error correction code logic configured to encode write data for the array of memory cells in an error correction code in one of a plurality of error correction modes as a function of the temperature of the array of memory cells.
2. The apparatus of claim 1 wherein said plurality of error correction modes includes a first error correction mode at a first error correction level at a first memory cell array temperature level, and further includes a second error correction mode at a second error correction level lower than the first error correction level at a second memory cell array temperature level lower than the first memory cell array temperature level.
3. The apparatus of claim 2 wherein the array of memory cells includes a subarray, and wherein the multiple mode error correction code logic is configured to encode write data for the array of memory cells in the first error correction mode at the first error correction level to provide an error correction code word having a first length, at least a portion of which is stored in the subarray of memory cells, and is configured to encode write data for the array of memory cells in the second error correction mode at the second error correction level to provide an error correction code word having a second length shorter than the first length and is not stored within the subarray, is configured to decode read data from the array of memory cells encoded in an error correction code word having the first length, and is configured to decode read data from the array of memory cells encoded in an error correction code word having a second length shorter than the first length.
4. The apparatus of claim 3 wherein said multiple mode error correction code logic is further configured to set an encode status flag to indicate whether the write data associated with the encode status flag was encoded at the first error correction level or at the second error correction level lower than the first error correction level, and wherein said multiple mode error correction code logic is further configured to enter the first error correction mode in response to the temperature sensor output signal indicating that the temperature of the array of memory cells has exceeded a threshold level, and to scan the array of memory cells to detect whether the encode status flag has been set for write data and to re-encode write data at the first error correction level in response to a detection of the encode status flag being set for the associated write data indicating that the associated write data had been encoded at the second error correction level.
5. The apparatus of claim 4 wherein said multiple mode error correction code logic is further configured to reset an encode status flag for write data re-encoded at the first error correction level to indicate that the write data associated with the reset encode status flag was re-encoded at the first error correction level higher than the second error correction level.
6. The apparatus of claim 5 wherein the multiple mode error correction code logic is further configured to set a decode status flag for read data to indicate the error correction level at which the associated read data may be decoded.
7. The apparatus of claim 6 wherein said multiple mode error correction code logic is further configured to:
enter the second error correction mode in response to the temperature sensor output signal indicating that the temperature of the array of memory cells has fallen below a threshold level;
detect the decode status flag;
read and decode data at the second error correction level in response to a detection of the decode status flag being set for the associated read data to indicate that the read data may be decoded at the second error correction level, and
read and decode write data at the first error correction level in response to a detection of the decode status flag being set for the associated read data to indicate that the read data is to be decoded at the first error correction level, and reset the decode status flag to indicate that the read data may be decoded at the second error correction level.
8. The apparatus of claim 1 wherein the array of memory cells is a spin-transfer-torque (STT) random-access-memory (RAM) cache.
9. A computing system for use with a display, comprising:
a memory wherein the memory includes a memory circuit having an array of memory cells and a memory controller configured to control the array of memory cells;
a processor configured to write data in and read data from the memory;
a video controller configured to display information represented by data in the memory;
a temperature sensor coupled to the array and having an output, said temperature sensor configured to provide an output signal at said output wherein the output signal is a function of the temperature of the array of memory cells; and
multiple mode error correction code logic having an input coupled to the temperature sensor output, said multiple mode error correction code logic configured to encode write data for the array of memory cells in an error correction code in one of a plurality of error correction modes as a function of the temperature of the array of memory cells.
10. The system of claim 9 wherein said plurality of error correction modes includes a first error correction mode at a first error correction level at a first memory cell array temperature level, and further includes a second error correction mode at a second error correction level lower than the first error correction level at a second memory cell array temperature level lower than the first memory cell array temperature level.
11. The system of claim 10 wherein the array of memory cells includes a subarray, and wherein the multiple mode error correction code logic is configured to encode write data for the array of memory cells in the first error correction mode at the first error correction level to provide an error correction code word having a first length, at least a portion of which is stored in the subarray of memory cells, and is configured to encode write data for the array of memory cells in the second error correction mode at the second error correction level to provide an error correction code word having a second length shorter than the first length and is not stored within the subarray, is configured to decode read data from the array of memory cells encoded in an error correction code word having the first length, and is configured to decode read data from the array of memory cells encoded in an error correction code word having a second length shorter than the first length.
12. The system of claim 11 wherein said multiple mode error correction code logic is further configured to set an encode status flag to indicate whether the write data associated with the encode status flag was encoded at the first error correction level or at the second error correction level lower than the first error correction level, and wherein said multiple mode error correction code logic is further configured to enter the first error correction mode in response to the temperature sensor output signal indicating that the temperature of the array of memory cells has exceeded a threshold level, and to scan the array of memory cells to detect whether the encode status flag has been set for write data and to re-encode write data at the first error correction level in response to a detection of the encode status flag being set for the associated write data indicating that the associated write data had been encoded at the second error correction level.
13. The system of claim 12 wherein said multiple mode error correction code logic is further configured to reset an encode status flag for write data re-encoded at the first error correction level to indicate that the write data associated with the reset encode status flag was re-encoded at the first error correction level higher than the second error correction level.
14. The system of claim 13 wherein the multiple mode error correction code logic is further configured to set a decode status flag for read data to indicate the error correction level at which the associated read data may be decoded.
15. The system of claim 14 wherein said multiple mode error correction code logic is further configured to:
enter the second error correction mode in response to the temperature sensor output signal indicating that the temperature of the array of memory cells has fallen below a threshold level;
detect the decode status flag;
read and decode data at the second error correction level in response to a detection of the decode status flag being set for the associated read data to indicate that the read data may be decoded at the second error correction level, and
read and decode write data at the first error correction level in response to a detection of the decode status flag being set for the associated read data to indicate that the read data is to be decoded at the first error correction level, and reset the decode status flag to indicate that the read data may be decoded at the second error correction level.
16. The system of claim 9 wherein the array of memory cells is a spin-transfer-torque (STT) random-access-memory (RAM) cache.
17. A method, comprising:
sensing the temperature of an array of memory cells using a temperature sensor coupled to the array; and
performing error correction processing of data using multiple mode error correction code logic, said processing including encoding write data for the array of memory cells in an error correction code in one of a plurality of error correction modes as a function of the temperature of the array of memory cells.
18. The method of claim 17 wherein said encoding write data for the array of memory cells in an error correction code in one of a plurality of error correction modes includes encoding write data in a first error correction mode at a first error correction level at a first memory cell array temperature level, and further includes encoding write data in a second error correction mode at a second error correction level lower than the first error correction level at a second memory cell array temperature level lower than the first memory cell array temperature level.
19. The method of claim 17 wherein the array of memory cells includes a subarray, and wherein the encoding write data in a first error correction mode includes providing an error correction code word having a first length, and storing at least a portion of the error correction code word having the first length in the subarray of memory cells, and encoding write data in a second error correction mode includes providing an error correction code word having a second length shorter than the first length, storing the error correction having the second length outside the subarray and suspending operation of the subarray, said processing further including decoding read data from the array of memory cells encoded in an error correction code word having the first length, and decoding read data from the array of memory cells encoded in an error correction code word having a second length shorter than the first length.
20. The method of claim 19 wherein said processing further includes setting an encode status flag to indicate whether the write data associated with the encode status flag was encoded at the first error correction level or at the second error correction level lower than the first error correction level, and further includes entering the first error correction mode in response to the temperature sensor indicating that the temperature of the array of memory cells has exceeded a threshold level, and scanning the array of memory cells to detect the encode status flag and re-encoding write data at the first error correction level in response to a detection of the encode status flag for the associated write data indicating that the associated write data had been encoded at the second error correction level.
21. The method of claim 20 wherein the processing further includes resetting an encode status flag for write data re-encoded at the first error correction level to indicate that the write data associated with the reset encode status flag was re-encoded at the first error correction level higher than the second error correction level.
22. The method of claim 21 wherein the processing further includes setting a decode status flag for read data to indicate the error correction level at which the associated read data may be decoded.
23. The method of claim 22 wherein the processing further includes:
entering the second error correction mode in response to the temperature sensor indicating that the temperature of the array of memory cells has fallen below a threshold level;
detecting the decode status flag;
reading and decoding read data at the second error correction level in response to a detection of the decode status flag indicating that the read data may be decoded at the second error correction level, and
reading and decoding read data at the first error correction level in response to a detection of the decode status flag indicating that the read data is to be decoded at the first error correction level, and resetting the decode status flag to indicate that the read data may be decoded at the second error correction level.
24. The method of claim 17 wherein the array of memory cells is a spin-transfer-torque (STT) random-access-memory (RAM) cache.
US14/929,163 2015-10-30 2015-10-30 Temperature dependent multiple mode error correction Abandoned US20170126249A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
US14/929,163 US20170126249A1 (en) 2015-10-30 2015-10-30 Temperature dependent multiple mode error correction
PCT/US2016/055033 WO2017074648A1 (en) 2015-10-30 2016-09-30 Temperature dependent multiple mode error correction
CN201680057316.1A CN108463807B (en) 2015-10-30 2016-09-30 Temperature dependent multi-mode error correction
EP16860481.7A EP3368984B1 (en) 2015-10-30 2016-09-30 Temperature dependent multiple mode error correction

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US14/929,163 US20170126249A1 (en) 2015-10-30 2015-10-30 Temperature dependent multiple mode error correction

Publications (1)

Publication Number Publication Date
US20170126249A1 true US20170126249A1 (en) 2017-05-04

Family

ID=58632019

Family Applications (1)

Application Number Title Priority Date Filing Date
US14/929,163 Abandoned US20170126249A1 (en) 2015-10-30 2015-10-30 Temperature dependent multiple mode error correction

Country Status (4)

Country Link
US (1) US20170126249A1 (en)
EP (1) EP3368984B1 (en)
CN (1) CN108463807B (en)
WO (1) WO2017074648A1 (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107025948A (en) * 2017-05-17 2017-08-08 西安紫光国芯半导体有限公司 The memory and error correction method of the error correction of data length are indicated with flag bit
US20180061474A1 (en) * 2016-08-23 2018-03-01 SK Hynix Inc. Semiconductor devices
CN108052414A (en) * 2017-12-28 2018-05-18 湖南国科微电子股份有限公司 A kind of method and system for promoting SSD operating temperature ranges
CN109284201A (en) * 2018-09-17 2019-01-29 至誉科技(武汉)有限公司 Temperature equalization data reconstruction method and system, storage medium
US20190196726A1 (en) * 2017-12-26 2019-06-27 Nanya Technology Corporation Dynamic random access memory and method of operating the same
US10438641B2 (en) * 2018-02-01 2019-10-08 Tdk Corporation Data writing method and magnetic memory
US20230058813A1 (en) * 2021-08-19 2023-02-23 Micron Technology, Inc. Adjustable data protection scheme using artificial intelligence
US11658685B2 (en) 2021-10-05 2023-05-23 Macronix International Co., Ltd. Memory with multi-mode ECC engine

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10795767B2 (en) * 2018-12-26 2020-10-06 M31 Technology Corporation Error correcting system shared by multiple memory devices

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130318418A1 (en) * 2012-05-22 2013-11-28 Politecncio di Milano Adaptive error correction for phase change memory

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100440375C (en) * 2004-12-31 2008-12-03 晶豪科技股份有限公司 Dynamic random access memory self-refresh circuit relating with temperature
US7900120B2 (en) * 2006-10-18 2011-03-01 Micron Technology, Inc. Memory system and method using ECC with flag bit to identify modified data
US7945840B2 (en) * 2007-02-12 2011-05-17 Micron Technology, Inc. Memory array error correction apparatus, systems, and methods
US8095851B2 (en) * 2007-09-06 2012-01-10 Siliconsystems, Inc. Storage subsystem capable of adjusting ECC settings based on monitored conditions
US8296620B2 (en) * 2008-08-26 2012-10-23 Seagate Technology Llc Data devices including multiple error correction codes and methods of utilizing
US8307261B2 (en) * 2009-05-04 2012-11-06 National Tsing Hua University Non-volatile memory management method
US8510628B2 (en) * 2009-11-12 2013-08-13 Micron Technology, Inc. Method and apparatuses for customizable error correction of memory
US9070473B2 (en) * 2009-12-02 2015-06-30 Micron Technology, Inc. Refresh architecture and algorithm for non-volatile memories
US8533564B2 (en) * 2009-12-23 2013-09-10 Sandisk Technologies Inc. System and method of error correction of control data at a memory device
EP2633409A4 (en) * 2010-10-27 2014-07-23 Lsi Corp Adaptive ecc techniques for flash memory based data storage
US8621324B2 (en) * 2010-12-10 2013-12-31 Qualcomm Incorporated Embedded DRAM having low power self-correction capability
US8954822B2 (en) * 2011-11-18 2015-02-10 Sandisk Enterprise Ip Llc Data encoder and decoder using memory-specific parity-check matrix
US9246516B2 (en) * 2012-12-20 2016-01-26 Intel Corporation Techniques for error correction of encoded data
US9164832B2 (en) * 2013-02-27 2015-10-20 Seagate Technology Llc ECC management for variable resistance memory cells
US9229801B2 (en) * 2013-09-24 2016-01-05 Sandisk Technologies Inc. Method and device for write abort protection
US9690640B2 (en) * 2013-09-26 2017-06-27 Intel Corporation Recovery from multiple data errors
CN203705084U (en) * 2013-12-19 2014-07-09 深圳市日海傲迈光测技术有限公司 Distributed optical fiber temperature measurement system
US20150261602A1 (en) * 2014-03-13 2015-09-17 Katsuyuki Fujita Resistance change memory

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130318418A1 (en) * 2012-05-22 2013-11-28 Politecncio di Milano Adaptive error correction for phase change memory

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Chen US pat pub 2010/0058125; hereinafter referred to as *
Prohofsky US pat pub 2015/0052416; hereinafter referred to as *
Wu US pat 2010/0281341; hereinafter referred to as *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180061474A1 (en) * 2016-08-23 2018-03-01 SK Hynix Inc. Semiconductor devices
US10096345B2 (en) * 2016-08-23 2018-10-09 SK Hynix Inc. Semiconductor devices
CN107025948A (en) * 2017-05-17 2017-08-08 西安紫光国芯半导体有限公司 The memory and error correction method of the error correction of data length are indicated with flag bit
US20190196726A1 (en) * 2017-12-26 2019-06-27 Nanya Technology Corporation Dynamic random access memory and method of operating the same
CN109961817A (en) * 2017-12-26 2019-07-02 南亚科技股份有限公司 Dynamic random access memory and its operating method
CN108052414A (en) * 2017-12-28 2018-05-18 湖南国科微电子股份有限公司 A kind of method and system for promoting SSD operating temperature ranges
US10438641B2 (en) * 2018-02-01 2019-10-08 Tdk Corporation Data writing method and magnetic memory
CN109284201A (en) * 2018-09-17 2019-01-29 至誉科技(武汉)有限公司 Temperature equalization data reconstruction method and system, storage medium
US20230058813A1 (en) * 2021-08-19 2023-02-23 Micron Technology, Inc. Adjustable data protection scheme using artificial intelligence
US11989421B2 (en) * 2021-08-19 2024-05-21 Micron Technology, Inc. Adjustable data protection scheme using artificial intelligence
US11658685B2 (en) 2021-10-05 2023-05-23 Macronix International Co., Ltd. Memory with multi-mode ECC engine

Also Published As

Publication number Publication date
CN108463807B (en) 2021-11-23
EP3368984A4 (en) 2019-04-24
WO2017074648A1 (en) 2017-05-04
EP3368984A1 (en) 2018-09-05
CN108463807A (en) 2018-08-28
EP3368984B1 (en) 2021-10-20

Similar Documents

Publication Publication Date Title
EP3368984B1 (en) Temperature dependent multiple mode error correction
US10048877B2 (en) Predictive memory maintenance
US9912355B2 (en) Distributed concatenated error correction
US9319073B2 (en) Mitigation of write errors in multi-level cell flash memory through adaptive error correction code decoding
TWI674767B (en) Turbo product codes for nand flash
US20190149175A1 (en) Stopping criteria for layered iterative error correction
US20190068222A1 (en) Data storage apparatus and operating method thereof
US9471423B1 (en) Selective memory error reporting
KR100842680B1 (en) Ecc controller for use in flash memory device and memory system including the same
KR100845529B1 (en) Ecc controller for use in flash memory device and memory system including the same
US8650458B2 (en) Methods of encoding/decoding for error correction code utilizing interdependent portions of codewords and related circuits
US20130318418A1 (en) Adaptive error correction for phase change memory
US9836351B2 (en) Method for decoding bits in a solid state drive, and related solid state drive
US11063607B2 (en) Compressing error vectors for decoding logic to store compressed in a decoder memory used by the decoding logic
US10075191B2 (en) Method for decoding bits-by varying LLR values in a solid-state drive, and solid-state drive implementing the method
US20230049851A1 (en) Ecc memory chip encoder and decoder
US20170186500A1 (en) Memory circuit defect correction
US9286156B2 (en) Data storage device and method for processing error correction code thereof
US10431324B2 (en) Data storage device for performing decoding operation and operating method thereof
CN110716824A (en) Encoding method and memory storage device using the same
TWI829252B (en) Method and computer program product and apparatus for decoding low-density parity-check (ldpc) code
WO2023202592A1 (en) Data writing method and processing system
KR20170067656A (en) Turbo product codes for nand flash

Legal Events

Date Code Title Description
AS Assignment

Owner name: INTEL CORPORATION, CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:WU, WEI;AUGUSTINE, CHARLES;TOMISHIMA, SHIGEKI;AND OTHERS;REEL/FRAME:037413/0822

Effective date: 20151020

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION