Apparatuses and Methods for Identifying an
Extremum Value Stored in an Array of Memory Cells
Technical Field
[0001] The present disclosure relates generally to semiconductor memory and methods, and more particularly, to apparatuses and methods related to identifying an extremum value stored in an array of memory cells.
Background
[0002] Memory devices are typically provided as internal,
semiconductor, integrated circuits in computers or other electronic systems. There are many different types of memory including volatile and non-volatile memory. Volatile memory can require power to maintain its data (e.g., host data, error data, etc.) and includes random access memory (RAM), dynamic random access memory (DRAM), static random access memory (SRAM), synchronous dynamic random access memory (SDRAM), and thyristor random access memory (TRAM), among others. Non-volatile memory can provide persistent data by retaining stored data when not powered and can include NAND flash memory, NOR flash memory, and resistance variable memory such as phase change random access memory (PCRAM), resistive random access memory (RRAM), and magnetoresistive random access memory (MRAM), such as spin torque transfer random access memory (STT RAM), among others.
[0003] Electronic systems often include a number of processing resources (e.g., one or more processors), which may retrieve and execute instructions and store the results of the executed instructions to a suitable location. A processor can comprise a number of functional units such as arithmetic logic unit (ALU) circuitry, floating point unit (FPU) circuitry, and/or a combinatorial logic block (referred to herein as functional unit circuitry (FUC)), for example, which can be used to execute instructions by performing logical operations such as AND, OR, NOT, NAND, NOR, and XOR logical operations on data (e.g., one or more operands). For example, the FUC may be used to perform arithmetic operations such as addition, subtraction,
multiplication, and/or division on operands.
[0004] A number of components in an electronic system may be involved in providing instructions to the FUC for execution. The instructions may be generated, for instance, by a processing resource such as a controller and/or host processor. Data (e.g., the operands on which the instructions will be executed) may be stored in a memory array that is accessible by the FUC. The instructions and/or data may be retrieved from the memory array and sequenced and/or buffered before the FUC begins to execute instructions on the data.
Furthermore, as different types of operations may be executed in one or multiple clock cycles through the FUC, intermediate results of the instructions and/or data may also be sequenced and/or buffered.
[0005] Executing instructions (e.g, as part of program execution) can involve performing operations such as compare operations and the results can be provided (e.g., reported) to the processing resources as part of the executional flow of an algorithm, for example. Compare operations can include identifying an extremum value (e.g., a maximum and/or minimum value) of a set of N data values. The number of compare operations can be 0(N) operations due to the comparison of each element to at least one other value to determine which one is greater and/or lesser.
Brief Description of the Drawings
[0006] Figure 1 is a block diagram of an apparatus in the form of a computing system including a memory device in accordance with a number of embodiments of the present disclosure.
[0007] Figure 2 illustrates a schematic diagram of a portion of a memory array coupled to sensing circuitry in accordance with a number of embodiments of the present disclosure.
[0008] Figure 3 illustrates a schematic diagram of a portion of a memory array coupled to sensing circuitry in accordance with a number of embodiments of the present disclosure.
[0009] Figure 4 illustrates an example of a method for identifying an extremum in accordance with a number of embodiments of the present disclosure.
[0010] Figure 5 illustrates an example of a method for identifying an extremum in accordance with a number of embodiments of the present disclosure.
Detailed Description
[0011] The present disclosure includes apparatuses and methods related to identifying an extremum (e.g., a maximum or minimum) value stored in an array of memory cells. An example method can include determining a location of an extremum value of a set of N data values stored as vectors (e.g., bit vectors) in a memory array. The determination of the location of the extremum value can include a number of operations that remain constant with respect to a value of N. For example, the number of operations used to determine the extremum value can be the same for ten data values stored as vectors in a memory array as the number of operations used to determine the extremum value for one hundred data values stored as vectors in the memory array. The method can include determining the value of the extremum by reading memory cells coupled to a sense line that store the extremum value.
[0012] A number of embodiments of the present disclosure can provide benefits such as determining a location of an extremum value (e.g., a maximum value and/or a minimum value) and determining the value of the extremum. For instance, a number of embodiments can provide for determining which sense line in a memory array is coupled to memory cells that store an extremum value, where each sense line is coupled to a respective plurality of memory cells that store a bit vector that represents a base ten (10) numerical value. An access line coupled to memory cells that store most-significant data (e.g., bit data) refers to the access line that is coupled to the memory cells that store data values for the highest index of the bit vectors. For example, the bit vector 01 can represent a numerical base ten (10) value of "1" while the bit vector 11 can represent a numerical base ten (10) value of 3. In both cases (i.e., the bit vectors 01 and 11), a binary data value of "1" is stored in the memory cells corresponding to the highest index of the bit vectors. In locating an extremum value (e.g., a maximum value), the highest index access line coupled to a memory cell storing a target data value can be determined. Such an identification of a maximum and/or minimum value can be associated with performing a number of logical
operations (e.g., AND, NOT, NOR, NAND, XOR, etc.). However,
embodiments are not limited to these examples.
[0013] The number of logical operations can be performed by an accumulation of data relative to a number of access lines (e.g., see Figure 3 for discussion of an accumulator). For example, a determination of whether a target data value (e.g., a binary 1 when determining a maximum value and/or a binary 0 when determining a minimum value) is stored in a memory cell coupled to an access line can be performed. For example, if a base ten (10) numerical value is represented by the bit vector 0011 stored in the memory cells coupled to sense line 205-2 in Figure 2, and the target data value is "1", the highest index access line corresponding to that bit vector that is coupled to a memory cell storing the target data value would be the access line 204-3 (as it is coupled to memory cell 203-10, which stores the first "1" in the bit vector).
[0014] In the following detailed description of the present disclosure, reference is made to the accompanying drawings that form a part hereof, and in which is shown by way of illustration how one or more embodiments of the disclosure may be practiced. These embodiments are described in sufficient detail to enable those of ordinary skill in the art to practice the embodiments of this disclosure, and it is to be understood that other embodiments may be utilized and that process, electrical, and/or structural changes may be made without departing from the scope of the present disclosure. As used herein, the designators "N," "P," "R," etc., particularly with respect to reference numerals in the drawings, can indicate that a number of the particular features so designated can be included. As used herein, "a number of a particular thing can refer to one or more of such things (e.g., a number of memory arrays can refer to one or more memory arrays).
[0015] The figures herein follow a numbering convention in which the first digit or digits correspond to the drawing figure number and the remaining digits identify an element or component in the drawing. Similar elements or components between different figures may be identified by the use of similar digits. For example, 130 may reference element "30" in Figure 1, and a similar element may be referenced as 230 in Figure 2. As will be appreciated, elements shown in the various embodiments herein can be added, exchanged, and/or eliminated so as to provide a number of additional embodiments of the present
disclosure. In addition, as will be appreciated, the proportion and the relative scale of the elements provided in the figures are intended to illustrate certain embodiments of the present invention, and should not be taken in a limiting sense.
[0016] Figure 1 is a block diagram of an apparatus in the form of a computing system 100 including a memory device 120 in accordance with a number of embodiments of the present disclosure. As used herein, a memory device 120, a memory array 130, and/or sensing circuitry 150 might also be separately considered an "apparatus."
[0017] System 100 includes a host 110 coupled to memory device 120, which includes a memory array 130. Host 110 can be a host system such as a personal laptop computer, a desktop computer, a digital camera, a mobile telephone, or a memory card reader, among various other types of hosts. Host 110 can include a system motherboard and/or backplane and can include a number of processing resources (e.g., one or more processors, microprocessors, or some other type of controlling circuitry). The system 100 can include separate integrated circuits or both the host 110 and the memory device 120 can be on the same integrated circuit. The system 100 can be, for instance, a server system and/or a high performance computing (HPC) system and/or a portion thereof. Although the example shown in Figure 1 illustrates a system having a Von Neumann architecture, embodiments of the present disclosure can be implemented in non-Von Neumann architectures (e.g., a Turing machine), which may not include one or more components (e.g., CPU, ALU, etc.) often associated with a Von Neumann architecture.
[0018] For clarity, the system 100 has been simplified to focus on features with particular relevance to the present disclosure. The memory array 130 can be a DRAM array, SRAM array, STT RAM array, PCRAM array, TRAM array, RRAM array, NAND flash array, and/or NOR flash array, for instance. The array 130 can comprise memory cells arranged in rows coupled by access lines (which may be referred to herein as row lines, word lines or select lines) and columns coupled by sense lines (which may be referred to herein as bit lines, digit lines, or data lines). Although a single array 130 is shown in Figure 1, embodiments are not so limited. For instance, memory device 120
may include a number of arrays 130 (e.g., a number of banks of DRAM cells). An example DRAM array is described in association with Figures 2 and 3.
[0019] The memory device 120 includes address circuitry 142 to latch address signals provided over an I/O bus 156 (e.g., a data bus) through I/O circuitry 144. Address signals are received and decoded by a row decoder 146 and a column decoder 152 to access the memory array 130. Data can be read from memory array 130 by sensing voltage and/or current changes on the sense lines using sensing circuitry 150. The sensing circuitry 150 can read and latch a page (e.g., row) of data from the memory array 130. The I/O circuitry 144 can be used for bi-directional data communication with host 110 over the I/O bus 156. The write circuitry 148 is used to write data to the memory array 130.
[0020] Control circuitry 140 decodes signals provided by control bus 154 from the host 110. These signals can include chip enable signals, write enable signals, and address latch signals that are used to control operations performed on the memory array 130, including data read, data write, and data erase operations. In various embodiments, the control circuitry 140 is responsible for executing instructions from the host 110. The control circuitry 140 can be a state machine, a sequencer, or some other type of controller (e.g., an on-die controller).
[0021] An example of the sensing circuitry 150 is described further below in association with Figures 2, 3, and 4. For instance, in a number of embodiments, the sensing circuitry 150 can comprise a number of sense amplifiers (e.g., sense amplifiers 206-1,..., 206-P shown in Figure 2 or sense amplifier 306 shown in Figure 3) and a number of accumulators (e.g., accumulators 231-1 through 231-X shown in Figure 2 and accumulator 331 shown in Figure 3). As illustrated in Figure 3, the accumulators can comprise cross-coupled transistors that can serve as a data latches and can be coupled to other sensing circuitry used to perform a number of logical operations (e.g., AND, NOT, NOR, NAND, XOR, etc.). In a number of embodiments, the sensing circuitry (e.g., 150) can be used to perform an AND operation using data stored in array 130 as inputs and store the results of the logical operations back to the array 130 without transferring via a sense line address access (e.g., without firing a column decode signal). As such, various compute functions can be performed within array 130 using sensing circuitry 150 rather than being
performed by processing resources external to the sensing circuitry (e.g., by a processor associated with host 110 and/or other processing circuitry, such as ALU circuitry, located on device 120 (e.g., on control circuitry 140 or elsewhere)). In various previous approaches, data associated with an operand, for instance, would be read from memory via sensing circuitry and provided to external ALU circuitry via local I/O lines. The external ALU circuitry would perform compute functions using the operands and the result would be transferred back to the array via the local I/O lines. In contrast, in a number of embodiments of the present disclosure, sensing circuitry (e.g., 150) can be configured to perform logical operations on data stored in memory (e.g., array 130) and store the result to the memory without enabling a separate local I/O line coupled to the sensing circuitry. For example, local I/O can travel off the array 130 chip boundary to other parts of a chip or in an integrated circuit to components other than the array.
[0022] Figure 2 illustrates a schematic diagram of a portion of a memory array 201 coupled to sensing circuitry in accordance with a number of embodiments of the present disclosure. The memory cells (referred to generally as memory cells 203) of the memory array 201 are arranged in rows coupled to access lines (e.g., word lines) 204-1, 204-2, 204-3, 204-4, and 204-5 and in columns coupled to sense lines (e.g., digit lines) 205-1, 205-2, 205-3, 205-4,
205- S. For instance, access line 204-1 includes cells 203-1, 203-2, 203-3, 203-4, ..., 203-T). Memory array 201 is not limited to a particular number of access lines and/or sense lines. Although not pictured, each column of memory cells can be associated with a corresponding pair of complementary sense lines.
[0023] Each column of memory cells can be coupled to a sense amplifier
206- 1, 206-2, 206-3, 206-4, ..., 206-U. The sense amplifiers 206-1 to 206-U are coupled to input/output line 266 (I/O, e.g., local I/O) via transistors 208-1, 208- 2, 208-3, 208-4, ..., 208-V. Each column of memory cells can be coupled to an accumulator 231-1, 231-2, 231-3, 231-4, 231-X. Column decode lines 210-1 to 210-W are coupled to the gates of transistors 208-1, 208-2, 208-3, 208-4, 208-V and can be selectively activated to transfer data sensed by respective sense amps 206-1 to 206-U to the secondary sense amplifier 212.
[0024] Data values (e.g., binary values) stored in each memory cell of the array can be arranged so that a bit vector is stored vertically in the array. For
example, if the memory cells coupled to sense line 205-1 are to store a bit vector representing the base ten (10) numerical value of "1", the binary data values stored in the memory cells 203-1, 203-5, 203-9, and 203-13 could be "0," "0," "0," and "1," respectively (as shown as an example in Figure 2). As a further example, memory cells 203-2, 203-6, 203-10, and 203-14 can store data values (e.g., the binary data values of "0," "0," "1," and "1" stored in each descending memory cell coupled tosense line 205-2, respectively) to store a bit vector that represents a base ten (10) numerical value (e.g., base ten (10) value) of 3.
Continuing with the example shown in Figure 2, the memory cells coupled to sense line 205-3 can store a bit vector representing the base ten (10) numerical value of 2 (e.g., the binary data values of "0," "0," "1," and "0" could be respectively stored in memory cells 203-3, 203-7, 203-11, and 203-15, respectively) and the memory cells coupled to the sense line 205-4 could store a bit vector representing the base ten (10) numerical value of 1 (e.g., bit vector 0001 stored in respective memory cells 203-4, 203-8, 203-12, and 203-16).
[0025] As described further below, embodiments of the present disclosure can be used to determine which particular sense line(s) among a group of sense lines includes cells storing a bit vector representing an extremum value. For instance, in the example shown in Figure 2, the maximum base ten value among the bit vectors stored in the cells coupled to sense lines 205-1, 205-2, 205-3, and 205-4 is three, as represented by bit vector 0011 stored in respective cells 203-2, 203-6, 203-10, and 203-14. The base ten value of the bit vector associated with sense line 205-1 is one (e.g., 0001), with sense line 205-3 is two (e.g., 0010), and with sense line 205-4 is one (e.g., 0001). In the example shown in Figure 2, access line 204-1 (ROW 4) represents the highest index (e.g., the most significant bit location such as 2 in this example) access line associated with the stored bit vectors, and access line 204-4 (ROW 1) represents the lowest index (e.g., the least significant bit location such as 2° in this example) access line associated with the stored bit vectors. Embodiments of the present disclosure can be used to determine the location of a stored extremum value. That is, which sense line among the group of sense lines comprises cells storing the maximum data value (e.g., sense line 205-2 in this example), as well as the actual extremum value (e.g., a base ten value of 3 in this example).
[0026] Examples of pseudocode associated with identifying an extremum value stored in memory cells in an array is shown below. For instance, an example of pseudocode associated with identifying a maximum value includes:
max:
C lear Accumulator
InvertAccumulator
WriteRow(destination)
Forall bits in length {
ReadRow (src[bit])
if (AccumulatorBlockOr) {
WriteRow (destination)
exit-forall
}
}
Forall remaining bits in length {
AndRow (src[bit])
if (AccumulatorBlockOr) {
WriteRow (destination)
} else {
ReadRow (destination)
}
}
Example pseudocode associated with identifying a minimum value can include:
Min:
C lear Accumulator
InvertAccumulator
WriteRow (destination)
Forall bits in length {
ReadRow (src[bit])
InvertAccumulator
If (AccumulatorBlockOr) {
WriteRow (destination)
exit-forall
}
}
Forall remaining bits in length {
ReadRow (src[bit])
InvertAccumulator
AndRow (destination)
If (AccumulatorBlockOr) {
WriteRow (destination)
}
}
[0027] As illustrated in the pseudocode shown above, identifying an extremum value can include an initial operation to clear the accumulators (e.g., accumulators 231-1 through 231-X) coupled to the sense lines 205-1 through 205-S. Clearing the accumulators can include storing a known data value (e.g., "1" or "0") in the accumulators. In some examples, clearing the accumulators can include reading known data values stored in memory cells coupled to a particular access line into the accumulators. For example, memory cells coupled to a particular access line could store a binary value of "1 ," and the binary "1" could be read into each accumulator such that all accumulatorsstore binary "1.". In this way, the accumulators can be set to a known data value in order to zero out data values that may have been previously stored in the accumulators. As indicated in the example pseudocode above, the accumulators can also be operated to invert the data values stored therein (e.g., a binary "0" can be inverted to a binary "1" and vice versa) in association with identifying an extremum value. Operation of the accumulators (e.g., 231-1 to 231-X) will be described further in association with Figure 3.
[0028] The data values stored in the accumulators (e.g., after clearing and/or inverting), can be written to memory cells coupled to a particular access line (e.g., a destination access line such as access line 204-5), as indicated by "WriteRow (destination)" in the example pseudocode shown above. For instance, the access line (e.g., 204-5) corresponding to the "destination row" can be activated, and the data values stored in the accumulators 231-1 through 231-X can be driven onto the corresponding sense lines and written to the
corresponding memory cells coupled to the destination row (e.g., ROW 0). Therefore, the cells coupled to the destination row can store a known data value (e.g., "1" or "0"), which can then be used as an initial bit-mask in association with identifying an extremum value as described further below.
[0029] In a number of embodiments, identifying an extremum value can include determining the access line with a highest associated index having one or more cells storing a target data value (e.g., binary "1"). For instance, a determination can be made as to whether the access line (e.g., 204-1) having the highest associated index (e.g., the most-significant access line) includes a cell storing the target data value. The determination as to whether the most- significant access line includes cells storing the target data value can be made,
for instance, by reading the data values stored in the memory cells coupled to the most-significant access line 204-1 into the accumulators 231-1 through 231-X (e.g., as shown in the above pseudocode as "ReadRow(src[bit])". An "OR" operation can be performed to determine whether one or more of the
accumulators store the target data value (e.g., as indicated by
"if(AccumulatorBlockOr)" shown in the above pseudocode).
[0030] An example "OR" operation (which may be referred to herein as a "BlockOR" operation or an "AccumulatorBlockOr") can include determining whether a target data value (e.g., "1") is stored in one or more memory cells coupled to a particular access line (e.g., 204-1). Performing a BlockOR operation can include charging (e.g., precharging) the local I/O line 266 coupled to the secondary sense amplifier 212 to a particular voltage. The I/O line 266 can be precharged (e.g., via control circuitry such as control circuitry 140 shown in Figure 1 and/or sensing circuitry such as circuitry 150 shown in Figure 1) to a voltage such as a supply voltage (e.g., Vcc) or a ground voltage (e.g., 0V).
[0031] A BlockOR operation can include sensing (e.g., reading) the cells coupled to a selected access line using the sense amplifiers (e.g., 206-1 to 206- U). The sense amplifiers can amplify a differential voltage signal (e.g., between complementary sense lines such as 305-1 and 305-2 shown in Figure 3) corresponding to a particular data value (e.g., "1" or "0"). For example, a supply voltage (e.g., Vcc) may correspond to a logic 1 and a ground voltage may correspond to a logic 0. The sensed data values can be stored in the
corresponding accumulators (e.g., 231-1 to 231-X).
[0032] To perform a BlockOR operation, the column decode lines (e.g.,
210-1 to 210-W) coupled to the selected cells can be activated in parallel (e.g., such that respective transistors 208-1 to 208-V are turned on) in order to transfer the voltages on the corresponding sense lines to the local I/O line 266. The sensing circuitry (e.g., SSA 212) can sense whether the precharged voltage of the local I/O line 266 changes (e.g., by more than a threshold amount) responsive to activation of the column decode lines.
[0033] For instance, if the I/O line 266 is precharged to Vcc and one or more of the selected cells coupled to the selected access line stores a logic 0 (e.g., 0V), then the SSA 212 can sense a pull down (e.g., decrease) of the voltage on I/O line 266. Alternatively, if the I/O line 266 is precharged to a ground
voltage and one or more of the selected cells coupled to the selected access line stores a logic 1 (e.g., Vcc), then the SSA 212 can sense a pull up (e.g., increase) of the voltage on I/O line 266. The determination of whether one or more selected cells coupled to a selected access line stores a particular data value is effectively performing a logic "OR" operation. In this manner, voltages corresponding to data sensed by the sense amps 206-1 to 206-U and/or stored in accumulators 231-1 to 231-X can be transferred, in parallel, to the local I/O line 266 and sensed by SSA 212 as part of a BlockOR operation. Embodiments of the present disclosure are not limited to particular precharge voltages of local I/O line 266 and/or to particular voltage values corresponding to logic 1 or logic 0.
[0034] It is noted that the BlockOR operation may be performed on data values stored in the accumulators or on data values stored in the sense amplifiers. For instance, as shown in Figure 3, the sensing circuitry includes pass transistors 307-1 and 307-2 that can be turned off to decouple the cross- coupled transistor pairs 308-1/308-2 and 309-1/309-2 from the complementary sense lines 305-1/305-2. As such, data values sensed by sense amps 206-1 to 206-U may be transferred to SSA 212, without first being stored in accumulators 231-1 to 231-X, in a number of embodiments. Embodiments are not limited to the particular BlockOR operation described above.
[0035] In the example shown in Figure 2, a BlockOR operation performed on the data stored in cells 203-1, 203-2, 203-3, and 203-4 coupled to the most-significant access line 204-1 will result in a determination that none of the cells stores the target data value (e.g., "1"). As such, the SSA 212 can store a "0," which can indicate the result of the BlockOR. The result of the BlockOR can be transferred from SSA 212 to other sensing circuitry and/or to control circuitry (e.g., control circuitry 140 shown in Figure 1). Since none of the cells coupled to the most-significant access line 204-1 store the target data value, the "WriteRow(destination)" shown in the "if (AccumulatorBlockOR)" loop does not occur, and a BlockOR is performed on the data stored in the cells 203-5, 203-6, 203-7, and 203-8 coupled to the next most-significant access line 204-2. For instance, the data stored by the cells coupled to access line 204-2 are read into the accumulators 231-1 to 231-4, and the corresponding voltages are provided to local I/O line 266 via activation of column decode lines 210-1 to 210-4. Since none of the cells coupled to access line 204-2 store a "1," the SSA
212 will not sense a threshold amount of voltage change on I/O line 266 and the SSA 212 can again store a "0," indicating the result of the BlockOR operation. Since none of the cells coupled to access line 204-2 stored the target data value, a BlockOR is performed on the cells 203-9, 203-10, 203-11, and 203-12 coupled to the next most-significant access line 204-3. In this example, cells 203-10 and 203-11 store the target data value (e.g., logic "1"). As such, the SSA will detect a change in the precharged voltage of local I/O line 266 as part of the BlockOR operation, indicating that one or more of the cells stores the target data value. Therefore, the SSA 212 can store a "1," which can indicate the result of the BlockOR (e.g., that one or more cells was determined to store a logic "1"). The result indicates that access line 204-3 is the highest index access line having one or more cells coupled thereto storing the target data value.
[0036] In accordance with the example pseudocode shown above, the data values stored in cells 203-9, 203-10, 203-11, and 203-12 of access line 204- 3 are written to the corresponding cells of destination row 204-5 (e.g., as part of the "WriteRow(destination)" step in the first "if clause). As such, the data values stored in cells 203-9, 203-10, 203-11, and 203-12, which were read into respective accumulators 231-1, 231-2, 231-3, and 231-4, are written to the cells 203-17, 203-18, 203-19, and 203-20 of destination row 204-5. Therefore, upon exiting the "forall bits in length" loop shown in the above pseudocode, cells 203- 17, 203-18, 203-19, and 203-20 store "0," "1," "1," and "0," respectively, and those data values can serve as a bit mask for the "forall remaining bits in length" loop indicated in the above pseudocode.
[0037] In a number of embodiments, identifying an extremum value can include performing a number of AND operations in association with determining which sense line from among a number of sense lines comprises cells coupled thereto storing the extremum value. For example, the "forall remaining bits in length" loop shown in the above pseudocode can be used. As described below, and as indicated in the above pseudocode, a number of AND operations and BlockOR operations can be performed on a next-most-significant access line basis. During the process of performing the operations on the access line by access line basis, the data values stored in the cells of the destination row can serve as a bit mask for subsequent access lines. A result indicating which one or more of the sense lines includes cells coupled thereto storing the extremum value
can be stored in the cells of the destination row (e.g., 204-5), such that after the above process is performed on the least significant access line, the destination row (e.g., 204-5) can be read (e.g., see "ReadRow(destination)" in the above pseudocode) to determine the sense line(s) storing the extremum value (in their respective cells).
[0038] As an example, an AND operation can be performed (e.g.,
"ANDRow(src[bit])" shown in the above pseudocode), by using the
corresponding accumulators 231-1 through 231-4, on the data values stored in the destination row 204-5 (corresponding to the data stored in memory cell 203- 9 through 203-12 that are coupled to the access line storing most- significant bit data) and the data values stored in the memory cells coupled to the access line storing next-most significant bit data (e.g., the memory cells coupled to access line 204-4). The result of the AND operations can be stored in the
corresponding accumulators. Further discussion of operation of the
accumulator(s) to perform an AND operation is described below in association with Figure 3.
[0039] In the above example, the binary data values "0," "1," "1," and
"0" stored in respective memory cells 203-17, 203-18, 203-19, and 203-20 of destination row 204-5, and serving as a bit-mask, can be AND-ed with the data values "1," "1," "0," and "1" stored in the corresponding respective memory cells 203-13, 203-14, 203-15, and 203-16 coupled to access line 204-4. The results of the AND operations can be stored in the corresponding accumulators 231-1 to 231-4. In this example, an AND operation results in the accumulator 231-1 storing a "0" (e.g., the result of ANDing binary value "0" stored in cell 203-17 and binary value "1" stored in cell 203-13). The AND operation also results in the accumulator 231-2 storing a "1" (e.g., the result of ANDing binary value "1" stored in cell 203-18 and binary value "1" stored in cell 203-14), the accumulator 231-3 storing a "0" (e.g., the result of ANDing binary value "1" stored in cell 203-19 and binary value "0" stored in cell 203-15), and
accumulator 231-4 storing a "0" (e.g., the result of ANDing binary value "0" stored in cell 203-20 and binary value "1" stored in cell 203-16). In other words, the output of the AND operation can be binary values "0," "1," "0," and "0," which would be stored in accumulators 231-1, 231-2, 231-3, and 231-4, respectively.
[0040] As indicated by the second "if statement (e.g., "if
(AccumulatorBlockOr)") shown in the above pseudocode, a BlockOR operation can be performed on the "ANDed" data stored in the accumulators. The BlockOR operation can be used to determine whether one or more of the output values of the AND operation performed on the cells of a particular access line and the cells of the bit mask (e.g., the cells of the destination row) includes the target data value (e.g., "1"). If the BlockOR result of the ANDed values is a binary "1," (e.g., one or more of the cells of the destination row stores a "1" and the cell it is ANDed with also stores a "1") then the result of the AND operations (which are stored in the corresponding accumulators) are written to the cells of the destination row (e.g., "WriteRow(destination)"). The data values stored in the cells of the destination row then serve as a bit mask for the next subsequently less significant access line, or they indicate (via a stored value of "1") which sense line(s) store the extremum value of the corresponding stored bit vectors. If the BlockOR result of the ANDed values is a binary "0," (e.g., none of the cells being ANDed with the cells of the destination row store a "1" and match a "1" stored in destination row), then the result of the AND operations (which are stored in the corresponding accumulators) are not written to the cells of the destination row (e.g., the bit mask stored in the cells of the destination row remains unchanged). The unchanged data values stored in the cells of the destination row still serve as a bit mask for the next subsequently less significant access line, or they indicate (via a stored value of "1") which sense line(s) store the extremum value of the corresponding bit vectors stored in the cells of the respective sense lines. As the above process continues on a next less significant access line basis, the number of binary "Is" stored in the cells of the destination row is reduced until at the end of the process, the cells of the destination row (e.g., the bit mask) that still store a "1" indicate which sense line(s) store the extremum value.
[0041] With reference to the example shown in Figure 2, the result of a
BlockOr performed on the data values stored in the accumulators 231-1, 231-2, 231-3, and 231-4 (e.g., the bit mask values stored in cells 203-17, 203-18, 203- 19, and 203-20 of destination row 204-5) ANDed with the values stored in the cells 203-13, 203-14, 203-15, and 203-15 of the selected access line 204-4 is a binary "1." That is, after the AND operation, the accumulators 231-1, 231-2,
231-3, and 231-4 store the data values "0," "1," "0," and "0," respectively. As such, since one or more of the ANDed data values is a "1," the result of the BlockOr is "1." Since the BlockOr results in a "1 ," the data values currently stored in the accumulators are written to the corresponding destination row cells such that cells 203-17, 203-18, 203-19, and 203-20 of destination row 204-5 store the data values "0," "1," "0," and "0," respectively. Furthermore, since access line 204-4 is the least significant access line, the data values stored in the cells of the destination row 204-5 now indicate the sense line having cells storing the extremum value (e.g., a maximum value in this example).
[0042] In a number of embodiments, the value of the extremum can be identified. For instance, after completion of the above process, the data values stored in the cells of the destination row (e.g., the bit mask) can be read and can be reported to control circuitry (e.g., control circuitry 140 shown in Figure 1). In this example, the target value of "1" would be stored in cell 203-18, while the data values of cells 203-17, 203-19, and 203-20 would store a "0." As such, the sense line 205-2 is determined to be the sense line from among the group of sense lines (e.g., 105-1 to 205-4) that includes cells storing the bit vector corresponding to the extremum (e.g., binary value "0011" or decimal value "3"). The value of the data stored as a bit vector can be identified, for instance, by reading the cells coupled to the sense line(s) determined to include cells storing the extremum. In this example, cells 203-2, 203-6, 203-10, and 203-14 can be read and the data values stored therein can be reported to control circuitry, or instance, to identify the value of bit vector stored in those cells. The above referenced operations are not limited to determining a maximum value and can determine a number of extremum values. For instance, example pseudocode associated with determining a minimum value is illustrated above.
[0043] Determining a minimum value among a set of data stored as vectors in an array can be done in a similar manner as that described above in association with determining a maximum value. However, in determining a minimum value, a binary "0" can be used as the target value. As such, performing BlockOr and/or AND operations as described above can include determining whether one or more cells stores a binary "0" as opposed to a binary "1." In a number of embodiments, the accumulators can be operated to invert data values stored therein. That is, a stored value of "1" in an accumulator can
be inverted to a "0" and a stored value of "0" in an accumulator can be inverted to a "1." An invert operation will be described further below in association with Figure 3.
[0044] As one example, assume that as part of identifying a minimum data value among a number of data values stored as vectors, one wants to determine whether one or more of four cells coupled to a particular access line stores binary "0" (e.g., as part of the "forall bits in length" loop shown in the pseudocode above). Also assume that the cell coupled to a first sense line stores the data value "1," the cell coupled to a second sense line stores the data value "1," the cell coupled to a third sense line stores the data value "0," and the cell coupled to a fourth sense line stores the data value "1." In order to determine whether one or more of the cells stores a "1," the data can be read into four corresponding accumulators and a BlockOr operation can be performed, as described above. However, in order to determine whether one or more of the cells stores a "0," the data read into the accumulators can be inverted such that the accumulator coupled to the first sense line stores a "0," the accumulator coupled to the second sense line stores a "0," the accumulator coupled to the third sense line stores a "1," and the accumulator coupled to the fourth sense line stores a "0." That is, the BlockOr operation is performed on the inverse of the data values read into the accumulators from the cells of the particular access line. Essentially, inverting the data values stored in the accumulators as part of the execution of the above pseudocode provides a manner for determining whether one or more particular cells stores a binary "0" as opposed to determining whether one or more particular cells stores a binary "1." Therefore, a similar method used for determining a maximum stored bit vector value can be used for determining a minimum stored bit vector value.
[0045] Figure 3 illustrates a schematic diagram of a portion of a memory array 330 coupled to sensing circuitry in accordance with a number of embodiments of the present disclosureln this example, the memory array 330 is a DRAM array of 1T1C (one transistor one capacitor) memory cells each comprised of an access device 302 (e.g., transistor) and a storage element 303 (e.g., a capacitor). Embodiments, however, are not limited to this example and other storage element array types may be included, e.g., cross point arrays having PCRAM memory elements, etc. The cells of array 330 are arranged in
rows coupled by word lines 304-0 (RowO), 304-1 (Rowl), 304-2, (Row2) 304-3 (Row3), ..., 304-N (RowN) and columns coupled by sense lines (e.g., digit lines) 305-1 (D) and 305-2 (D_). In this example, each column of cells is associated with a pair of complementary sense lines 305-1 (D) and 305-2 (D_).
[0046] In a number of embodiments, an accumulator (e.g., 331) can comprise a number of transistors formed on pitch with the transistors of a sense amp (e.g., 306) and/or the memory cells of the array (e.g., 330), which may
2 2
conform to a particular feature size (e.g., 4F , 6F , etc.). As described further below, the accumulator 331 can, in conjunction with the sense amp 306, operate to perform various compute operations using data from array 330 as input and store the result back to the array 330 without transferring the data via a sense line address access (e.g., without firing a column decode signal such that data is transferred to circuitry external from the array and sensing circuitry via local local I/O lines (e.g., 266-1 in Fig. 2).
[0047] In the example illustrated in Figure 3, the circuitry corresponding to accumulator 331 comprises five transistors coupled to each of the sense lines D and D_; however, embodiments are not limited to this example. Transistors
307- 1 and 307-2 have a first source/drain region coupled to sense lines D and D_, respectively, and a second source/drain region coupled to a cross coupled latch (e.g., coupled to gates of a pair of cross coupled transistors, such as cross coupled NMOS transistors 308-1 and 308-2 and cross coupled PMOS transistors 309-1 and 309-2. As described further herein, the cross coupled latch comprising transistors 308-1,308-2, 309-1, and 309-2 can be referred to as a secondary latch (the cross coupled latch corresponding to sense amp 306 can be referred to herein as a primary latch).
[0048] The transistors 307-1 and 307-2 can be referred to as pass transistors, which can be enabled via respective signals 311-1 (Passd) and 311-2 (Passdb) in order to pass the voltages or currents on the respective sense lines D and D_ to the inputs of the cross coupled latch comprising transistors 308-1,
308- 2, 309-1, and 309-2 (e.g., the input of the secondary latch). In this example, the second source/drain region of transistor 307-1 is coupled to a first source/drain region of transistors 308-1 and 309-1 as well as to the gates of transistors 308-2 and 309-2. Similarly, the second source/drain region of
transistor 307-2 is coupled to a first source/drain region of transistors 308-2 and 309-2 as well as to the gates of transistors 308-1 and 309-1.
[0049] A second source/drain region of transistor 308-1 and 308-2 is commonly coupled to a negative control signal 312-1 (Accumb). A second source/drain region of transistors 309-1 and 309-2 is commonly coupled to a positive control signal 312-2 (Accum). The Accum signal 312-2 can be a supply voltage (e.g., Vcc) and the Accumb signal can be a reference voltage (e.g., ground). Enabling signals 312-1 and 312-2 activates the cross coupled latch comprising transistors 308-1, 308-2, 309-1, and 309-2 corresponding to the secondary latch. The activated sense amp pair operates to amplify a differential voltage between common node 317-1 and common node 317-2 such that node 317-1 is driven to one of the Accum signal voltage and the Accumb signal voltage (e.g., to one of Vcc and ground), and node 317-2 is driven to the other of the Accum signal voltage and the Accumb signal voltage. As described further below, the signals 312-1 and 312-2 are labeled "Accum" and "Accumb" because the secondary latch can serve as an accumulator while being used to perform a logical operation (e.g., an AND operation). In a number of embodiments, an accumulator comprises the cross coupled transistors 308-1, 308-2, 309-1, and 309-2 forming the secondary latch as well as the pass transistors 307-1 and 308- 2.
[0050] In this example, the accumulator 331 also includes inverting transistors 314-1 and 314-2 having a first source/drain region coupled to the respective digit lines D and D_. A second source/drain region of the transistors 314-1 and 314-2 is coupled to a first source/drain region of transistors 316-1 and 316-2, respectively. The second source/drain region of transistors 316-1 and 316-2 can be coupled to a ground. The gates of transistors 314-1 and 314-2 are coupled to a signal 313 (InvD). The gate of transistor 316-1 is coupled to the common node 317-1 to which the gate of transistor 308-2, the gate of transistor 309-2, and the first source/drain region of transistor 308-1 are also coupled. In a complementary fashion, the gate of transistor 316-2 is coupled to the common node 317-2 to which the gate of transistor 308-1, the gate of transistor 309-1, and the first source/drain region of transistor 308-2 are also coupled. As such, an invert operation can be performed by enabling signal InvD, whichinverts the
data value stored in the secondary latch and drives the inverted value onto sense lines 305-1 and 305-2.
[0051] In a number of embodiments, and as indicated above in association with Figure 2,the accumulator can be used to perform AND operations in association with identifying an extremum value. For example, a data value stored in a particular cell can be sensed by a corresponding sense amp 306. The data value can be transferred to the data latch of the accumulator 331 by activating the Passd (311-1) and Passdb (311-2) signals as well as the Accumb (312-1) and Accum signals (312-2). To AND the data value stored in the accumulator with a data value stored in a different particular cell coupled to a same sense line, the access line to which the different particular cell is coupled can be activated. The sense amp 306 can be activated (e.g., fired), which amplifies the differential signal on sense lines 305-1 and 305-2. Activating only Passd (311-1) (e.g., while maintaining Passdb (311-2) in a deactivated state) results in accumulating the data value corresponding to the voltage signal on sense line 305-1 (e.g., Vcc corresponding to logic "1" or ground corresponding to logic "0"). The Accumb and Accum signals remain activated during the AND operation.
[0052] Therefore, if the data value stored in the different particular cell
(and sensed by sense amp 306) is a logic "0", then value stored in the secondary latch of the accumulator is asserted low (e.g., ground voltage such as 0V), such that it stores a logic "0." However, if the value stored in the different particular cell (and sensed by sense amp 306) is not a logic "0," then the secondary latch of the accumulator retains its previous value. Therefore, the accumulator will only store a logic "1" if it previously stored a logic "1" and the different particular cell also stores a logic "1." Hence, the accumulator 331 is operated to perform a logic AND operation. As noted above, the invert signal 313 can be activated in order to invert the data value stored by the accumulator 331, which can be used, for example, in identifying a minimum data value as described above.
[0053] Figure 4 illustrates an example of a method for identifying an extremum value in accordance with a number of embodiments of the present disclosure. At block 470, the method includes determining a location of an extremum value of a set of N stored as vectors in a memory array. Determining a location of the extremum value can include a number of operations that remain
constant with respect to a value of N. That is, the number of operations to determine a location of an extremum value can remain constant as the value of N is increased and/or decreased. The set of N can be stored as bit vectors in the memory array. Each bit vector can represent a numerical base ten (10) number. For example, a bit vector of four memory cells (e.g., memory cells 203-2, 203-6, 203-10, and 203-14 in Figure 2) can store a binary value in each of the four memory cells (e.g., binary values "0," "0," "1," and "1," respectively) representing a base ten (10) numerical value (e.g., numerical value 3).
[0054] The number of operations to determine the extremum value can change with respect to a number of row lines used to encode numerical values (e.g., base ten (10) numerical values) in a vector (e.g., a bit vector). For example, a base ten (10) numerical value stored vertically in 10 memory cells coupled to a sense line (corresponding to 10 binary values representing the base ten (10) numerical value) can take a different number of operations (e.g., fewer operations) to determine an extremum value than a base ten (10) numerical value stored vertically in 100 memory cells. The number of operations to determine the extremum value can include determining whether a target data value is stored in a memory cell. In one example, the target data value can include a binary value of "1" when determining a location of a maximum value. In one example, the target data value can include a binary value of "0" when determining a minimum value. However, embodiments are not limited to a particular target binary value when representing an extremum value.
[0055] At block 472, the method can include determining the extremum value by reading memory cells coupled to a sense line based on the determined location of the extremum value. For example, in Figure 2, memory cell 203-14 can be determined to store a target data value (e.g., binary value of "1") from an AND operation (e.g., performing an AND operation of binary value "1" in memory cell 203-14 with binary value "1" in memory cell 203-10) indicating a maximum value. The indication can be based on a number of operations that determine memory cells coupled to other sense lines do not include the target data value while performing an operation.
[0056] Figure 5 illustrates an example of a method for identifying an extremum value in accordance with a number of embodiments of the present disclosure. The extremum value can include a maximum value or a minimum
value. When determining a maximum value, a target data value can include a binary 1. When determining a minimum value, a target data value can include a binary 0. A group of memory cells can include a number of cells coupled to a particular access line (e.g., row) of an array of memory cells. A vector can be stored in cells coupled to a respective sense line (e.g., column) of the array of memory cells and each memory cell in the row can be coupled to a respective sense line of a plurality of sense lines. A plurality of extremums can be determined. For example, if there are a plurality of sense lines storing a same extremum bit-vector value in cells coupled thereto, a plurality of extremums can be determined and located. A determination of a plurality of sense lines that are coupled to memory cells in the group of memory cells that stored the target data value can be performed.
[0057] At block 574, the method can include sensing, with the sensing circuitry coupled to the memory cells, data stored in a group of the memory cells coupled to a particular access line. The sensing can include reading a row of memory cells of an array of memory cells. The method can include reading data stored in memory cells coupled to the sense line to determine a value of an extremum. The sensed data stored in a group of the memory cells (e.g., memory cells 203-9 through 203-12 in Fig. 2 ("Row 2") can be stored in a group of memory cells coupled to a destination access line (e.g., access line 204-5 in Fig. 2 ("Row 0")). The method can include storing a bit mask in the group of memory cells coupled to the destination access line prior to sensing the data stored in the group of memory cells coupled to the particular access line. The bit mask can include the data stored in the group of memory cells coupled to the particular access line.
[0058] The method can include inverting data values stored in a number of accumulators associated with the group of memory cells prior to sensing the data stored in the group of memory cells coupled to the particular access line. The method can include clearing out data values previously stored in the number of accumulators by storing default data values prior to inverting the default data values stored in the number of accumulators. The method can include performing a second inversion of the data values stored in the number of accumulators that correspond to data stored in the group of memory cells coupled to the particular access line values stored in the number of accumulators
after sensing the data stored in the group of memory cells coupled to the particular access line and before performing the operation with the sensing circuitry to determine if the sensed data includes the target data value. The method can include inverting the data stored in the other access line
corresponding to a next-lower index of the vector prior to comparing the data stored in the group of memory cells coupled to the destination access line to the data stored in the group of memory cells coupled to the other access line to provide the output data.
[0059] At block 576, the method can include performing an operation with the sensing circuitry to determine if the sensed data includes a target data value. Performing an operation with the sensing circuitry to determine if the sensed data includes the target data value can include performing a BlockOR operation with an accumulator of the sensing circuitry as described above. The BlockOR operation performed with an accumulator of the sensing circuitry can include charging an I/O line to a level corresponding to the target data value. The BlockOR operation can include transferring the sensed dtat from a plurality of sense amplifiers to the I/O line. The BlockOR operation can include determining whether the level of the I/O line changes responsive to transferring the sensed data. Determining whether the level of the I/O line changes can include detecting, with a secondary sense amplifier, whether the level changes by at least a threshold amount. The level changing by a threshold amount can indicate that the sensed data stores the target data value.
[0060] At block 578, the method can include determining, if the sensed data includes the target data value, a location of a sense line that is coupled to a memory cell in a group of memory cells that stores a target data value. The determination can include a comparison (e.g., performing an AND operation) of data stored in a group of memory cells coupled to the destination access line (e.g., access line 204-5 in Fig. 2) to data stored in a group of memory cells coupled to another access line (e.g., access line 204-4 in Fig. 2 ("Row 1")) to provide output data. The another access line can correspond to a lower index of the vectors than the particular access line (e.g., memory cells coupled to access line 204-4 store a lower index of values than the memory cells coupled to access line 204-3).
[0061] An operation (e.g., a BlockOR operation) can be performed on the output data from the comparison using sensing circuitry to determine if the output data includes a target data value (e.g., a binary value "1"). If the output data includes a target data value, a determination can be made to determine a location of a sense line that is coupled to a memory cell in the group of memory cells coupled to the other access line that stores the target data value.
Determining a location of the sense line that is coupled to the memory cell in the group of memory cells coupled to a particular access line that stores the target data value can include comparing the data stored in the group of memory cells coupled to the destination access line to data stored in a group of the memory cells coupled to a next-most significant access line to provide additional output data. The next-most significant access line can correspond to a lower index of the vectors than the other access line. Determining a location of the sense line that stores the target data value can include performing an operation with the sensing circuitry to determine if the additional output data includes the target data value (e.g., an additional BlockOR operation). If the additional output data includes the target data value, a determination of a location of a sense line that is coupled to a memory cell in the group of memory cells coupled to the next-most significant aceess line that stores the target data value can be performed.
[0062] The comparison of a most-significant access line storing data values with a next-most significant access line storing data values can include repeating the comparison and the determination whether the additional output data includes the target data value for each access line corresponding to a lower index of the vectors than the next-most significant access line and, if the respective output data includes the target data value, determining a location of a sense line that is coupled to a memory cell in the group of memory cells coupled to the respective access line that stores the target data value.
[0063] If the sense data does not include a target data value, the method can include sensing, with the sensing circuitry coupled to the array of memory cells, data stored in a group of the memory cells coupled to another access line. The other access line can correspond to a lower index of the vectors than the particular access line. If the sense data does not include a target data value, the method can include performing an operation with sensing circuitry to determine if the data sensed from the group of memory cells coupled to the other access
line includes the target data value (e.g., performing a BlockOR operation on the memory cells coupled to the next-most significant access line). If the data sensed from the group of memory cells coupled to the other access line includes the target data value, the method can include determining a location of a sense line that is coupled to a memory cell in the group of memory cells coupled to the other access line that stores the target data value.
[0064] Although specific embodiments have been illustrated and described herein, those of ordinary skill in the art will appreciate that an arrangement calculated to achieve the same results can be substituted for the specific embodiments shown. This disclosure is intended to cover adaptations or variations of one or more embodiments of the present disclosure. It is to be understood that the above description has been made in an illustrative fashion, and not a restrictive one. Combination of the above embodiments, and other embodiments not specifically described herein will be apparent to those of skill in the art upon reviewing the above description. The scope of the one or more embodiments of the present disclosure includes other applications in which the above structures and methods are used. Therefore, the scope of one or more embodiments of the present disclosure should be determined with reference to the appended claims, along with the full range of equivalents to which such claims are entitled.
[0065] In the foregoing Detailed Description, some features are grouped together in a single embodiment for the purpose of streamlining the disclosure. This method of disclosure is not to be interpreted as reflecting an intention that the disclosed embodiments of the present disclosure have to use more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive subject matter lies in less than all features of a single disclosed embodiment. Thus, the following claims are hereby incorporated into the Detailed Description, with each claim standing on its own as a separate embodiment.