# intel®

### Intel Celeron<sup>TM</sup> Processor Specification Update

Release Date: April 15, 1998

Order Number: 243748-001

The Intel Celeron<sup>™</sup> processor may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are documented in this Specification Update.

Information in this document is provided in connection with Intel products. No license, express or implied, by estoppel or otherwise, to any intellectual property rights is granted by this document. Except as provided in Intel's Terms and Conditions of Sale for such products, Intel assumes no liability whatsoever, and Intel disclaims any express or implied warranty, relating to sale and/or use of Intel products including liability or warranties relating to fitness for a particular purpose, merchantability, or infringement of any patent, copyright or other intellectual property right. Intel products are not intended for use in medical, life saving, or life sustaining applications.

Intel may make changes to specifications and product descriptions at any time, without notice.

Designers must not rely on the absence or characteristics of any features or instructions marked "reserved" or "undefined." Intel reserves these for future definition and shall have no responsibility whatsoever for conflicts or incompatibilities arising from future changes to them.

The Intel Celeron<sup>™</sup> processor may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request.

The Specification Update should be publicly available following the last shipment date for a period of time equal to the specific product's warranty period. Hardcopy Specification Updates will be available for one (1) year following End of Life (EOL). Web access will be available for three (3) years following EOL.

Contact your local Intel sales office or your distributor to obtain the latest specifications and before placing your product order.

Copies of documents which have an ordering number and are referenced in this document, or other Intel literature, may be obtained by calling 1-800-548-4725 or by visiting Intel's website at http://www.intel.com

Copyright © Intel Corporation 1998.

\* Third-party brands and names are the property of their respective owners.

### CONTENTS

| REVISION HISTORY                                              | v  |
|---------------------------------------------------------------|----|
| PREFACE                                                       | vi |
| Specification Update for Intel Celeron <sup>™</sup> Processor |    |
| GENERAL INFORMATION                                           | 3  |
| ERRATA                                                        | 8  |
| DOCUMENTATION CHANGES                                         |    |
| SPECIFICATION CLARIFICATIONS                                  |    |



### **REVISION HISTORY**

| Date of Revision | Version | Description                                                                              |  |
|------------------|---------|------------------------------------------------------------------------------------------|--|
| April 15, 1998   | -001    | This document is the first Specification Update for the Celeron <sup>TM</sup> processor. |  |

### PREFACE

This document is an update to the specifications contained the *Intel Celeron*<sup>TM</sup> *Processor at 266 MHz* datasheet (Order Number 243658), and the *Intel Architecture Software Developer's Manual*, *Volumes 1, 2 and 3* (Order Numbers 243190, 243191, and 243192, respectively). It is intended for hardware system manufacturers and software developers of applications, operating systems, or tools. It contains Specification Changes, S-Specs, Errata, Specification Clarifications, and Documentation Changes.

Into

#### Nomenclature

**Specification Changes** are modifications to the current published specifications for the Intel Celeron<sup>TM</sup> processor. These changes will be incorporated in the next release of the specifications.

**S-Specs** are exceptions to the published specifications, and apply only to the units assembled under that s-spec.

**Specification Clarifications** describe a specification in greater detail or further highlight a specification's impact to a complex design situation. These clarifications will be incorporated in the next release of the specifications.

**Documentation Changes** include typos, errors, or omissions from the current published specifications. These changes will be incorporated in the next release of the specifications.

**Errata** are design defects or errors. Errata may cause the Intel Celeron processor's behavior to deviate from published specifications. Hardware and software designed to be used with any given processor stepping must assume that all errata documented for that processor stepping are present on all devices.

#### Identification Information

Complete identification information of the Intel Celeron processor can be found in the *Intel Processor Identification and the CPU Instruction* application note (Order Number 241618).

Specification Update for Intel Celeron<sup>TM</sup> Processor

### INTEL CELERON™ PROCESSOR SPECIFICATION UPDATE

### **GENERAL INFORMATION**

### Intel Celeron<sup>TM</sup> Processor Markings



#### Dynamic laser mark area

NOTES:

- SYYYY = S-spec Number.
- FFFFFFFF = FPO # (Test Lot Traceability #).
- COA = Country of Assembly.

| CPUID |        |       |          |                  |        |                         |                     |       |  |  |
|-------|--------|-------|----------|------------------|--------|-------------------------|---------------------|-------|--|--|
| Туре  | Family | Model | Stepping | Core<br>Stepping | S-Spec | Speed (MHz)<br>Core/Bus | S.E.C.<br>Substrate | Notes |  |  |
| 0     | 6      | 5     | 0        | dA0              | SL2SY  | 266/66                  | Rev. 1              |       |  |  |
| 0     | 6      | 5     | 1        | dA1              | SL2TR  | 266/66                  | Rev. 1              |       |  |  |
| 0     | 6      | 5     | 1        | dA1              | SL2QG  | 266/66                  | Rev. 1              | 1     |  |  |

### Intel Celeron<sup>TM</sup> Processor Identification Information

NOTE:

1. This is a boxed Intel Celeron<sup>™</sup> processor with an attached fan heatsink.

### int<sub>el</sub>.

### INTEL CELERON<sup>TM</sup> PROCESSOR SPECIFICATION UPDATE

### Summary Table of Changes

The following table indicates the Specification Changes, Errata, Specification Clarifications, or Documentation Changes which apply to the Intel Celeron processors. Intel intends to fix some of the errata in a future stepping of the component, and to account for the other outstanding issues through documentation or specification changes as noted. This table uses the following notations:

### CODES USED IN SUMMARY TABLE

| X:                       | Specification Change, Erratum, Specification Clarification,<br>or Documentation Change applies to the given processor<br>stepping. |
|--------------------------|------------------------------------------------------------------------------------------------------------------------------------|
| Doc:                     | Intel intends to update the appropriate documentation in a future revision.                                                        |
| Fix:                     | This erratum is intended to be fixed in a future stepping of the component.                                                        |
| Fixed:                   | This erratum has been previously fixed.                                                                                            |
| NoFix:                   | There are no plans to fix this erratum.                                                                                            |
| (No mark) or (blank box) | ): This item is fixed in or does not apply to the given stepping.                                                                  |
| AP:                      | APIC related erratum.                                                                                                              |
| SUB:                     | This column refers to errata on the Intel Celeron <sup>™</sup> processor substrate.                                                |
| Shaded:                  | This erratum is either new or modified from the previous version of the document.                                                  |

int<sub>el</sub>.

| NO. | dA0 | dA1 | SUB | Plans | ERRATA                                                                                                                   |
|-----|-----|-----|-----|-------|--------------------------------------------------------------------------------------------------------------------------|
| 1   | X   | Х   |     | NoFix | FP Data Operand Pointer may be incorrectly calculated<br>after FP access which wraps 64-Kbyte boundary in<br>16-bit code |
| 2   | X   | Х   |     | NoFix | Differences exist in debug exception reporting                                                                           |
| 3   | X   | Х   |     | NoFix | Code fetch matching disabled debug register may cause debug exception                                                    |
| 4   | Х   | Х   |     | NoFix | FP inexact-result exception flag may not be set                                                                          |
| 5   | X   | Х   |     | NoFix | BTM for SMI will contain incorrect FROM EIP                                                                              |
| 6   | Х   | Х   |     | NoFix | I/O restart in SMM may fail after simultaneous MCE                                                                       |
| 7   | X   | Х   |     | NoFix | Branch traps do not function if BTMs are also enabled                                                                    |
| 8   | X   | Х   |     | NoFix | Machine check exception handler may not always execute successfully                                                      |
| 9   | X   | Х   |     | NoFix | LBER may be corrupted after some events                                                                                  |
| 10  | X   | Х   |     | NoFix | BTMs may be corrupted during simultaneous L1 cache line replacement                                                      |
| 11  | X   | Х   |     | Fix   | Potential early deassertion of LOCK# during split-lock cycles                                                            |
| 12  | X   | Х   |     | NoFix | A20M# may be inverted after returning from SMM and Reset                                                                 |
| 13  | Х   | Х   |     | Fix   | Reporting of floating-point exception may be delayed                                                                     |
| 14  | X   | Х   |     | NoFix | Near CALL to ESP creates unexpected EIP address                                                                          |
| 15  | Х   | Х   |     | Fix   | Built-in self-test always gives nonzero result                                                                           |
| 16  | Х   | Х   |     | Fix   | THERMTRIP# may not be asserted as specified                                                                              |
| 17  | Х   |     |     | Fix   | Cache state corruption in the presence of page A/D-bit setting and snoop traffic                                         |
| 18  | Х   |     |     | Fix   | Snoop cycle generates spurious machine check exception                                                                   |
| 19  | X   | Х   |     | Fix   | MOVD/MOVQ instruction writes to memory prematurely                                                                       |

### int<sub>el</sub>,

### INTEL CELERON<sup>TM</sup> PROCESSOR SPECIFICATION UPDATE

| NO. | dA0 | dA1 | SUB | Plans | ERRATA                                                                           |
|-----|-----|-----|-----|-------|----------------------------------------------------------------------------------|
| 20  | Х   | Х   |     | NoFix | Memory type undefined for nonmemory operations                                   |
| 21  | Х   | Х   |     | NoFix | Bus protocol conflict with optimized chipsets                                    |
| 22  | Х   | Х   |     | NoFix | FP Data Operand Pointer may not be zero after power<br>on or Reset               |
| 23  | Х   | Х   |     | NoFix | MOVD following zeroing instruction can cause incorrect result                    |
| 1AP | Х   | Х   |     | NoFix | APIC access to cacheable memory causes<br>SHUTDOWN                               |
| 2AP | Х   | Х   |     | NoFix | Write to mask LVT (programmed as EXTINT) will not deassert outstanding interrupt |

| NO. | dA0 | dA1 | SUB | Plans | DOCUMENTATION CHANGES                                                                            |
|-----|-----|-----|-----|-------|--------------------------------------------------------------------------------------------------|
| 1   | Х   | Х   |     | Doc   | Invalid arithmetic operations and masked responses to<br>them relative to FIST/FISTP instruction |
| 2   | X   | X   |     | Doc   | FIDIV/FIDIVR m16int description                                                                  |
| 3   | X   | Χ   |     | Doc   | PUSH does not pad with zeros                                                                     |
| 4   | Х   | Х   |     | Doc   | DR7, bit 10 is reserved                                                                          |
| 5   | Х   | Х   |     | Doc   | Cache and TLB description correction                                                             |
| 6   | Х   | Х   |     | Doc   | SMRAM state save map contains documentation error                                                |
| NO. | dA0 | dA1 | SUB | Plans | SPECIFICATION CLARIFICATIONS                                                                     |
| 1   | Х   | Х   |     | Doc   | Writes to WC memory                                                                              |
| 2   | Х   | Х   |     | Doc   | Multiple processors protocol and restrictions                                                    |
| 3   | Х   | Х   |     | Doc   | Critical sequence of events during a page fault exception                                        |
| 4   | Х   | Х   |     | Doc   | Performance-monitoring counter issues                                                            |
| 5   | Χ   | Χ   |     | Doc   | POP[ESP] with 16-bit stack size                                                                  |

### ERRATA

### 1. FP Data Operand Pointer May Be Incorrectly Calculated After FP Access Which Wraps 64-Kbyte Boundary in 16-Bit Code

**PROBLEM:** The FP Data Operand Pointer is the effective address of the operand associated with the last noncontrol floating-point instruction executed by the machine. If an 80-bit floating-point access (load or store) occurs in a 16-bit mode other than protected mode (in which case the access will produce a segment limit violation), the memory access wraps a 64-Kbyte boundary, and the floating-point environment is subsequently saved in 32-bit mode, the subtraction routine used to calculate the FP Data Operand Pointer will assume the floating-point access was in 32-bit mode, and the high word of the address will be FFFFh instead of 0000h.

**IMPLICATION:** A 32-bit operating system running 16-bit floating-point code may encounter this erratum, under the following conditions:

- The operating system is using a segment greater than 64 Kbytes in size.
- An application is running in a 16-bit mode other than protected mode.
- An 80-bit floating-point load which wraps the 64-Kbyte boundary is executed.
- The operating system uses a 32-bit handler on an unmasked exception which occurs during the load.

• The exception handler uses the value contained in the FP Data Operand Pointer. Wrapping an 80-bit floating-point load around a segment boundary in this way is not a normal programming practice. Intel has not currently identified any software which exhibits this behavior.

**WORKAROUND:** If the FP Data Operand Pointer is used in a 32-bit exception handler in an OS which may run 16-bit floating-point code, care must be taken to ensure that no 80-bit floating-point accesses are wrapped around a 64-Kbyte boundary.

**STATUS:** For the steppings affected see the Summary Table of Changes at the beginning of this section.

### 2. Differences Exist in Debug Exception Reporting

**PROBLEM:** There exist some differences in the reporting of code and data breakpoint matches between that specified by previous Intel processors' specifications and the behavior of the Intel Celeron processor, as described below:

int<sub>el</sub>.

INTEL CELERON<sup>TM</sup> PROCESSOR SPECIFICATION UPDATE

### CASE 1:

The first case is for a breakpoint set on a MOVSS or POPSS instruction, when the instruction following it causes a debug register protection fault (DR7.gd is already set, enabling the fault). The Pentium<sup>®</sup> processor reports delayed data breakpoint matches from the MOVSS or POPSS instructions by setting the matching DR6.bi bits, along with the debug register protection fault (DR6.bd). If additional breakpoint faults are matched during the call of the debug fault handler, the Pentium processor sets the breakpoint match bits (DR6.bi) to reflect the breakpoints matched by both the MOVSS or POPSS breakpoint and the debug fault handler call. The Intel Celeron processor only sets DR6.bd in either situation, and does not set any of the DR6.bi bits.

### CASE 2:

In the second breakpoint reporting failure case, if a MOVSS or POPSS instruction with a data breakpoint is followed by a store to memory which crosses a 4-Kbyte page boundary, the breakpoint information for the MOVSS or POPSS will be lost. Previous processors retain this information across such a page split.

### CASE 3:

If they occur after a MOVSS or POPSS instruction, the INT *n*, INTO, and INT3 instructions zero the DR6.Bi bits (bits B0 through B3), clearing pending breakpoint information, unlike previous processors.

### CASE 4:

If a data breakpoint and an SMI (System Management Interrupt) occur simultaneously, the SMI will be serviced via a call to the SMM handler, and the pending breakpoint will be lost.

IMPLICATION: When debugging or when developing debuggers for a Intel Celeron processor-based system, this behavior should be noted. Normal usage of the MOVSS or POPSS instructions (i.e., following them with a MOV ESP) will not exhibit the behavior of cases 1-3. Debugging in conjunction with SMM will be limited by case 4 (no workaround has been identified for this case).

**WORKAROUND:** Following MOVSS and POPSS instructions with a MOV ESP instruction when using breakpoints will avoid the first three cases of this erratum.

**STATUS:** For the steppings affected see the Summary Table of Changes at the beginning of this section.

# int<sub>el</sub>,

### 3. Code Fetch Matching Disabled Debug Register May Cause Debug Exception

**PROBLEM:** The bits L0-3 and G0-3 enable breakpoints local to a task and global to all tasks, respectively. If one of these bits is set, a breakpoint is enabled, corresponding to the addresses in the debug registers DR0-DR3. If at least one of these breakpoints is enabled, any of these registers are **disabled** (i.e., Ln and Gn are 0), and RWn for the disabled register is 00 (indicating a breakpoint on instruction execution), normally an instruction fetch will not cause an instruction-breakpoint fault based on a match with the address in the disabled register (s). However, if the address in a disabled register matches the address of a code fetch which also results in a page fault, an instruction-breakpoint fault will occur.

IMPLICATION: While debugging software, extraneous instruction-breakpoint faults may be encountered if breakpoint registers are not cleared when they are disabled. Debug software which does not implement a code breakpoint handler will fail, if this occurs. If a handler is present, the fault will be serviced. Mixing data and code may exacerbate this problem by allowing disabled data breakpoint registers to break on an instruction fetch.

**WORKAROUND:** The debug handler should clear breakpoint registers before they become disabled.

**STATUS:** For the steppings affected see the Summary Table of Changes at the beginning of this section.

### 4. FP Inexact-Result Exception Flag May Not Be Set

**PROBLEM:** When the result of a floating-point operation is not exactly representable in the destination format (1/3 in binary form, for example), an inexact-result (precision) exception occurs. When this occurs, the PE bit (bit 5 of the FPU status word) is normally set by the processor. Under certain rare conditions, this bit may not be set when this rounding occurs. However, other actions taken by the processor (invoking the software exception handler if the exception is unmasked) are not affected. This erratum can only occur if the floating-point operation which causes the precision exception is immediately followed by one of the following instructions:

- FST m32real
- FST m64real
- FSTP m32real
- FSTP m64real
- FSTP m80real
- FIST m16int
- FIST m32int

# int<sub>el</sub>,

- FISTP m16int
- FISTP m32int
- FISTP m64int

Note that even if this combination of instructions is encountered, there is also a dependency on the internal pipelining and execution state of both instructions in the processor.

**IMPLICATION:** Inexact-result exceptions are commonly masked or ignored by applications, as it happens frequently, and produces a rounded result acceptable to most applications. The PE bit of the FPU status word may not always be set upon receiving an inexact-result exception. Thus, if these exceptions are unmasked, a floating-point error exception handler may not recognize that a precision exception occurred. Note that this is a "sticky" bit, i.e., once set by an inexact-result condition, it remains set until cleared by software.

**WORKAROUND:** This condition can be avoided by inserting a NOP instruction between the two floating-point instructions.

**STATUS:** For the steppings affected see the Summary Table of Changes at the beginning of this section.

### 5. BTM for SMI Will Contain Incorrect FROM EIP

**PROBLEM:** A system management interrupt (SMI) will produce a Branch Trace Message (BTM), if BTMs are enabled. However, the FROM EIP field of the BTM (used to determine the address of the instruction which was being executed when the SMI was serviced) will not have been updated for the SMI, so the field will report the same FROM EIP as the previous BTM.

**IMPLICATION:** A BTM which is issued for an SMI will not contain the correct FROM EIP, limiting the usefulness of BTMs for debugging software in conjunction with System Management Mode (SMM).

WORKAROUND: None identified at this time.

**STATUS:** For the steppings affected see the Summary Table of Changes at the beginning of this section.

### 6. I/O Restart in SMM May Fail After Simultaneous MCE

**PROBLEM:** If an I/O instruction (IN, INS, REP INS, OUT, OUTS, or REP OUTS) is being executed, and if the data for this instruction becomes corrupted, the Intel Celeron processor will signal a machine check exception (MCE). If the instruction is directed at a device which is powered down, the processor may also receive an assertion of SMI#. Since MCEs have higher priority, the processor will call the MCE handler, and the

SMI# assertion will remain pending. However, upon attempting to execute the first instruction of the MCE handler, the SMI# will be recognized and the processor will attempt to execute the SMM handler. If the SMM handler is completed successfully, it will attempt to restart the I/O instruction, but will not have the correct machine state, due to the call to the MCE handler.

Into

**IMPLICATION:** A simultaneous MCE and SMI# assertion may occur for one of the I/O instructions above. The SMM handler may attempt to restart such an I/O instruction, but will have corrupted state due to the MCE handler call, leading to failure of the restart and SHUTDOWN of the processor.

**WORKAROUND:** If a system implementation must support both SMM and MCEs, the first thing the SMM handler code (when an I/O restart is to be performed) should do is check for a pending MCE. If there is an MCE pending, the SMM handler should immediately exit via an RSM instruction and allow the machine check exception handler to execute. If there is not, the SMM handler may proceed with its normal operation.

**STATUS:** For the steppings affected see the Summary Table of Changes at the beginning of this section.

### 7. Branch Traps Do Not Function If BTMs Are Also Enabled

**PROBLEM:** If branch traps or branch trace messages (BTMs) are enabled alone, both function as expected. However, if both are enabled, only the BTMs will function, and the branch traps will be ignored.

**IMPLICATION:** The branch traps and branch trace message debugging features cannot be used together.

**WORKAROUND:** If branch trap functionality is desired, BTMs must be disabled. **STATUS:** For the steppings affected see the Summary Table of Changes at the beginning of this section.

### 8. Machine Check Exception Handler May Not Always Execute Successfully

**PROBLEM:** An asynchronous machine check exception (MCE), such as a BINIT# event, which occurs during an access that splits a 4-Kbyte page boundary may leave some internal registers in an indeterminate state. Thus, MCE handler code may not always run successfully if an asynchronous MCE has occurred previously.

**IMPLICATION:** An MCE may not always result in the successful execution of the MCE handler. However, asynchronous MCEs usually occur upon detection of a catastrophic system condition that would also hang the processor. Leaving MCEs disabled will result in the condition which caused the asynchronous MCE instead causing the

int<sub>el</sub>,

processor to enter SHUTDOWN. Therefore, leaving MCEs disabled may not improve overall system behavior.

**WORKAROUND:** No workaround which would guarantee successful MCE handler execution under this condition has been identified.

**STATUS:** For the steppings affected see the Summary Table of Changes at the beginning of this section.

### 9. LBER May Be Corrupted After Some Events

**PROBLEM:** The last branch record (LBR) and the last branch before exception record (LBER) can be used to determine the source and destination information for previous branches or exceptions. The LBR contains the source and destination addresses for the last branch or exception, and the LBER contains similar information for the last branch taken before the last exception. This information is typically used to determine the location of a branch which leads to execution of code which causes an exception. However, after a catastrophic bus condition which results in an assertion of BINIT# and the reinitialization of the buses, the value in the LBER may be corrupted. Also, after either a CALL which results in a fault or a software interrupt, the LBER and LBR will be updated to the same value, when the LBER should not have been updated. **IMPLICATION:** The LBER and LBR registers are used only for debugging purposes. When this erratum occurs, the LBER will not contain reliable address information. The value of LBER should be used with caution when debugging branching code; if the values in the LBER and LBER are the same, then the LBER value is incorrect. Also, the value in the LBER should not be relied upon after a BINIT# event.

WORKAROUND: None identified at this time.

**STATUS:** For the steppings affected see the Summary Table of Changes at the beginning of this section.

### 10. BTMs May Be Corrupted During Simultaneous L1 Cache Line Replacement

**PROBLEM:** When Branch Trace Messages (BTMs) are enabled and such a message is generated, the BTM may be corrupted when issued to the bus by the L1 cache if a new line of data is brought into the L1 data cache simultaneously. Though the new line being stored in the L1 cache is stored correctly, and no corruption occurs in the data, the information in the BTM may be incorrect due to the internal collision of the data line and the BTM.

IMPLICATION: Although BTMs may not be entirely reliable due to this erratum, the conditions necessary for this boundary condition to occur have only been exhibited

during focused simulation testing. Intel has currently not observed this erratum in a system level validation environment.

int

WORKAROUND: None identified at this time.

**STATUS:** For the steppings affected see the Summary Table of Changes at the beginning of this section.

### 11. Potential Early Deassertion of LOCK# during Split-Lock Cycles

**PROBLEM:** During a split-lock cycle there are four bus transactions: 1st ADS# (a partial read), 2nd ADS# (a partial read), 3rd ADS# (a partial write), and the 4th ADS# (a partial write). Due to this erratum, LOCK# may deassert one clock after the 4th ADS# of the split-lock cycle instead of after the RS# assertion corresponding to the 4th ADS# has been sampled. The following sequence of events are required for this erratum to occur:

- 1. A lock cycle occurs (split or nonsplit).
- 2. Five more bus transactions (assertion of ADS#) occur.
- 3. A split-lock cycle occurs and BNR# toggles after the 3rd ADS# (partial write) of the split-lock cycle. This in turn delays the assertion of the 4th ADS# of the split-lock cycle. BNR# toggling at this time could most likely happen when the bus is set for an IOQ depth of 2.

When all of these events occur, LOCK# will be deasserted in the next clock after the 4th ADS# of the split-lock cycle.

**IMPLICATION:** This may affect chipset logic which monitors the behavior of LOCK# deassertion.

WORKAROUND: None identified.

**STATUS:** For the steppings affected see the Summary Table of Changes at the beginning of this section.

### 12. A20M# May Be Inverted After Returning From SMM and Reset

**PROBLEM:** This erratum is seen when software causes the following events to occur:

- 1. The assertion of A20M# in real address mode.
- 2. After entering the 1-Mbyte address wrap-around mode caused by the assertion of A20M#, there is an assertion of SMI# intended to cause a Reset or remove power to the processor. Once in the SMM handler, software saves the SMM state save map to an area of nonvolatile memory from which it can be restored at some point in the future. Then software asserts RESET# or removes power to the processor.

# intel celeron<sup>TM</sup> processor specification update

3. After exiting Reset or completion of power-on, software asserts SMI# again. Once in the SMM handler, it then retrieves the old SMM state save map which was saved in event 2 above and copies it into the current SMM state save map. Software then asserts A20M# and executes the RSM instruction. After exiting the SMM handler, the polarity of A20M# is inverted.

**IMPLICATION:** If this erratum occurs, A20M# will behave with a polarity opposite from what is expected (i.e., the 1-Mbyte address wrap-around mode is enabled when A20M# is deasserted, and does not occur when A20M# is asserted).

**WORKAROUND:** Software should save the A20M# signal state in nonvolatile memory before an assertion of RESET# or a power down condition. After coming out of Reset or at power on, SMI# should be asserted again. During the restoration of the old SMM state save map described in event 3 above, the entire map should be restored, except for bit 5 of the byte at offset 7F18h. This bit should retain the value assigned to it when the SMM state save map was created in event 3. The SMM handler should then restore the original value of the A20M# signal.

**STATUS:** For the steppings affected see the Summary Table of Changes at the beginning of this section.

### 13. Reporting of Floating-Point Exception May be Delayed

**PROBLEM:** The Intel Celeron processor normally reports a floating-point exception for an instruction when the next floating-point or MMX<sup>TM</sup> technology instruction is executed. The assertion of FERR# and/or the INT 16 interrupt corresponding to the exception may be delayed until the floating-point or MMX technology instruction **after** the one which is expected to trigger the exception, if the following conditions are met:

- 1. A floating-point instruction causes an exception.
- 2. Before another floating-point or MMX<sup>TM</sup> technology instruction, any one of the following occurs:
  - a. A subsequent data access occurs to a page which has not been marked as accessed, or
  - b. Data is referenced which crosses a page boundary, or
  - c. A possible page-fault condition is detected which, when resolved, completes without faulting.
- 3. The instruction causing event 2 above is followed by a MOVQ or MOVD store instruction.

IMPLICATION: This erratum only affects software which operates with floating-point exceptions unmasked. Software which requires floating-point exceptions to be visible

on the next floating-point or MMX technology instruction, and which uses floatingpoint calculations on data which is then used for MMX technology instructions, may see a delay in the reporting of a floating-point instruction exception in some cases. Note that mixing floating-point and MMX technology instructions in this way is not recommended.

**WORKAROUND:** Inserting a WAIT or FWAIT instruction (or reading the floating-point status register) between the floating-point instruction and the MOVQ or MOVD instruction will give the expected results. This is already the recommended practice for software.

**STATUS:** For the steppings affected see the Summary Table of Changes at the beginning of this section.

### 14. Near CALL to ESP Creates Unexpected EIP Address

**PROBLEM:** As documented, the CALL instruction saves procedure linking information in the procedure stack and jumps to the called procedure specified with the destination (target) operand. The target operand specifies the address of the first instruction in the called procedure. This operand can be an immediate value, a general purpose register, or a memory location. When accessing an absolute address indirectly using the stack pointer (ESP) as a base register, the base value used is the value in the ESP register before the instruction executes. However, when accessing an absolute address directly using ESP as the base register, the base value used is the value of ESP **after** the return value is pushed on the stack, not the value in the ESP register **before** the instruction executed.

**IMPLICATION:** Due to this erratum, the processor may transfer control to an unintended address. Results are unpredictable, depending on the particular application, and can range from no effect to the unexpected termination of the application due to an exception. Intel has observed this erratum only in a focused testing environment. Intel has not observed any commercially available operating system, application, or compiler that makes use of or generates this instruction.

**WORKAROUND:** If the other seven general purpose registers are unavailable for use, and it is necessary to do a CALL via the ESP register, first push ESP onto the stack, then perform an **indirect** call using ESP (e.g., CALL [ESP]). The saved version of ESP should be popped off the stack after the call returns.

**STATUS:** For the steppings affected see the Summary Table of Changes at the beginning of this section.

### 15. Built-in Self-test Always Gives Nonzero Result

**PROBLEM:** The Built-in Self-test (BIST) of the Intel Celeron processor does not give a zero result to indicate a passing test. Regardless of pass or fail status, bit 6 of the BIST result in the EAX register after running BIST is set.

**IMPLICATION:** Software which relies on a zero result to indicate a passing BIST will indicate BIST failure.

**WORKAROUND:** Mask bit 6 of the BIST result register when analyzing BIST results. **STATUS:** For the steppings affected see the Summary Table of Changes at the beginning of this section.

### 16. THERMTRIP# May Not be Asserted as Specified

**PROBLEM:** THERMTRIP# is a signal on the Intel Celeron processor which is asserted when the core reaches a critical temperature during operation as detailed in the processor specification. The Intel Celeron processor may not assert THERMTRIP# until a much higher temperature than the one specified is reached.

**IMPLICATION:** The THERMTRIP# feature is not functional on the Intel Celeron processor. Note that this erratum can only occur when the processor is running with a  $T_{PLATE}$  temperature over the maximum specification of 75 °C.

**WORKAROUND:** Avoid operation of the Intel Celeron processor outside of the thermal specifications defined by the processor specifications.

**STATUS:** For the steppings affected see the Summary Table of Changes at the beginning of this section.

### 17. Cache State Corruption in the Presence of Page A/D-bit Setting and Snoop Traffic

**PROBLEM:** If an operating system uses the Page Access and/or Dirty bit feature implemented in the Intel architecture and there is a significant amount of snoop traffic on the bus, while the processor is setting the Access and/or Dirty bit the processor may inappropriately change a single L1 cache line to the modified state.

**IMPLICATION:** The occurrence of this erratum may result in cache incoherency, which may cause parity errors, data corruption (with no parity error), unexpected application or operating system termination, or system hangs.

**WORKAROUND:** It is possible for BIOS code to contain a workaround for this erratum. **STATUS:** For the steppings affected see the Summary Table of Changes at the beginning of this section.

### 18. Snoop Cycle Generates Spurious Machine Check Exception

**PROBLEM:** The processor may incorrectly generate a Machine Check Exception (MCE) when it processes a snoop access that does not hit the L1 data cache. Due to an internal logic error, this type of snoop cycle may still check data parity on undriven data lines. The processor generates a spurious machine check exception as a result of this unnecessary parity check.

Into

IMPLICATION: A spurious machine check exception may result in an unexpected system halt if Machine Check Exception reporting is enabled in the operating system. WORKAROUND: It is possible for BIOS code to contain a workaround for this erratum. STATUS: For the steppings affected see the Summary Table of Changes at the beginning of this section.

### 19. MOVD/MOVQ Instruction Writes to Memory Prematurely

**PROBLEM:** When an instruction encounters a fault, the faulting instruction should not modify any CPU or system state. However, when the MMX technology store instructions MOVD and MOVQ encounter any of the following events, it is possible for the store to be committed to memory even though it should be canceled:

- 1. If CR0.EM = 1 (Emulation bit), then the store could happen prior to the triggered invalid opcode exception.
- 2. If the floating-point Top-of-Stack (FP TOS) is not zero, then the store could happen prior to executing the processor assist routine that sets the FP TOS to zero.
- 3. If there is an unmasked floating-point exception pending, then the store could happen prior to the triggered unmasked floating-point exception.
- 4. If CR0.TS = 1 (Task Switched bit), then the store could happen prior to the triggered Device Not Available (DNA) exception.

If the MOVD/MOVQ instruction is restarted after handling any of the above events, then the store will be performed again, overwriting with the expected data. The instruction will not be restarted after event 1. The instruction will definitely be restarted after events 2 and 4. The instruction may or may not be restarted after event 3, depending on the specific exception handler.

**IMPLICATION:** This erratum causes unpredictable behavior in an application if MOVD/MOVQ instructions are used to manipulate semaphores for multiprocessor synchronization, or if these MMX instructions are used to write to uncacheable memory or memory mapped I/O that has side effects, e.g., graphics devices. This erratum is completely transparent to all applications that do not have these characteristics. When each of the above conditions are analyzed:

# intel celeron<sup>TM</sup> processor specification update

- 1. Setting the CR0.EM bit forces all floating-point/MMX<sup>TM</sup> instructions to be handled by software emulation. The MOVD/MOVQ instruction, which is an MMX instruction, would be considered an invalid instruction. Operating systems typically terminates the application after getting the expected invalid opcode fault.
- 2. The FP TOS not equal to 0 case only occurs when the MOVD/MOVQ store is the first MMX instruction in an MMX technology routine and the previous floating-point routine did not clean up the floating-point states properly when it exited. Floating-point routines commonly leave TOS to 0 prior to exiting. For a store to be executed as the first MMX instruction in an MMX technology routine following a floating-point routine, the software would be implementing instruction level intermixing of floating-point and MMX instructions. Intel does not recommend this practice.
- 3. The unmasked floating-point exception case only occurs if the store is the first MMX technology instruction in an MMX technology routine and the previous floating-point routine exited with an unmasked floating-point exception pending. Again, for a store to be executed as the first MMX instruction in an MMX technology routine following a floating-point routine, the software would be implementing instruction level intermixing of floating-point and MMX instructions. Intel does not recommend this practice.
- 4. Device Not Available (DNA) exceptions occur naturally when a task switch is made between two tasks that use either floating-point instructions and/or MMX instructions. For this erratum, in the event of the DNA exception, data from the prior task may be temporarily stored to the present task's program state.

**WORKAROUND:** Do not use MMX instructions to manipulate semaphores for multiprocessor synchronization. Do not use MOVD/MOVQ instructions to write directly to I/O devices if doing so triggers user visible side effects. An OS can prevent old data from being stored to a new task's program state by cleansing the FPU explicitly after every task switch. Follow Intel's recommended programming paradigms in the *Intel Architecture Developer's Optimization Manual* for writing MMX technology programs. Specifically, do not mix floating-point and MMX instructions. When transitioning to new a MMX technology routine, begin with an instruction that does not depend on the prior state of either the MMX technology registers or the floating-point registers, such as a load or PXOR mm0, mm0. Be sure that the FP TOS is clear before using MMX instructions.

**STATUS:** For the steppings affected see the Summary Table of Changes at the beginning of this section.

#### 20. Memory Type Undefined for Nonmemory Operations

**PROBLEM:** The Memory Type field for nonmemory transactions such as I/O and Special Cycles are undefined. Although the Memory Type attribute for nonmemory operations logically should (and usually does) manifest itself as UC, this feature is not designed into the implementation and is therefore inconsistent.

Intal

**IMPLICATION:** Bus agents may decode a non-UC memory type for nonmemory bus transactions.

**WORKAROUND:** Bus agents must consider transaction type to determine the validity of the Memory Type field for a transaction.

STATUS: For the steppings affected, see the Summary Table of Changes at the beginning of this section.

#### 21. Bus Protocol Conflict With Optimized Chipsets

**PROBLEM:** A "dead" turnaround cycle with no agent driving the address, address parity, request command, or request parity signals must occur between the processor driving these signals and the chipset driving them after asserting BPRI#. The Intel Celeron processor does not follow this protocol. Thus, if a system uses a chipset or third party agent which optimizes its arbitration latency (reducing it to 2 clocks when it observes an active (low) ADS# signal and an inactive (high) LOCK# signal on the same clock that BPRI# is asserted (driven low)), the Intel Celeron processor may cause bus contention during an unlocked bus exchange.

IMPLICATION: This violation of the reduced arbitration latency bus exchange protocol may cause a system-level setup timing violation on the address, address parity, request command, or request parity signals on the system bus. This may result in a system hang or assertion of the AERR# signal, causing spurious corrective action or shutdown of the system, as the system hardware and software dictate. The possibility of failure due to the contention caused by this erratum may be increased due to the processor's internal active pull-up of these signals on the clock after the signals are no longer being driven by the processor.

**WORKAROUND:** If the chipset and third party agents used with the Intel Celeron processor do not optimize their arbitration latency as described above, no action is required. For the 66 MHz Intel Celeron processor, no action is required.

**STATUS:** For the steppings affected see the Summary Table of Changes at the beginning of this section.

int<sub>el</sub>,

### INTEL CELERON<sup>TM</sup> PROCESSOR SPECIFICATION UPDATE

### 22. FP Data Operand Pointer May Not be Zero After Power On or Reset

**PROBLEM:** The FP Data Operand Pointer, as specified, should be reset to zero upon power on or Reset by the processor. Due to this erratum, the FP Data Operand Pointer may be nonzero after power on or Reset.

**IMPLICATION:** Software which uses the FP Data Operand Pointer and count on its value being zero after power on or Reset without first executing an FINIT/FNINIT instruction will use an incorrect value, resulting on incorrect behavior of the software.

WORKAROUND: Software should follow the recommendation in Section 8.2 of the Intel Architecture Software Developer's Manual, Volume 3: System Programming Guide (Order Number 243192). This recommendation states that if the FPU will be used, software-initialization code should execute an FINIT/FNINIT instruction following a hardware reset. This will correctly clear the FP Data Operand Pointer to zero. STATUS: For the steppings affected see the Summary Table of Changes at the beginning of this section.

### 23. MOVD Following Zeroing Instruction Can Cause Incorrect Result

**PROBLEM:** An incorrect result may be calculated after these circumstances:

- 1. A register has been zeroed with either a SUB reg, reg instruction or an XOR reg, reg instruction,
- 2. A value is moved with sign extension into the same register's lower 16 bits,
- 3. This register is then copied to an MMX<sup>™</sup> register using the MOVD instruction prior to any other operations on the sign-extended value.

Specifically, the sign may be incorrectly extended into bits 16-31 of the MMX register. Only the MMX register is affected by this erratum.

The erratum only occurs when the 3 following steps occur in the order shown. The erratum may occur with up to 40 intervening instructions that do not modify the sign-extended value between steps 2 and 3.

1. XOR EAX, EAX or SUB EAX, EAX

 MOVSX AX, BL or MOVSX AX, byte ptr <memory address> or MOVSX AX, BX or MOVSX AX, word ptr <memory address> or CBW

3. MOVD MM0, EAX

Note that this erratum may occur with "EAX" replaced with any 32-bit general purpose register, and "AX" with the corresponding 16-bit version of that replacement. "BL" or

"BX" can be replaced with any 8-bit or 16-bit general purpose register. The CBW instruction is specific to the EAX register only.

In the example, EAX is forced to contain 0 by the XOR or SUB instructions. Since the four types of the MOVSX instructions and the CBW instruction modify only bits 15:8 of EAX by sign extending the lower 8 bits of EAX, bits 31:16 of EAX should always contain 0. This implies that when MOVD copies EAX to MM0, bits 31:16 of MM0 should also be 0. Under certain scenarios, bits 31:16 of MM0 are not 0, but are replicas of bit 15 (the 16th bit) of AX. This is noticeable when the value in AX after the MOVSX or CBW instruction is negative, i.e., bit 15 of AX is a 1.

When AX is positive (bit 15 of AX is a 0), MOVD will always produce the correct answer. If AX is negative (bit 15 of AX is a 1), MOVD may produce the right answer or the wrong answer depending on the point in time when the MOVD instruction is executed in relation to the MOVSX or CBW instruction.

**IMPLICATION:** The effect of incorrect execution will vary from unnoticeable, due to the code sequence discarding the incorrect bits, to an application failure. If the MMX application in which MOVD is used to manipulate pixels, it is possible for one or more pixels to exhibit the wrong color or position momentarily. It is also possible for a computational application that uses the MOVD instruction in the manner described above to produce incorrect data. Note that this data may cause an unexpected page fault or general protection fault.

WORKAROUND: There are two possible workarounds for this erratum:

- Rather than using the MOVSX-MOVD or CBW-MOVD pairing to handle one variable at a time, use the sign extension capabilities (PSRAW, etc.) within MMX<sup>TM</sup> for operating on multiple variables. This would result in higher performance as well.
- 2. Insert another operation that modifies or copies the sign-extended value between the MOVSX/CBW instruction and the MOVD instruction as in the example below:

XOR EAX, EAX (or SUB EAX, EAX) MOVSX AX, BL (or other MOVSX or CBW instruction) \*MOV EAX, EAX MOVD MM0, EAX

\* MOV EAX, EAX is used here as it is fairly unobtrusive. Again, EAX can be any 32-bit register. **STATUS:** For the steppings affected see the Summary Table of Changes at the beginning of this section.

### 1AP. APIC Access to Cacheable Memory Causes SHUTDOWN

**PROBLEM:** APIC operations which access memory with any type other than uncacheable (UC) are illegal. If an APIC operation to a memory type other than UC occurs and Machine Check Exceptions (MCEs) are disabled, the processor will enter SHUTDOWN after such an access. If MCEs are enabled, an MCE will occur. However, in this circumstance, a second MCE will be signaled. The second MCE signal will cause the Intel Celeron processor to enter SHUTDOWN.

IMPLICATION: Recovery from a PIC access to cacheable memory will not be successful. Software that accesses only UC type memory during APIC operations will not encounter this erratum.

**WORKAROUND:** Ensure that the memory space to which PIC accesses can be made is marked as type UC (uncacheable) in the memory type range registers (MTRRs) to avoid this erratum.

**STATUS:** For the steppings affected see the Summary Table of Changes at the beginning of this section.

### 2AP. Write to Mask LVT (Programmed as EXTINT) Will Not Deassert Outstanding Interrupt

**PROBLEM:** If the APIC subsystem is configured in Virtual Wire Mode implemented through the local APIC (i.e., the 8259 INTR signal is connected to LINTO and LVT1's interrupt delivery mode field is programmed as EXTINT), a write to LVT1 intended to mask interrupts will not deassert the internal interrupt source if the external LINTO signal is already asserted. The interrupt will be erroneously posted to the Intel Celeron processor despite the attempt to mask it via the LVT.

**IMPLICATION:** Because of the masking attempt, interrupts may be generated when the system software expects no interrupts to be posted.

**WORKAROUND:** Software can issue a write to the 8259A interrupt mask register to deassert the LINTO interrupt level, followed by a read to the controller to ensure that the LINTO signal has been deasserted. Once this is ensured, software may then issue the write to mask LVT entry 1.

**STATUS:** For the steppings affected see the Summary Table of Changes at the beginning of this section.

### **DOCUMENTATION CHANGES**

The Documentation Changes listed in this section apply to the *Intel Celeron*<sup>TM</sup> *Processor at 266 MHz* datasheet and the *Intel Architecture Software Developer's Manual, Volumes 1, 2, and 3.* All Documentation Changes will be incorporated into a future version of the appropriate Intel Celeron processor documentation.

**Note:** The *Intel Architecture Software Developer's Manual*, *Volumes 1, 2, and 3*, applies to all P6 family processors, and therefore some of the documentation changes in this section may not pertain to the Intel Celeron processor specifically.

### 1. Invalid Arithmetic Operations and Masked Responses to Them Relative to FIST/FISTP Instruction

The *Intel Architecture Software Developer's Manual, Volume 1*, Table 7-20 show "Invalid Arithmetic Operations and the Masked Responses to Them." The table entry corresponding to the FIST/FISTP condition is missing, and is shown below:

| Condition                                                                               | Masked Response                       |
|-----------------------------------------------------------------------------------------|---------------------------------------|
| FIST/FISTP instruction when input<br>operand <> MAXINT for destination<br>operand size. | Return MAXNEG to destination operand. |

When FIST/FISTP instruction is executed with input operand <> and the destination operand size is MAXINT, the floating-point zero-divide exception will return MAXNEG to the destination operand as its masked response.

### 2. FIDIV/FIDIVR m16int Description

The *Intel Architecture Software Developer's Manual, Volume 1*, pages 3-118 and 3-122, show in the Description column for the FIDIV *m16int* instruction as "Divide ST(0) by *m64int* by ST(0) and store the result in ST(0)" and FIDIVR *m16int* instruction as "Divide *m64int* by ST(0) and store the result in ST(0)" In both of these cases, *m64int* should be replaced with *m16int*.

### 3. PUSH Does Not Pad With Zeros

The *Intel Architecture Software Developer's Manual, Volume 1*, page 4-3, contain a section regarding stack alignment. The last sentence in the first paragraph of this section, which reads "If a 16-bit value is pushed onto a 32-bit wide stack, the value is

automatically padded with zeros out to 32-bits." should be removed. The PUSH instruction does not pad with zeros.

### 4. DR7, Bit 10 is Reserved

The Intel Architecture Software Developer's Manual, Volume 3: System Programming Guide, shows Figure 14-1, "Debug Registers." Bit 10 of DR7 should be "Reserved" instead of "1."

### 5. Cache and TLB Description Correction

In the *Intel Architecture Software Developer's Manual, Volume 2: Instruction Set Reference*, Table 3-7, the correct description for descriptor value 02H should be as follows:

| Descriptor Value | Cache or TLB Description                                     |
|------------------|--------------------------------------------------------------|
| 02H              | Instruction TLB: 4M-Byte Pages, fully associative, 2 entries |

Also, the third bullet after the table should be as follows:

I. Bytes 1, 2, and 3 of register EAX indicate that the processor contains the following: -01H-A 32-entry instruction TLB (4-way set associative) for mapping 4-Kbytes pages.

-02H-A **2**-entry instruction TLB (**fully** associative) for mapping 4-Mbyte pages. -03H-A 64-entry data TLB (4-way set associative) for mapping 4-Kbyte pages.

For the *Intel Architecture Software Developer's Manual, Volume 3: System Programming Guide*, Table 9-1, the following corrections should be made:

| Cache or Buffer | Characteristics                                                            |
|-----------------|----------------------------------------------------------------------------|
| Instruction TLB | - P6 family processors: 2 entries, fully associative.                      |
| (Large Pages)   | - Pentium <sup>®</sup> processor: Uses same TLB as used for 4-Kbyte pages. |
|                 | - Intel486 <sup>TM</sup> processor: None (large pages not supported).      |

### 6. SMRAM State Save Map Contains Documentation Errors

In the Intel Architecture Software Developer's Manual, Volume 3, System Programming Guide, Chapter 11, "System Management Mode," Table 11-1 incorrectly documents the SMBASE+Offset for IDT Base and GDT Base for Intel Celeron processors.

int<sub>el</sub>.

The storage locations for these parameters are model specific (i.e., they may differ between the Pentium processor, the Pentium Pro processor, Pentium II processor, Intel Celeron processor, and other P6 family proliferations). These entries in the tables above will be changed to Reserved. Hardware and software may not rely on the contents of these Reserved regions.

### SPECIFICATION CLARIFICATIONS

The Specification Clarifications listed in this section apply to the *Intel Celeron*<sup>TM</sup> *Processor at 266 MHz* datasheet and the *Intel Architecture Software Developer's Manual, Volumes 1, 2, and 3.* All Specification Clarifications will be incorporated into a future version of the appropriate Intel Celeron processor documentation.

**Note:** The *Intel Architecture Software Developer's Manual, Volumes 1, 2, and 3,* applies to all P6 family processors, and therefore some of the specification clarifications in this section may not pertain to the Intel Celeron processor specifically.

### 1. Writes to WC Memory

Section 9.3. of the *Intel Architecture Software Developer's Manual, Volume 3: System Programming Guide* identifies that "Writes" to a region of WC memory "may be delayed and combined in the write buffer to reduce memory accesses." This sentence should state that "Writes" to a region of WC memory "may be delayed and combined in the write buffer to reduce memory accesses. The writes may be delayed until the next occurrence of a buffer or processor serialization event, e.g., CPUID execution, a read or write to uncached memory, interrupt occurrence, LOCKed instruction execution, etc., if the WC buffer is partially filled."

### 2. Multiple Processors Protocol and Restrictions

Section 7.6.1. of the *Intel Architecture Software Developer's Manual, Volume 3: System Programming Guide*, contain inconsistencies which will be clarified as follows:

### 7.6.1. Protocol Requirements and Restrictions

The MP protocol imposes the following requirements and restrictions on the system:

- An APIC clock (APICLK) must be provided on all systems based on the Pentium<sup>®</sup> Pro processor.
- All interrupt mechanisms must be disabled for the duration of the MP protocol algorithm including the window of time between the assertion of INIT# or receipt of an INIT IPI by the application processors and the receipt of a STARTUP IPI by the application processors. That is, requests generated by interrupting devices must not be seen by the local APIC unit (on board the processor) until the completion of the algorithm. Failure to disable the interrupt mechanisms may result in processor shutdown.

• The MP protocol should be initiated only after a hardware reset. After completion of the protocol algorithm, a flag is set in the APIC base MSR of the BSP (APIC\_BASE.BSP) to indicate that it is the BSP. This flag is cleared for all other processors. If a processor or the system is subject to an INIT sequence (either through the INIT# pin or an INIT IPI), then the MP protocol is not re-executed. Instead, each processor examines its BSP flag to determine whether the processor should boot or wait for a STARTUP IPI.

Into

### 3. Critical Sequence of Events During a Page Fault Exception

Section 3.6.4., "Page-Directory and Page-Table Entries," in the *Intel Architecture* Software Developer's Manual, Volume 3: System Programming Guide, will be clarified as follows:

If the processor generates a page-fault exception, the operating system must carry out the following operations in this order:

- 1. Copy the page from disk storage into physical memory if needed.
- 2. Load the page address into the page-table or page-directory entry and set its present flag. Other bits, such as the dirty and accessed bits, may also be set at this time.
- 3. Invalidate the current page table entry in the TLB (see Section 3.7., "Translation Lookaside Buffers (TLBs)" for a discussion of TLBs and how to invalidate them).
- 4. Return from the page fault handler to restart the interrupted program or task.

### 4. Performance-Monitoring Counter Issues

The following table documents the characterized differences between the behavior of the Intel Celeron processor's performance-monitoring counters and that documented in Appendix A of the *Intel Architecture Software Developer's Manual, Volume 3: System Programming Guide.* 

The following table replaces Table A-1 of the *Intel Architecture Software Developer's Manual, Volume 3: System Programming Guide*. The only change to this new table are enhanced descriptions of the events counted.

# int<sub>el</sub>,

### INTEL CELERON<sup>TM</sup> PROCESSOR SPECIFICATION UPDATE

| Unit                           | Event<br>Number | Mnemon-<br>ic Event<br>Name | Unit<br>Mask | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          | Comments |
|--------------------------------|-----------------|-----------------------------|--------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------|
| Data<br>Cache<br>Unit<br>(DCU) | 43H             | DATA_<br>MEM_<br>REFS       | 00H          | All loads from any<br>memory type. All<br>stores to any<br>memory type. Each<br>part of a split is<br>counted separately.<br>The internal logic<br>counts not only<br>external memory<br>loads and stores, but<br>also internal retries.<br>Note: 80 bit floating<br>point accesses are<br>double counted,<br>since they are<br>decomposed into a<br>16 bit exponent load<br>and a 64 bit mantissa<br>load.<br>Memory accesses<br>are only counted<br>when they are<br>actually performed.<br>E.g., a load that gets<br>squashed because a<br>previous cache miss<br>is outstanding to the<br>same address, and<br>which finally gets<br>performed, is only<br>counted once.<br>Does not include I/O<br>accesses, or other<br>nonmemory<br>accesses. |          |
|                                | 45H             | DCU_LIN<br>ES_IN            | 00H          | Total lines allocated in the DCU.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |          |
|                                | 46H             | DCU_M_<br>LINES_IN          | 00H          | Number of M state<br>lines allocated in the<br>DCU.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |          |

int<sub>el</sub>.

| Unit                                       | Event<br>Number | Mnemon-<br>ic Event<br>Name            | Unit<br>Mask | Description                                                                                                                                                                                                                                                                                                                                                  | Comments                                                                                                                                                                                                                                                                                |
|--------------------------------------------|-----------------|----------------------------------------|--------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|                                            | 47H             | DCU_M_<br>LINES_<br>OUT                | 00H          | Number of M state<br>lines evicted from<br>the DCU. This<br>includes evictions<br>via snoop HITM,<br>intervention or<br>replacement.                                                                                                                                                                                                                         |                                                                                                                                                                                                                                                                                         |
|                                            | 48H             | DCU_<br>MISS_<br>OUT-<br>STAND-<br>ING | 00H          | Weighted number of<br>cycles while a DCU<br>miss is outstanding,<br>incremented by the<br>number of<br>outstanding cache<br>misses at any<br>particular time.<br>Cacheable read<br>requests only are<br>considered.<br>Uncacheable<br>requests are<br>excluded. Read-for-<br>ownerships are<br>counted as well as<br>line fills, invalidates,<br>and stores. | An access that<br>also misses the L2<br>is short-changed<br>by 2 cycles. (i.e.,<br>if count is N<br>cycles, should be<br>N+2 cycles.)<br>Subsequent loads<br>to the same cache<br>line will not result<br>in any additional<br>counts. Count<br>value not precise,<br>but still useful. |
| Instr-<br>uction<br>Fetch<br>Unit<br>(IFU) | 80H             | IFU_<br>IFETCH                         | 00H          | Number of<br>instruction fetches,<br>both cacheable and<br>noncacheable.<br>Including UC<br>fetches.                                                                                                                                                                                                                                                         |                                                                                                                                                                                                                                                                                         |
|                                            | 81H             | IFU_<br>IFETCH_<br>MISS                | 00H          | Number of<br>instruction fetch<br>misses. All<br>instruction fetches<br>that do not hit the<br>IFU, i.e., that<br>produce memory<br>requests. Includes<br>UC accesses.                                                                                                                                                                                       |                                                                                                                                                                                                                                                                                         |

| Unit         | Event<br>Number | Mnemon-<br>ic Event<br>Name | Unit<br>Mask | Description                                                                                                                                                                                                                                                                                      | Comments |
|--------------|-----------------|-----------------------------|--------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------|
|              | 85H             | ITLB_<br>MISS               | 00H          | Number of ITLB misses.                                                                                                                                                                                                                                                                           |          |
|              | 86H             | IFU_<br>MEM_<br>STALL       | 00H          | Number of cycles<br>instruction fetch is<br>stalled, for any<br>reason. Includes IFU<br>cache misses, ITLB<br>misses, ITLB faults<br>and other minor<br>stalls.                                                                                                                                  |          |
|              | 87H             | ILD_<br>STALL               | 00H          | Number of cycles<br>that the instruction<br>length decoder is<br>stalled.                                                                                                                                                                                                                        |          |
| L2<br>Cachel | 28H             | L2_<br>IFETCH               | MESI<br>0FH  | Number of L2<br>instruction fetches.<br>This event indicates<br>that a normal<br>instruction fetch was<br>received by the L2.<br>The count includes<br>only L2 cacheable<br>instruction fetches; it<br>does not include UC<br>instruction fetches. It<br>does not include<br>ITLB miss accesses. |          |
|              | 29Н             | L2_LD                       | MESI<br>0FH  | Number of L2 data<br>loads. This event<br>indicates that a<br>normal, unlocked,<br>load memory access<br>was received by the<br>L2. It includes only<br>L2 cacheable<br>memory accesses; it<br>does not include I/O<br>accesses, other<br>nonmemory<br>accesses, or memory<br>accesses such as   |          |

| Unit | Event<br>Number | Mnemon-<br>ic Event<br>Name | Unit<br>Mask | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      | Comments |
|------|-----------------|-----------------------------|--------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------|
|      |                 |                             |              | UC/WT memory<br>accesses. It does<br>include L2 cacheable<br>TLB miss memory<br>accesses.                                                                                                                                                                                                                                                                                                                                                                                                                                        |          |
|      | 2AH             | L2_ST                       | MESI<br>OFH  | Number of L2 data<br>stores. This event<br>indicates that a<br>normal, unlocked,<br>store memory access<br>was received by the<br>L2. Specifically, it<br>indicates that the<br>DCU sent a read-for-<br>ownership request to<br>the L2. It also<br>includes Invalid to<br>Modified requests<br>sent by the DCU to<br>the L2. It includes<br>only L2 cacheable<br>store memory<br>accesses; it does not<br>include I/O accesses,<br>other nonmemory<br>accesses like<br>UC/WT stores. It<br>includes TLB miss<br>memory accesses. |          |
|      | 24H             | L2_LINES<br>_IN             | 00H          | Number of lines allocated in the L2.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             |          |
|      | 26H             | L2_LINES<br>_OUT            | 00H          | Number of lines<br>removed from the L2<br>for any reason.                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |          |
|      | 25H             | L2_M_<br>LINES_<br>INM      | 00H          | Number of modified<br>lines allocated in the<br>L2.                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |          |

| Unit                                             | Event<br>Number | Mnemon-<br>ic Event<br>Name | Unit<br>Mask                  | Description                                                                                                                  | Comments                                                                                                                                                                      |
|--------------------------------------------------|-----------------|-----------------------------|-------------------------------|------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|                                                  | 27H             | L2_M_<br>LINES_<br>OUTM     | 00H                           | Number of modified<br>lines removed from<br>the L2 for any<br>reason.                                                        |                                                                                                                                                                               |
|                                                  | 2EH             | L2_<br>RQSTS                | MESI<br>0FH                   | Total number of L2 requests.                                                                                                 |                                                                                                                                                                               |
|                                                  | 21H             | L2_ADS                      | 00H                           | Number of L2 address strobes.                                                                                                |                                                                                                                                                                               |
|                                                  | 22H             | L2_DBUS_<br>BUSY            | 00H                           | Number of cycles<br>during which the L2<br>cache data bus was<br>busy.                                                       |                                                                                                                                                                               |
|                                                  | 23Н             | L2_DBUS_<br>BUSY_ RD        | 00H                           | Number of cycles<br>during which the<br>data bus was busy<br>transferring read data<br>from L2 to the<br>processor.          |                                                                                                                                                                               |
| Exter-<br>nal Bus<br>Logic<br>(EBL) <sup>2</sup> | 62H             | BUS_<br>DRDY_<br>CLOCKS     | 00H<br>(Self)<br>20H<br>(Any) | Number of clocks<br>during which<br>DRDY# is asserted.<br>Essentially,<br>utilization of the<br>external system data<br>bus. | Unit Mask = 00H<br>counts bus clocks<br>when the<br>processor is<br>driving DRDY#.<br>Unit Mask = 20H<br>counts in<br>processor clocks<br>when any agent is<br>driving DRDY#. |
|                                                  | 63H             | BUS_<br>LOCK_<br>CLOCKS     | 00H<br>(Self)<br>20H<br>(Any) | Number of clocks<br>during which<br>LOCK# is asserted<br>on the external<br>system bus.                                      | Always counts in processor clocks.                                                                                                                                            |

| Unit | Event<br>Number | Mnemon-<br>ic Event<br>Name       | Unit<br>Mask                  | Description                                                                                                                                                       | Comments                                                                                                                                                                                           |
|------|-----------------|-----------------------------------|-------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|      | 60H             | BUS_REQ<br>_OUT-<br>STAND-<br>ING | 00H<br>(Self)                 | Number of bus<br>requests outstanding.<br>This counter is<br>incremented by the<br>number of cacheable<br>read bus requests<br>outstanding in any<br>given cycle. | Counts only DCU<br>full-line cacheable<br>reads, not RFOs,<br>writes, instruction<br>fetches, or<br>anything else.<br>Counts "waiting<br>for bus to<br>complete" (last<br>data chunk<br>received). |
|      | 65H             | BUS_<br>TRAN_<br>BRD              | 00H<br>(Self)<br>20H<br>(Any) | Number of burst read transactions.                                                                                                                                |                                                                                                                                                                                                    |
|      | 66H             | BUS_<br>TRAN_<br>RFO              | 00H<br>(Self)<br>20H<br>(Any) | Number of<br>completed read for<br>ownership<br>transactions.                                                                                                     |                                                                                                                                                                                                    |
|      | 67H             | BUS_<br>TRANS_W<br>B              | 00H<br>(Self)<br>20H<br>(Any) | Number of<br>completed write<br>back transactions.                                                                                                                |                                                                                                                                                                                                    |
|      | 68H             | BUS_<br>TRAN_<br>IFETCH           | 00H<br>(Self)<br>20H<br>(Any) | Number of<br>completed<br>instruction fetch<br>transactions.                                                                                                      |                                                                                                                                                                                                    |
|      | 69H             | BUS_<br>TRAN_<br>INVAL            | 00H<br>(Self)<br>20H<br>(Any) | Number of<br>completed invalidate<br>transactions.                                                                                                                |                                                                                                                                                                                                    |
|      | 6AH             | BUS_<br>TRAN_<br>PWR              | 00H<br>(Self)<br>20H<br>(Any) | Number of<br>completed partial<br>write transactions.                                                                                                             |                                                                                                                                                                                                    |
|      | 6BH             | BUS_<br>TRANS_P                   | 00H<br>(Self)<br>20H<br>(Any) | Number of<br>completed partial<br>transactions.                                                                                                                   |                                                                                                                                                                                                    |

| Unit | Event<br>Number | Mnemon-<br>ic Event<br>Name | Unit<br>Mask                  | Description                                                                                                                                                                         | Comments                                                                                                  |
|------|-----------------|-----------------------------|-------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------|
|      | 6CH             | BUS_<br>TRANS_<br>IO        | 00H<br>(Self)<br>20H<br>(Any) | Number of<br>completed I/O<br>transactions.                                                                                                                                         |                                                                                                           |
|      | 6DH             | BUS_<br>TRAN_<br>DEF        | 00H<br>(Self)<br>20H<br>(Any) | Number of<br>completed deferred<br>transactions.                                                                                                                                    |                                                                                                           |
|      | 6EH             | BUS_<br>TRAN_<br>BURST      | 00H<br>(Self)<br>20H<br>(Any) | Number of<br>completed burst<br>transactions.                                                                                                                                       |                                                                                                           |
|      | 70H             | BUS_<br>TRAN_<br>ANY        | 00H<br>(Self)<br>20H<br>(Any) | Number of all<br>completed bus<br>transactions. Address<br>bus utilization can be<br>calculated knowing<br>the minimum<br>address bus<br>occupancy. Includes<br>special cycles etc. |                                                                                                           |
|      | 6FH             | BUS_<br>TRAN_<br>MEM        | 00H<br>(Self)<br>20H<br>(Any) | Number of<br>completed memory<br>transactions.                                                                                                                                      |                                                                                                           |
|      | 64H             | BUS_<br>DATA_<br>RCV        | 00H<br>(Self)                 | Number of bus clock<br>cycles during which<br>this processor is<br>receiving data.                                                                                                  |                                                                                                           |
|      | 61H             | BUS_BNR<br>_DRV             | 00H<br>(Self)                 | Number of bus clock<br>cycles during which<br>this processor is<br>driving the BNR#<br>pin.                                                                                         |                                                                                                           |
|      | 7АН             | BUS_HIT_<br>DRV             | 00H<br>(Self)                 | Number of bus clock<br>cycles during which<br>this processor is<br>driving the HIT# pin.                                                                                            | Includes cycles<br>due to snoop<br>stalls.<br>The event counts<br>correctly, but the<br>BPM <i>i</i> pins |

| Unit | Event<br>Number | Mnemon-<br>ic Event<br>Name | Unit<br>Mask  | Description                                                                                  | Comments                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |
|------|-----------------|-----------------------------|---------------|----------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|      |                 |                             |               |                                                                                              | function as<br>follows based on<br>the setting of the<br>PC bits (bit 19 in<br>the PerfEvtSel0<br>and PerfEvtSel1<br>registers). If the<br>core clock to bus<br>clock ratio is 2:1<br>or 3:1, and a PC<br>bit is set, the<br>BPM <i>i</i> pins will be<br>asserted for a<br>single clock when<br>the counters<br>overflow. If the<br>PC bit is clear, the<br>processor toggles<br>the BPM <i>i</i> pins<br>when the counter<br>overflows. If the<br>clock ratio is not<br>2:1 or 3:1, the<br>BPM <i>i</i> pins will<br>not function for<br>these<br>performance-<br>monitoring<br>counter events. |
|      | 7BH             | BUS_<br>HITM_<br>DRV        | 00H<br>(Self) | Number of bus clock<br>cycles during which<br>this processor is<br>driving the HITM#<br>pin. | Includes cycles<br>due to snoop<br>stalls.<br>The event counts<br>correctly, but the<br>BPM <i>i</i> pins<br>function as<br>follows based on<br>the setting of the<br>PC bits (bit 19 in<br>the PerfEvtSel0<br>and PerfEvtSel1<br>registers). If the<br>core clock to bus                                                                                                                                                                                                                                                                                                                         |

| Unit                           | Event<br>Number | Mnemon-<br>ic Event<br>Name | Unit<br>Mask  | Description                                                                                                                                                                                                                                                                                                                                                                                  | Comments                                                                                                                                                                                                                                                                                                                                                                                                                          |
|--------------------------------|-----------------|-----------------------------|---------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|                                |                 |                             |               |                                                                                                                                                                                                                                                                                                                                                                                              | clock ratio is 2:1<br>or 3:1, and a PC<br>bit is set, the<br>BPM <i>i</i> pins will be<br>asserted for a<br>single clock when<br>the counters<br>overflow. If the<br>PC bit is clear, the<br>processor toggles<br>the BPM <i>i</i> pins<br>when the counter<br>overflows. If the<br>clock ratio is not<br>2:1 or 3:1, the<br>BPM <i>i</i> pins will<br>not function for<br>these<br>performance-<br>monitoring<br>counter events. |
|                                | 7EH             | BUS_<br>SNOOP_<br>STALL     | 00H<br>(Self) | Number of clock<br>cycles during which<br>the bus is snoop<br>stalled.                                                                                                                                                                                                                                                                                                                       |                                                                                                                                                                                                                                                                                                                                                                                                                                   |
| Float-<br>ing<br>Point<br>Unit | СІН             | FLOPS                       | 00H           | Number of<br>computational<br>floating-point<br>operations retired.<br>Excludes floating<br>point computational<br>operations that cause<br>traps or assists.<br>Includes floating<br>point computational<br>operations executed<br>by the assist handler.<br>Includes internal<br>sub-operations of<br>complex floating<br>point instructions<br>like transcendentals.<br>Excludes floating | Counter 0 only                                                                                                                                                                                                                                                                                                                                                                                                                    |

| Unit | Event<br>Number | Mnemon-<br>ic Event<br>Name | Unit<br>Mask | Description                                                                                                                                                                                                                                                                                                                                                                                                              | Comments                                                                             |
|------|-----------------|-----------------------------|--------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------|
|      |                 |                             |              | point loads and stores.                                                                                                                                                                                                                                                                                                                                                                                                  |                                                                                      |
|      | 10H             | FP_COMP<br>_OPS_<br>EXE     | 00H          | Number of<br>computational<br>floating-point<br>operations executed.<br>The number of<br>FADD, FSUB,<br>FCOM, FMULs,<br>integer MULs and<br>IMULs, FDIVs,<br>FPREMs, FSQRTS,<br>integer DIVs and<br>IDIVs. Note not the<br>number of cycles<br>but, the number of<br>operations. This<br>event does not<br>distinguish an FADD<br>used in the middle of<br>a transcendental flow<br>from a separate<br>FADD instruction. | Counter 0 only                                                                       |
|      | 11H             | FP_<br>ASSIST               | 00H          | Number of floating-<br>point exception cases<br>handled by<br>microcode.                                                                                                                                                                                                                                                                                                                                                 | Counter 1 only.<br>This event<br>includes counts<br>due to speculative<br>execution. |
|      | 12H             | MUL                         | 00H          | Number of<br>multiplies. Note:<br>includes integer and<br>well FP multiplies<br>and is speculative.                                                                                                                                                                                                                                                                                                                      | Counter 1 only                                                                       |
|      | 13H             | DIV                         | 00H          | Number of divides.<br>Note: includes<br>integer and FP<br>multiplies and is<br>speculative.                                                                                                                                                                                                                                                                                                                              | Counter 1 only                                                                       |
|      | 14H             | CYCLES_<br>DIV_<br>BUSY     | 00H          | Number of cycles<br>that the divider is<br>busy, and cannot                                                                                                                                                                                                                                                                                                                                                              | Counter 0 only                                                                       |

| Unit                         | Event<br>Number | Mnemon-<br>ic Event<br>Name | Unit<br>Mask | Description                                                                                                                                                                                                                                                                                                                          | Comments |
|------------------------------|-----------------|-----------------------------|--------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------|
|                              |                 |                             |              | accept new divides.<br>Note: includes<br>integer and FP<br>divides, FPREM,<br>FPSQRT, etc. and is<br>speculative.                                                                                                                                                                                                                    |          |
| Mem-<br>ory<br>Order-<br>ing | 03H             | LD_<br>BLOCKS               | 00H          | Number of store<br>buffer blocks.<br>Includes counts<br>caused by preceding<br>stores whose<br>addresses are<br>unknown, preceding<br>stores whose<br>addresses are known<br>to conflict, but<br>whose data is<br>unknown and<br>preceding stores that<br>conflicts with the<br>load, but which<br>incompletely overlap<br>the load. |          |
|                              | 04H             | SB_DRAI<br>NS               | 00H          | Number of store<br>buffer drain cycles.<br>Incremented during<br>every cycle the store<br>buffer is draining.<br>Draining is caused<br>by serializing<br>operations like<br>CPUID,<br>synchronizing<br>operations like<br>XCHG, Interrupt<br>acknowledgment as<br>well as other<br>conditions such as<br>cache flushing.             |          |

| Unit                                                    | Event<br>Number | Mnemon-<br>ic Event<br>Name   | Unit<br>Mask | Description                                                                                                                                                                                                                                                                                                                                                                        | Comments                                                                                                                                                                                                                                                                                       |
|---------------------------------------------------------|-----------------|-------------------------------|--------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|                                                         | 05H             | MIS-<br>ALIGN_<br>MEM_<br>REF | 00H          | Number of<br>misaligned data<br>memory references.<br>Incremented by 1<br>every cycle during<br>which either the load<br>or store pipeline<br>dispatches a<br>misaligned uop.<br>Counting is<br>performed if its the<br>first half or second<br>half, or if it is<br>blocked, squashed or<br>misses.<br>Note in this context<br>misaligned means<br>crossing a 64 bit<br>boundary. | It should be noted<br>that<br>MISALIGN_ME<br>M_REF is only an<br>approximation, to<br>the true number of<br>misaligned<br>memory<br>references. The<br>value returned is<br>roughly<br>proportional to the<br>number of<br>misaligned<br>memory accesses,<br>i.e., the size of the<br>problem. |
| Instr-<br>uction<br>Decod-<br>ing and<br>Retire<br>ment | СОН             | INST_<br>RETIRED              | ООН          | Number of instructions retired.                                                                                                                                                                                                                                                                                                                                                    | A hardware<br>interrupt received<br>during/after the last<br>iteration of the REP<br>STOS flow causes<br>the counter to<br>undercount by 1<br>instruction.                                                                                                                                     |
|                                                         | С2Н             | UOPS_<br>RETIRED              | 00H          | Number of UOPs retired.                                                                                                                                                                                                                                                                                                                                                            |                                                                                                                                                                                                                                                                                                |
|                                                         | D0H             | INST_<br>DECOD-<br>ER         | 00H          | Number of instructions decoded.                                                                                                                                                                                                                                                                                                                                                    |                                                                                                                                                                                                                                                                                                |
| Inter-<br>rupts                                         | С8Н             | HW_INT_<br>RX                 | 00H          | Number of hardware interrupts received.                                                                                                                                                                                                                                                                                                                                            |                                                                                                                                                                                                                                                                                                |
|                                                         | С6Н             | CYCLES_<br>INT_<br>MASKED     | 00H          | Number of processor<br>cycles for which<br>interrupts are<br>disabled.                                                                                                                                                                                                                                                                                                             |                                                                                                                                                                                                                                                                                                |

| Unit          | Event<br>Number | Mnemon-<br>ic Event<br>Name                   | Unit<br>Mask | Description                                                                                                                                                                                                                   | Comments |
|---------------|-----------------|-----------------------------------------------|--------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------|
|               | С7Н             | CYCLES_<br>INT_<br>PENDING<br>_AND_<br>MASKED | 00H          | Number of processor<br>cycles for which<br>interrupts are<br>disabled and<br>interrupts are<br>pending.                                                                                                                       |          |
| Bran-<br>ches | C4H             | BR_INST_<br>RETIRED                           | 00H          | Number of branch instructions retired.                                                                                                                                                                                        |          |
|               | С5Н             | BR_MISS_<br>PRED_<br>RETIRED                  | 00H          | Number of<br>mispredicted<br>branches retired.                                                                                                                                                                                |          |
|               | С9Н             | BR_<br>TAKEN_<br>RETIRED                      | 00H          | Number of taken<br>branches retired.                                                                                                                                                                                          |          |
|               | САН             | BR_MISS_<br>PRED_<br>TAKEN_<br>RET            | 00H          | Number of taken<br>mispredictions<br>branches retired.                                                                                                                                                                        |          |
|               | E0H             | BR_INST_<br>DECOD-<br>ED                      | 00H          | Number of branch instructions decoded.                                                                                                                                                                                        |          |
|               | E2H             | BTB_<br>MISSES                                | 00H          | Number of branches<br>that for which the<br>BTB did not produce<br>a prediction                                                                                                                                               |          |
|               | E4H             | BR_<br>BOGUS                                  | 00H          | Number of bogus branches.                                                                                                                                                                                                     |          |
|               | E6H             | BA-<br>CLEARS                                 | 00H          | Number of time<br>BACLEAR is<br>asserted. This is the<br>number of times that<br>a static branch<br>prediction was made,<br>where the branch<br>decoder decided to<br>make a branch<br>prediction because<br>the BTB did not. |          |

| Unit                                   | Event<br>Number | Mnemon-<br>ic Event<br>Name    | Unit<br>Mask | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    | Comments |
|----------------------------------------|-----------------|--------------------------------|--------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------|
| Stalls                                 | А2Н             | RE-<br>SOURCE_<br>STALLS       | 00H          | Incremented by one<br>during every cycle<br>that there is a<br>resource related stall.<br>Includes register<br>renaming buffer<br>entries, memory<br>buffer entries. Does<br>not include stalls due<br>to bus queue full, too<br>many cache misses,<br>etc. In addition to<br>resource related<br>stalls, this event<br>counts some other<br>events.<br>Includes stalls<br>arising during branch<br>misprediction<br>recovery, e.g., if<br>retirement of the<br>mispredicted branch<br>is delayed and stalls<br>arising while store<br>buffer is draining<br>from synchronizing<br>operations. |          |
|                                        | D2H             | PARTIAL_<br>RAT_<br>STALLS     | 00H          | Number of cycles or<br>events for partial<br>stalls. Note Includes<br>flag partial stalls.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |          |
| Seg-<br>ment<br>Reg-<br>ister<br>Loads | 06H             | SEG-<br>MENT_<br>REG_<br>LOADS | 00H          | Number of segment<br>register loads                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |          |
| Clocks                                 | 79H             | CPU_CLK<br>_UN-<br>HALTED      | 00H          | Number of cycles<br>during which the<br>processor is not<br>halted.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            |          |

NOTES:

intجا

- Several L2 cache events, where noted, can be further qualified using the Unit Mask (UMSK) field in the PerfEvtSel0 and PerfEvtSel1 registers. The lower 4 bits of the Unit Mask field are used in conjunction with L2 events to indicate the cache state or cache states involved. The Pentium<sup>®</sup> II processor identifies cache states using the "MESI" protocol and consequently each bit in the Unit Mask field represents one of the four states: UMSK[3] = M (8H) state, UMSK[2] = E (4H) state, UMSK[1] = S (2H) state, and UMSK[0] = I (1H) state. UMSK[3:0] = MESI (FH) should be used to collect data for all states; UMSK = 0H, for the applicable events, will result in nothing being counted.
- All of the external bus logic (EBL) events, except where noted, can be further qualified using the Unit Mask (UMSK) field in the PerfEvtSel0 and PerfEvtSel1 registers. Bit 5 of the UMSK field is used in conjunction with the EBL events to indicate whether the processor should count transactions that are self generated (UMSK[5] = 0) or transactions that result from any processor on the bus (UMSK[5] = 1).

#### 5. POP[ESP] with 16-bit Stack Size

In the *Intel Architecture Software Developer's Manual, Volume 2: Instruction Set Reference*, the section regarding "POP–Pop a Value from the Stack," the following note:

"If the ESP register is used as a base register for addressing a destination operand in memory, the POP instruction computes the effective address of the operand after it increments the ESP register."

is incomplete, and should read as follows:

"If the ESP register is used as a base register for addressing a destination operand in memory, the POP instruction computes the effective address of the operand after it increments the ESP register. For the case of a 16-bit stack where ESP wraps to 0h as a result of the POP instruction, the resulting location of the memory write is processor family specific."

In Section 17.23.1. of the Intel Architecture Software Developer's Manual, Volume 3: System Programming Guide, add a new section:

# A POP-to-memory instruction, which uses the stack pointer (ESP) as a base register.

For a POP-to-memory instruction that meets the following conditions:

- 1. The stack segment size is 16-bit.
- 2. Any 32-bit addressing form with the SIB byte specifying ESP as the base register.
- 3. The initial stack pointer is FFFCh(32-bit operand) or FFFEh (16-bit operand) and will wrap around to 0h as a result of the POP operation.

The result of the memory write is processor family specific. For example, in Pentium II, Pentium Pro, and Intel Celeron processors, the result of the memory write is to SS:0h plus any scaled index and displacement. In Pentium and i486 processors, the result of the memory write may be either a stack fault (real mode or protected mode with stack

### INTEL CELERONTM PROCESSOR SPECIFICATION UPDATE

segment size of 64Kbyte), or write to SS:10000h plus any scaled index and displacement (protected mode and stack segment size exceeds 64Kbyte).

intപം