

# Smart Data Accelerator Interface ("SDXI") Specification

Version 1.0a

ABSTRACT: Smart Data Accelerator Interface (SDXI) is a standard for a memory-tomemory data mover and acceleration interface.

This document has been released and approved by the SNIA. The SNIA believes that the ideas, methodologies and technologies described in this document accurately represent the SNIA goals and are appropriate for widespread distribution. Suggestions for revisions should be directed to <u>https://www.snia.org/feedback/</u>

# **SNIA Standard**

October 3, 2024

#### **USAGE**

Copyright © 2024 SNIA. All rights reserved. All other trademarks or registered trademarks are the property of their respective owners.

The SNIA hereby grants permission for individuals to use this document for personal use only, and for corporations and other business entities to use this document for internal use only (including internal copying, distribution, and display) provided that:

- 1. Any text, diagram, chart, table or definition reproduced shall be reproduced in its entirety with no alteration, and,
- 2. Any document, printed or electronic, in which material from this document (or any portion hereof) is reproduced shall acknowledge the SNIA copyright on that material, and shall credit the SNIA for granting permission for its reuse.

Other than as explicitly provided above, you may not make any commercial use of this document or any portion thereof, or distribute this document to third parties. All rights not explicitly granted are expressly reserved to SNIA.

Permission to use this document for purposes other than those enumerated above may be requested by e-mailing tcmd@snia.org. Please include the identity of the requesting individual and/or company and a brief description of the purpose, nature, and scope of the requested use.

All code fragments, scripts, data tables, and sample code in this SNIA document are made available under the following license:

BSD 3-Clause Software License

Copyright (c) 2024, The Storage Networking Industry Association.

Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met:

\* Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer.

\* Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution.

\* Neither the name of The Storage Networking Industry Association (SNIA) nor the names of its contributors may be used to endorse or promote products derived from this software without specific prior written permission.

THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

#### DISCLAIMER

The information contained in this publication is subject to change without notice. The SNIA makes no warranty of any kind with regard to this specification, including, but not limited to, the implied warranties of merchantability and fitness for a particular purpose. The SNIA shall not be liable for errors contained herein or for incidental or consequential damages in connection with the furnishing, performance, or use of this specification.

#### **Revision History**

| Revision   | Date              | Comments                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             |
|------------|-------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 1.0a       | October 3, 2024   | <ul> <li>#147 Handling Illegal writes to MMIO_CTL0.fn_gsr</li> <li>#149 Handling Timeout Waiting for Descriptor vI bit to set</li> <li>#158 Destination buffers should be required to allow RW access by the function</li> <li>#160 Software Requirements Around Enqueuing Descriptors and Setting Valid bits</li> <li>#169 Fix Flag Writes</li> <li>#177 Some readers find Figure 3-11 confusing</li> <li>#178 Remove the Vendor Defined Opcode Group</li> <li>#181 Remove CXT_STS read-only requirement</li> </ul> |
| 1.0        | November 28, 2022 | Official Release                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     |
| 0.9.7-rev0 | November 20, 2022 | Approved by SNIA membership                                                                                                                                                                                                                                                                                                                                                                                                                                                                                          |
| 0.9.7      | September 5, 2022 | Public Review                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |
| 0.9        | July 2021         | First Public Preview                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |
| 0.7        | September 2020    | Starting point Draft Contribution to SNIA SDXI TWG                                                                                                                                                                                                                                                                                                                                                                                                                                                                   |

Suggestions for revisions should be directed to <u>https://www.snia.org/feedback/</u>.

#### **SDXI TWG 1.0a Contributors**

Curtis Ballard, HPE Beau Beachamp, MemVerge Richard Brunner, VMware Xiangping Chen, Dell Don Dutile, IBM Paul Hartke, AMD Shyam Iyer, Dell Inc Travis Hamilton, Arm Brian Hirano, Micron Frederick Knight, NetApp Santosh Kumar, SK Hynix James Leighton, Western Digital Bill Martin, Samsung John Maroney, Micron J Metz, AMD William Moyes, AMD Philip Ng, AMD Murali Ravirala, Microsoft Dwight Riley, HPE Alexandre Romana, Arm Glen Sescila, Dell Inc Paul Von Stamwitz, Fujitsu Jason Wohlgemuth, Microsoft

| 1 SDX | (I: OVERVIEW                                             | 8  |
|-------|----------------------------------------------------------|----|
| 1.1   | SCOPE                                                    | 8  |
| 1.2   | DOCUMENTATION CONVENTIONS                                | 8  |
| 1.2.  | 2.1 Shall, Should, May, and Can                          |    |
| 1.2.  | 2.2 Normative vs. Informative                            |    |
| 1.2.  | 2.3 Reserved                                             |    |
| 1.2.  | 2.4 Developer Notes                                      |    |
| 1.2.  | 2.5 Abbreviations and Terminology                        | 9  |
| 1.2.  | 2.6 Other Clarifying Notes                               |    |
| 1.3   | REFERENCES                                               | 10 |
| 1.4   | DOCUMENT CONSTANTS                                       | 11 |
| 2 BAC | CKGROUND                                                 | 12 |
| 2.1   | ARCHITECTED PLATFORM DATA MOVER                          | 13 |
| 2.1.  | .1 SDXI Descriptor Ring                                  |    |
| 2.1.  | .2 Virtualization Support                                |    |
| 2.2   | DMA ADDRESSING MODES                                     | 15 |
| 2.2.  | 2.1 Address Space Identifier Control                     |    |
| 2.2.  | 2.2 SDXI Function Groups                                 |    |
| 2.2.  | 2.3 Unpinned Memory Access                               |    |
| 2.3   | ADDRESS SPACE CONTROL EXAMPLES                           | 16 |
| 2.3.  | <i>B.1</i> Single-Function, Single Address Space Example |    |
| 2.3.  | 3.2 Single-Function, User Mode Access Example            |    |
| 2.3.  | 8.3 Single-Function, Multiple Address Space Example      |    |
| 2.3.  | 8.4 Cross-Function Transfer Example                      |    |
| 2.4   | MODULARITY AND EXPANDABILITY                             | 22 |
| 2.5   | ENDIAN FORMAT SUPPORT                                    | 22 |
| 3 SYS | TEM MEMORY DATA STRUCTURES                               | 23 |
| 3.1   | OVERVIEW                                                 | 23 |
| 3.2   | CONTEXT DATA STRUCTURES                                  | 26 |
| 3.2.  | 2.1 Context Level 2 Table                                |    |
| 3.2.  | 2.2 Context Level 1 Table                                |    |
| 3.2.  | 2.3 Context Control (CXT_CTL)                            |    |
| 3.2.  | 2.4 Context Status (CXT_STS)                             |    |
| 3.2.  | 2.5 Access Key (AKey) Table Entry (AKEY_ENT)             |    |
| 3.3   | SDXI CROSS-FUNCTION ACCESS                               | 34 |
| 3.3.  | 3.1 SDXI Function Group                                  |    |
| 3.3.  | 8.2 Receiver Access Key (RKey) Table                     |    |
| 3.3.  | 8.3 RKey Table Entry                                     |    |
| 3.3.  | 8.4 Receiver Access Key (RKey) Processing                |    |
| 3.4   | ERROR LOG                                                |    |
| 3.4.  | 1.1 Error Log Header Entry (ERRLOG_HD_ENT)               |    |
| 3.4.  | P.2 Error Log Initialization                             |    |
| 3.4.  | P.3 Error Log Processing by Software                     |    |
| 3.5   | ADMINISTRATIVE CONTEXT (CONTEXT 0)                       | 50 |

|   | 3.6  | DAT         | A STEERING HINTS (DSH)                                 | 50              |
|---|------|-------------|--------------------------------------------------------|-----------------|
| 4 | SDX  | I FU        | NCTION AND CONTEXT STATE                               | 51              |
|   | 4.1  | SD          | XI FUNCTION STATE                                      | 51              |
|   | 4.1. | 1           | G0: GSV_STOP State                                     | 52              |
|   | 4.1. | 2           | G1: GSV_INIT State                                     |                 |
|   | 4.1. | 3           | G2: GSV_ACTIVE State                                   |                 |
|   | 4.1. | 4           | G3: GSV_STOPG_SF ("Soft Stopping") State               | 52              |
|   | 4.1. | 5           | G4: GSV_STOPG_HD ("Hard Stopping") State               | 53              |
|   | 4.1. | 6           | G5: GSV_ERROR State                                    | 53              |
|   | 4.1. | 7           | Function Reset and Outstanding DMA Requests            | 53              |
|   | 4.1. | 8           | Activation of the SDXI Function by Software            |                 |
|   | 4.1. | 9           | Stopping of the SDXI Function by Software              |                 |
|   | 4.2  | SD          | XI CONTEXT STATE                                       | 56              |
|   | 4.2. | 1           | S1: CXTV_INVALID State                                 | 57              |
|   | 4.2. | 2           | S2: CXTV_STOP_[SW, FN] States                          | 57              |
|   | 4.2. | 3           | S3: CXTV_ERR_FN                                        | 58              |
|   | 4.2. | 4           | S4: CXTV_RUN.[RDY, EXEC] States                        | 59              |
|   | 4.2. | 5           | S5: CXTV_STOPG_[SW, FN] States                         | 60              |
|   | 4.2. | 6           | Context-Undefined Operation (CXTV_UNDEF)               |                 |
|   | 4.3  | FUN         | ICTION AND CONTEXT OPERATIONS                          | 62              |
|   | 4.3. | 1           | SDXI Memory-Based Data-Structure Hierarchy and Caching |                 |
|   | 4.3. | 2           | Check Valid Context                                    |                 |
|   | 4.3. | 3           | Doorbell Register and Context Signaling                |                 |
|   | 4.3. | 4<br>7      | Starting A Context and Context Signaling               |                 |
|   | 4.3. | 5           | Function and Context Stop Actions                      |                 |
|   | 4.3. | 0           |                                                        |                 |
|   | 4.4  | AIC         | DMIC OPERATION SUPPORT                                 |                 |
|   | 4.4. | 1           | Completion-Status Capabilities                         |                 |
|   | 4.4. | 2           | Completion-Status Modes                                |                 |
| 5 | SDX  | I DE        | SCRIPTOR RING OPERATION                                | 77              |
|   | 5.1  | Des         | SCRIPTOR OPERATIONS                                    | 79              |
|   | 5.2  | ENC         | QUEUING ONE OR MORE DESCRIPTORS                        | 80              |
|   | 5.2. | 1           | Multi-Producer Enqueue                                 | 83              |
|   | 5.3  | Des         | SCRIPTOR PROCESSING                                    | 83              |
|   | 5.4  | Des         | SCRIPTOR ORDERING AND PARALLEL EXECUTION               | 85              |
|   | 5.5  | Des         | SCRIPTOR COMPLETION                                    | 86              |
|   | 5.6  | Mei         | MORY CONSISTENCY MODEL                                 | 86              |
|   | 5.7  | DES         | CRIPTOR CHAINING                                       |                 |
|   | 5.7. | 1           | Extended Descriptors                                   |                 |
|   | 5.8  | Des         | SCRIPTOR DRIVEN INTERRUPTS                             | 87              |
| e | 60V  |             |                                                        | 00              |
| 0 | 6 1  | םם ו<br>ס⊐ת |                                                        | <b>00</b><br>00 |
|   | 61   | ⊔⊏:<br>1    | Common Header and Footer                               |                 |
|   | 6.1  | '<br>2      | Completion Status Block                                |                 |
|   | 6.1  | ∠<br>3      | Attribute Field                                        | ۵2<br>مع        |
|   | 62   | אח<br>אח    | $\Delta BASE ODERATIONS GROUP (DMARASEGRD)$            |                 |
|   | 0.2  | ואוש        | A DAGE OF ENATIONS GROUP (DIMADAGEGRP)                 |                 |

| 6.2        | 2.1   | DmaBaseGrp: DSC_DMAB_NOP                                           |              |
|------------|-------|--------------------------------------------------------------------|--------------|
| 6.2        | 2.2   | DmaBaseGrp: DSC_DMAB_WRT_IMM Operation                             |              |
| 6.2        | 2.3   | DmaBaseGrp: DSC_DMAB_COPY Operation                                |              |
| 6.2        | 2.4   | DmaBaseGrp: DSC_DMAB_REPCOPY Operation                             |              |
| 6.3        | ATO   | DMIC OPERATION GROUP (ATOMICGRP)                                   | 101          |
| 6.4        | Int   | RGRP OPERATION GROUP                                               | 104          |
| 6.4        | 4.1   | IntrGrp DSC_INTR Operation                                         |              |
| 6.5        | ١٩٧   | NDOR DEFINED OPERATION GROUP                                       | 105          |
| 6.6        | Adi   | MINISTRATIVE OPERATION GROUP (ADMINGRP)                            |              |
| 6.0        | 6.1   | Accessing Contexts, Akey Tables, and RKey Table by Index           | 107          |
| 6.0        | 6.2   | Targeting Multiple Contexts with A Single Administrative Operation | 109          |
| 6.0        | 6.3   | AdminGrp DSC_CXT_START_[NM, RS] Operations                         | 110          |
| 6.0        | 6.4   | AdminGrp DSC_CXT_STOP Operation                                    | 113          |
| 6.0        | 6.5   | AdminGrp DSC_AKEY_UPD Operation                                    | 116          |
| 6.0        | 6.6   | AdminGrp DSC_CXT_UPD Operation                                     | 118          |
| 6.0        | 6.7   | AdminGrp: DSC_FN_UPD Operation                                     | 121          |
| 6.0        | 6.8   | AdminGrp DSC_RKEY_UPD Operation                                    | 123          |
| 6.0        | 6.9   | AdminGrp DSC_SYNC Operation                                        | 125          |
| 6.0        | 6.10  | AdminGrp DSC_ADM_INTR Operation                                    | 128          |
| 7 RE       | сом   | MENDED SEQUENCES FOR FUNCTION MANAGEMENT                           |              |
| 7.1        | Fur   | NCTION LEVEL RESOURCES                                             |              |
| 7.1        | 1.1   | Context Level 2 Table Base (MMIO CXT L2) Modification              |              |
| 7.2        | Со    | NTEXT LEVEL RESOURCES                                              |              |
| 7.2        | 2.1   | Context Level 2 Table Entry (CXT L2 ENT) Modification              |              |
|            |       |                                                                    |              |
| 8 SD)      | XI PC | I-EXPRESS DEVICE ARCHITECTURE                                      | 131          |
| 8.1        | SD    | XI FUNCTION CONFIGURATION SPACE REGISTERS                          | 131          |
| 8.1        | 1.1   | Class Code                                                         |              |
| 8.1        | 1.2   | BAR Configuration                                                  |              |
| 8.         | 1.3   | Required Capabilities and Extended Capabilities                    |              |
| 8.2        | MA    | PPING SFUNC VALUES TO PCIE REQUESTER ID VALUES                     |              |
| 8.3        | MA    | PPING SDXI DSH TO PCIE TLP PROCESSING HINTS (PCIE TPH)             | 133          |
| 8.4        | PC    | IE ATOMIC CAPABILITIES DISCOVERY AND ENABLEMENT                    | 133          |
| 9 MM       |       | ONTROL REGISTERS                                                   |              |
| 91         | GE    | NERAL CONTROL AND STATUS REGISTERS                                 | 136          |
| 9.1        | Co    | NTEXT AND RKEY TABLE REGISTERS                                     | 141          |
| 0.2<br>0 3 | FPI   |                                                                    | 1 <i>1</i> 0 |
| 0.0<br>0.1 |       |                                                                    | 1/5          |
| 9.4<br>0 5 |       |                                                                    |              |
| 9.0<br>0.6 |       |                                                                    | 140          |
| 9.0        |       |                                                                    |              |
| 9.7        | DO    | UKBELL DECTIONS AND REGISTERS                                      | 149          |

This document describes SDXI, an architectural data-mover device for server platforms. SDXI aims to provide a variety of benefits including architectural stability, modularity, virtualization-friendly programming interface, both user and kernel mode support, and new capabilities designed to accelerate virtualized workloads.

# 1.1 Scope

# **1.2 Documentation Conventions**

This specification adheres to various conventions found in the "IEEE Standards Board Operations Manual – Clause 6" at "https://standards.ieee.org/about/policies/opman/sect6/". A few are described here.

### 1.2.1 Shall, Should, May, and Can

- The word "shall" is used to indicate mandatory requirements strictly to be followed in order to conform to the Specification and from which no deviation is permitted ("shall" equals "is required to").
- The use of the word "must" is deprecated and shall not be used when stating mandatory requirements; "must" is used only to describe unavoidable situations.
- The use of the word "will" is deprecated and shall not be used when stating mandatory requirements; will is only used in statements of fact.
- The word "should" is used to indicate that among several possibilities one is recommended as particularly suitable, without mentioning or excluding others; or that a certain course of action is preferred but not necessarily required; or that (in the negative form) a certain course of action is deprecated but not prohibited (should equals is recommended that).
- The word "may" is used to indicate a course of action permissible within the limits of the Specification (may equals is permitted).
- The word "can" is used for statements of possibility and capability, whether material, physical, or causal ("can" equals "is able to").

### 1.2.2 Normative vs. Informative

Normative material is information required to implement the standard and is therefore officially part of the standard. Informative material is provided for information only and is therefore not officially part of the standard.

All sections in this document are normative, unless they are explicitly indicated to be informative at the beginning of each section.

### 1.2.3 Reserved

The following applies to the term "Reserved", "rsvd", and "rsv":

- The contents, state, or information are not specified at this time.
- Any field, feature, capability, etc. marked "Reserved", "rsvd", or "rsv" is subject to change.
- We may use these terms interchangeably.

### 1.2.4 Developer Notes

Developer Notes do not specify normative or optional requirements. They are included for clarification and illustration only. These notes are delineated by:

# 1.2.5 Abbreviations and Terminology

| Term                      | Definition                                                                                                                                                   |
|---------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Administrative<br>Context | A context that supports the use of administrative operations used for managing SDXI Functions and their contexts.                                            |
| AKey                      | Access Key. See "3.2.5, Access Key (AKey) Table Entry (AKEY_ENT)".                                                                                           |
| ATS                       | Address Translation Services. Defined in the PCI Express Base Specification.                                                                                 |
| Context                   | A Context refers to a descriptor ring used for executing operations, along with all associated memory data structures such as control and status information |
| Cross-Function            | An operation where the SDXI Function owning the target resource differs from the requesting SDXI Function.                                                   |
| DMA                       | Direct Memory Access                                                                                                                                         |
| DSH                       | Data Steering Hint See "3.6, Data Steering Hints (DSH)".                                                                                                     |
| Function Group            | A collection of SDXI functions that may access each other's memory and interrupt resources.                                                                  |
| GPA                       | Guest Physical Address                                                                                                                                       |
| GVA                       | Guest Virtual Address                                                                                                                                        |
| HPA                       | Host Physical Address                                                                                                                                        |
| HVA                       | Host Virtual Address                                                                                                                                         |
| Ю                         | Input Output                                                                                                                                                 |
| IOMMU                     | IO Memory Management Unit                                                                                                                                    |

| Term          | Definition                                                                                                                                                                                                                                                                                                                                                              |
|---------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Local         | Local or Local Space (referenced in various structures by an "sfunc" field of "0") corresponds to the SDXI function hosting the context and descriptor.                                                                                                                                                                                                                 |
| MMIO          | Memory mapped IO                                                                                                                                                                                                                                                                                                                                                        |
| MMU           | Memory Management Unit                                                                                                                                                                                                                                                                                                                                                  |
| P2V           | An administrative command from a Physical Function that targets a Virtual Function.                                                                                                                                                                                                                                                                                     |
| PASID         | Process Address Space Identifier. The combination of PASID and Requester ID identifies the address space used by a transaction on the PCIe bus.                                                                                                                                                                                                                         |
| PRI           | Page Request Interface. Defined in the PCI Express Base Specification.                                                                                                                                                                                                                                                                                                  |
| Remote        | May be used by an SDXI Function owning a target resource to refer to a separate requesting SDXI Function. May also be used by a requesting SDXI Function to refer to a target resource owned by a different SDXI Function. Also see Cross-Function.                                                                                                                     |
| Requester ID  | The PCIe bus, device and function number used by PCIe SDXI Functions as part of DMA, interrupt and address translation operations.                                                                                                                                                                                                                                      |
| RKey          | Receiver Access Key. See "3.3.2, Receiver Access Key (RKey) Table".                                                                                                                                                                                                                                                                                                     |
| SDXI Function | A device that implements the SDXI specification. This may be either a physical or virtual function.                                                                                                                                                                                                                                                                     |
| sfunc         | An opaque SDXI Function identifier that maps uniquely to an SDXI function. For example, in a PCIe implementation of SDXI, "sfunc" could map to a PCIe Requester ID. This is used when accessing AKey-referenced data structures. The value of "0" is defined to reference the local hosting SDXI function; all other encodings refer to other non-local SDXI functions. |

### 1.2.6 Other Clarifying Notes

#### 1.2.6.1 Index

Index as used in this specification refers to a table index to a table-specific entry. In order to address the corresponding table entry in memory, the index is multiplied by the size of the table-specific entry and added to the beginning address of the table. For example, if a table starting at address 0x1000 has entries that are 64 bytes in size, then an index into that table needs to be multiplied by 64 and added to 0x1000 to determine the correct memory location of the entry.

# 1.3 References

• PCI Express Base Specification, Revision 5.0

# 1.4 Document Constants

These are defined solely for tooling scripts related to the document.

| Constant     | Value | Description                                       |
|--------------|-------|---------------------------------------------------|
| DOC_REVISION | 1.0   | Document Revision. Must be entered manually here. |

#### Table 1–1: Document CONSTANTS<sup>[^0]</sup>

(The entirety of this chapter is informative.)

The concept of a Direct Memory Access (DMA) data mover device to offload software-based copy loops is well-known. Such offloading is desirable to free up CPU execution cycles. But adoption has been mostly limited to specific privileged software and I/O use cases employing very device-specific interfaces that are not forward compatible. The current limitations make user-mode application usage challenging in a non-virtualized environment and nearly impossible in a multi-tenant virtualized environment. The figure below shows a vision for such an architectural data mover.





We have developed the SDXI Platform Data Mover (aka "SDXI") to provide an architectural interface to address these limitations:

- An extensible, forward-compatible interface that is independent of actual data mover implementations and underlying I/O interconnect technology.
- A standard interface for user-mode address-space to address-space data movement without the need of mediation by privileged software once a connection has been established.
- A standard interface for privileged software to control the data mover including connection management and data movement between multiple address spaces
- An interface that can be abstracted or virtualized by privileged software to allow greater compatibility of workloads or virtual machines across different servers.
- A well-defined capability to quiesce, suspend, and resume the architectural state of a per-addressspace data mover to allow "live" workload or virtual machine migration between servers.
- A forward compatible interface that ensures pre-existing software drivers including user-mode drivers can operate on future hardware implementations without modification.
- The ability to incorporate additional offloads in the future without modification to the architectural interface.

# 2.1 Architected Platform Data Mover

The basic SDXI architecture consists of some number of Smart Data accelerators that are enumerated as one or more SDXI functions. Each function may support 1 or more contexts with each context having a single descriptor ring. SR-IOV is optionally supported.





# 2.1.1 SDXI Descriptor Ring

An SDXI descriptor is a naturally aligned 64-byte entry that instructs an SDXI function to perform a given operation. Descriptors are placed in a circular ring that starts at a specified address (ds\_ring\_ptr). The ring is contiguous at the translation level configured for the SDXI function. The ring is configured to contain a given number of descriptor ring entries (ds\_ring\_sz). The number of bytes allocated in memory for the ring is: (ds\_ring\_sz \* 64).

A set of related operations form an operations group. An operation requires a particular descriptor format to indicate parameters for the operation.

The ring and all of its related system memory data structures comprise a "context". SDXI uses a 2-level hierarchy of context tables (Context Table Level 2, Context Table Level 1) to enumerate the components of the context. The concatenation of the offsets used to enumerate a context in both Context tables yields the 16-bit "context\_number" that is associated with the context. (Note that the Context Tables point to a context but are not themselves part of the context.)



#### Figure 2-3: SDXI Descriptor Ring

EntryAddress = ds\_ring\_ptr + ( (Index % ds\_ring\_sz) << 6 ) Write\_Index - Read\_Index <= ds\_ring\_sz

A circular ring requires "start" and "end" indicators. Rather than using memory pointers to track these, an SDXI descriptor ring uses 64-bit \*unsigned\* logical indices to indicate the start (Read\_Index) and end (Write\_Index) of the descriptor ring. This simplifies various calculations for SW and HW alike. The logical

indices need only be mapped onto descriptor ring addresses when writing or reading a ring entry at a given Index. All rings use the same mechanism to communicate between producer and consumer.

From the perspective of software, SDXI has two kinds of descriptor rings:

- A software producer ring. Software writes descriptors on to the ring, increments Write\_Index, and writes to the doorbell to signal the SDXI function that a new descriptor has been written. The SDXI function reads from the ring, increments Read\_Index, and performs the requested operation. With a few exceptions noted in the next list item, all rings are software producer rings.
- A software consumer ring. The SDXI function writes log messages (using the format of descriptors) on to the ring, increments Write\_Index, and can be configured to generate an interrupt to signal that a new message has been written. Software reads from the log ring, increments Read\_Index, and processes the requested message. The Error Log is an example of this kind of ring.

### 2.1.2 Virtualization Support

SDXI is architected specifically to allow operation within a virtualized environment.

Live migration is supported through a variety of accommodations in the programming model. During live migration, a real hardware SDXI implementation may be stopped by the Hypervisor. Once stopped, there is no hidden state. All critical state is located in either MMIO registers or in system memory. Another instance may be configured with this state and restarted. It is possible to arbitrarily transition between hardware and software emulated implementations.

The programming model allows a Hypervisor to hide features from newer implementations and specifications from a VM. This facilitates migration of a VM built around older versions of the specification onto newer hardware.

Specific use cases around virtualized environments may be accelerated by SDXI. See "2.3, Address Space Control Examples".

An assignable entity is a portion of an SDXI device which may be assigned to a VM. An individual SDXI Function, regardless of its device model, represents an assignable entity. For example, in a PCIe SR-IOV implementation of SDXI, each Physical Function (PF) and Virtual Function (VF) would represent separate SDXI Functions. In general, it is assumed that a Hypervisor will control SDXI PFs and allow SDXI VFs to be directly assigned into VMs.

# 2.2 DMA Addressing Modes

SDXI Functions generate DMA requests with address space identifiers that allow a platform-specific IOMMU to map the associated addresses. Depending on the overall configuration of the SDXI Function and the IOMMU, DMA addresses may be treated as Guest Virtual, Guest Physical or Host Physical for translation purposes.

SDXI provides granular control over the address space identifier used when accessing various SDXI memory data structures (ex. descriptor ring) and each data buffer. These controls allow a single SDXI operation to reference multiple address spaces. For example, an SDXI copy operation may copy data from one address space to another.

DMA requests may be generated with a PASID value, or without PASID as a part of the address space identifier. Additionally, SDXI utilizes the SDXI Function identifier as part of the address space identifier. For example, an SDXI Function constructed as a PCIe SR-IOV VF would use the VF's PCIe Requester ID as the Function identifier when making DMA requests to data structures owned by the VF.

### 2.2.1 Address Space Identifier Control

SDXI provides control over the address space identifier used when accessing various control structures in memory. Specifically, the PASID portion is controlled, depending on the structure, through either an mmio

register or a memory data structure (Context Level 1 Table Entry). The SDXI Function identifier used when accessing control structures is always that of the Function owning the control structures and executing the descriptor.

SDXI determines the address space of each data buffer using an AKey value provided in the descriptor, which is used to look up an AKey table entry in memory. An AKey table entry contains PASID controls which help form the address space identifier used to access the data buffer. Additionally, an AKey table entry contains a tgt\_sfunc value which also helps form the data buffer's address space identifier. The sfunc value is an opaque handle used to abstract the SDXI Function identifier.

Use of the tgt\_sfunc mechanism to access a data buffer owned by a different SDXI Function than the one executing the descriptor is referred to as a Cross-Function operation. Two or more SDXI Functions are involved. This mechanism may be used, for example, to perform a transfer of data from one VM to another.

Cross-Function operations utilize an additional RKey mechanism to authorize the data buffer access. (See "3.3, SDXI Cross-Function Access" for more details).

### 2.2.2 SDXI Function Groups

Cross-Function operations are only supported between SDXI Functions that reside within the same SDXI function group. An SDXI PF and its associated VFs always belong to the same function group. A function group may contain multiple PFs and VFs. A system may contain multiple disconnected function groups. Function groups are discovered using the MMIO\_GRP\_ENUM register (see "3.3.1, SDXI Function Group" for details). The method of communication between SDXI functions within a function group is outside the scope of this specification.

### 2.2.3 Unpinned Memory Access

SDXI supports the use of unpinned memory. For example this can be accomplished in an PCIe implementation through the use of the PCIe-defined ATS and PRI features. The SDXI programming model does not explicitly identify whether data structures reside in unpinned memory. If the PCIe ATS and PRI features are enabled within a function, an SDXI implementation must assume that any memory accesses associated with that function's address spaces may target unpinned memory.

# 2.3 Address Space Control Examples

The address space identifier controls in SDXI enable a wide range of capabilities for accessing data in memory. The following sub-sections describe several example usage models.

# 2.3.1 Single-Function, Single Address Space Example

In this example, a single SDXI Function is utilized. All control and data structures are located in the same address space. This may, for example, represent a case where SDXI is controlled by a kernel driver and is accessing kernel data structures within a VM or a bare metal OS.



Figure 2-4 Single Address Space Example

### 2.3.2 Single-Function, User Mode Access Example

In this example, control of the SDXI Function is split between a kernel mode driver and a user mode driver. The kernel mode driver manages function level control structures and a portion of each SDXI in kernel memory. The user mode driver manages descriptors in user space memory and accesses data buffers in the same user space.



Figure 2-5 User Mode Access Example

### 2.3.3 Single-Function, Multiple Address Space Example

Building on the prior example, this case moves one of the data buffers into a different address space. This may be used, for example, to copy data between different user spaces or between kernel and user space.



Figure 2-6 Multiple Address Space Transfer Example

# 2.3.4 Cross-Function Transfer Example

In this example there are multiple SDXI Functions. SDXI Function Cntl is mapped to VM0, SDXI Function Src is mapped to VM1 and SDXI Function Dst is mapped to VM2. SDXI Function Cntl is executing descriptors from a user process in VM0 and is instructed to perform a Cross-Function copy of data using Function Src to access the source buffer and Function Dst to access the destination buffer. Additionally, the address space controls point the source buffer to a user process in VM1 and the destination buffer to a user process in VM2. As with the prior examples, data buffers may be located in kernel or user space within each of the VMs. "*Figure 2-4 Single Address Space Example*" shows the example data flow where all of the SDXI functions reside within the same SDXI device. "*Figure 2-5 User Mode Access Example*" shows a similar example where the SDXI functions reside within 2 different SDXI devices.









# 2.4 Modularity and Expandability

SDXI is architected as a modular and expandable framework for offload functions. While the initial specification is focused on copy operations, additional offloads may be defined and added to newer versions of the specification, as well as newer implementations. All new features shall be explicitly discovered and enabled by supporting software. Implementations may choose to implement different sets of offload capabilities.

In general, all SDXI OS/VM and user-level software is expected to operate transparently on any hardware implementation that supports the same or newer version of the specification that the software was designed for, as long as all of the expected offload capabilities are present.

# 2.5 Endian Format Support

The SDXI specification is written assuming a little-endian architecture. Support for other endian formats is outside the scope of this specification.

# 3.1 Overview

The SDXI architecture is independent of the underlying I/O interconnect. However, the SDXI specification includes a PCIe binding and PCIe is frequently used for examples throughout this specification. The following figure depicts all of the memory-addressed data structures used by an SDXI function. In order to facilitate efficient software-based virtualization, the majority of an SDXI function's architectural state resides in system memory and is described in this chapter. The remaining state resides in a small number of MMIO control registers (described in *"9, MMIO Control Registers"*) and PCI Function configuration space registers (described in *"8, SDXI PCI-Express Device Architecture"*). This layout of memory structures is designed to facilitate selective trapping by privileged software.





Software shall ensure that the SDXI function has the required read and write access for each memory structure as shown in *"Table 3–1: Memory Structure* Summary".

| Structure                               | Req"d<br>SDXI FN<br>Access           | Alignment   | Maximum Structure Size                                                                                         | Entry<br>Size | PASID Used                                                                            |  |  |  |
|-----------------------------------------|--------------------------------------|-------------|----------------------------------------------------------------------------------------------------------------|---------------|---------------------------------------------------------------------------------------|--|--|--|
| CXT_L2 Table                            | R                                    | 4 KByte     | 4 KBytes                                                                                                       | 8<br>Bytes    |                                                                                       |  |  |  |
| CXT_L1 Table                            | R                                    | 4 KByte     | 4 KBytes                                                                                                       | 32<br>Bytes   | _                                                                                     |  |  |  |
| CXT_CTL                                 | R                                    | 64 Byte     | 64 Bytes                                                                                                       | n/a           |                                                                                       |  |  |  |
| Error Log                               | W                                    | 4-KByte     | Let sz = MMIO_CAP1.max_errlog_sz;<br>0 <= sz <= 9;<br>size = 2**(23 + sz)<br>max_size = <b>2**32 byte</b> s.   | 64<br>Bytes   | Derived from<br>MMIO_CTL0.<br>If fn_pasid_vI == 1,                                    |  |  |  |
| AKey Table                              | R                                    | 4 KByte     | Let sz = MMIO_CAP1.max_akey_sz;<br>0 <= sz <= 8;<br>size = 2**(12 + sz)<br>max_size = <b>2**20 bytes</b> .     | 16<br>Bytes   | used.                                                                                 |  |  |  |
| RKey Table                              | R                                    | 4 KByte     | Let sz = MMIO_RKEY.tbl_sz;<br>0 <= sz <= 8;<br>size = 2**(12 + sz)<br>max_size = <b>2**20 bytes</b> .          | 16<br>Bytes   |                                                                                       |  |  |  |
| Descriptor Ring                         | R/W                                  | 64 Byte     | Let sz = MMIO_CAP0.max_ds_ring_sz;<br>0 <= sz <= 22;<br>size = 2**(16 + sz)<br>max_size = <b>2**38 bytes</b> . | 64<br>Bytes   |                                                                                       |  |  |  |
| CXT_STS                                 | R/W                                  | 16 Byte     | 16 Bytes                                                                                                       | n/a           | -                                                                                     |  |  |  |
| (CXT_STS.)<br>Read_Index                | R/W                                  | 8 Byte      | 8 Bytes                                                                                                        | n/a           | Derived from<br>CXT_L1_ENT.                                                           |  |  |  |
| Write_Index                             | R                                    | 8 Byte      | 8 Bytes                                                                                                        | n/a           | cxt_pasid is used.                                                                    |  |  |  |
| Completion<br>Status Block<br>(CST_BLK) | R/W                                  | 32-Byte     | 32 Bytes                                                                                                       | n/a           |                                                                                       |  |  |  |
| Atomic Return<br>Data                   | R/W                                  | Operand siz | ze is 4 or 8 bytes. Naturally aligned.                                                                         | n/a           |                                                                                       |  |  |  |
| Data Buffer                             | R or R/W<br>(see<br>Note 1<br>below) | n/a         | Let sz = MMIO_CAP1.max_buffer;<br>0 <= sz <= 11;<br>size = 2**(21 + sz)<br>max_size = <b>2**32 bytes</b> .     | n/a           | Derived from<br>AKEY_ENT.<br>If pv == 1 AND<br>tgt_sfunc == 0,<br>then pasid is used. |  |  |  |

Table 3–1: Memory Structure Summary

Notes:

 The function must be given read and write access to any data buffer that must be written by an operation, such as destination data buffers for storing operation results. This is necessary because in some cases the function is required to explicitly flush writes to data buffers and some interconnect technologies require reads to the buffer for flushing. For data buffers that only need to be read by an operation, such as source data buffers for the operation, the function need only be given read access.

# 3.2 Context Data Structures

An SDXI context represents the memory structures needed to directly monitor or control the operation of a descriptor ring. An SDXI context consists of a descriptor ring, and the associated Context Control (CXT\_CTL), Context Status (CXT\_STS), AKey Table, and Write\_Index structures. Memory buffers, completion signals, and atomic results which descriptor ring entries operate upon are not considered part of the SDXI function context; however, they are an important part of the additional corresponding state that software manages directly in order to use the descriptor ring.

SDXI uses a 2-level hierarchy of context tables (Context Table Level 2, Context Table Level 1) to enumerate the components of the context. The concatenation of the 9-bit Level 2 offset with the 7-bit Level 1 offset yields the 16-bit "context\_number" that is associated with the context. (Note that the Context Tables point to a context but are not themselves part of the context.) MMIO\_CAP1.max\_cxt indicates the maximum context number supported by the function. Software may further reduce the available contexts by programming a smaller value into MMIO\_CTL2.max\_cxt.

An SDXI function shall not access portions of the context tables associated with context numbers greater than MMIO\_CTL2.max\_cxt. See the example below.



Figure 3-2: MMIO\_CTL2.max\_cxt Example

The SDXI specification uses 64-bit pointers. When data structures are required to be aligned to a certain size, the lower bits of the pointer may be used for other purposes. This may result in the pointer field of a data structure or register being less than 64-bits.

### 3.2.1 Context Level 2 Table

The Context Base Table register points to the 4KB level 2 table containing an array of level 2 table entries described below. Each of those entries is a pointer to a level 1 table. A level 2 table contains 512 level 1 pointers.



| Field         | Bits    | Subfield | Description                                                                                                                                  |
|---------------|---------|----------|----------------------------------------------------------------------------------------------------------------------------------------------|
| u64 lv01_ptr; | 000     | vl       | Valid. When 1, indicates the other bits in this data structure are valid.<br>When 0, all other bits in this data structure shall be ignored. |
|               | 011:001 | rsvd     | Shall be set to zero.                                                                                                                        |
|               | 063:012 | lv01_ptr | Pointer to the start of a Context Level 1 table. This points to an aligned 4K region of memory.                                              |

| Table 3–2: | Context Le | evel 2 Table | Entry (CXT | L2_ENT <sup>[^3]</sup> ) |
|------------|------------|--------------|------------|--------------------------|
|            |            |              |            | /                        |

### 3.2.2 Context Level 1 Table

Each of the level 1 table structures is a 4K naturally aligned piece of memory containing an array of context level 1 table entries described below. A Context level 1 table has 128 entries.

#### Figure 3-4: Context Level 1 Table Entry (CXT\_L1\_ENT)

|                   |       | -     |       |     |    |      |      |      |      |      |    |    |      |      |    |    |   |        |     |   |     |      |    |        |
|-------------------|-------|-------|-------|-----|----|------|------|------|------|------|----|----|------|------|----|----|---|--------|-----|---|-----|------|----|--------|
| 31 30 29 28 27 26 | 25 24 | 23 22 | 2 21  | 20  | 19 | 18 1 | 7 16 | 15   | 5 14 | 1 13 | 12 | 11 | 10   | 9    | 8  | 7  | 6 | 5      | 4   | 3 | 2   | 1    | 0  |        |
|                   |       |       |       |     |    |      | cxt_ | ctl_ | ptr  |      |    |    |      |      |    |    |   | i<br>L | rsv | _ | pv  | ka   | vl | + 0x00 |
|                   |       |       |       |     |    |      |      |      |      |      |    |    |      |      |    |    |   |        |     |   |     |      |    | + 0x04 |
|                   |       | ake   | ey_pt | tr  |    |      |      |      |      |      |    |    |      |      | rs | sv |   |        |     |   | ake | y_sz | 2  | + 0x08 |
|                   |       |       |       |     |    |      |      |      |      |      |    |    |      |      |    |    |   |        |     |   |     |      |    | + 0x0C |
| rsv               |       | max   | _buf  | fer |    |      |      |      |      |      |    | C  | xt_p | basi | d  |    |   |        |     |   |     |      |    | + 0x10 |
|                   |       |       |       |     |    | 0    | pb_0 | 00_  | enl  | C    |    |    |      |      |    |    |   |        |     |   |     |      |    | + 0x14 |
|                   |       |       |       |     |    |      | r    | sv   |      |      |    |    |      |      |    |    |   |        |     |   |     |      |    | + 0x18 |
|                   |       |       |       |     |    |      |      |      |      |      |    |    |      |      |    |    |   |        |     |   |     |      |    | + 0x1C |

| Field            | Bits    | Subfield    | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |  |  |  |
|------------------|---------|-------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|--|
| u64 cxt_ctl_ptr; | 000     | vl          | Valid. When 1, indicates the other bits in this data structure are valid. When 0, all other bits in this data structure shall be ignored.                                                                                                                                                                                                                                                                                                                                                |  |  |  |
|                  | 001     | ka          | Keep-Active-Hint. Hint to the SDXI function. Does not affect<br>context state.<br>When "1", the function operates the context normally and should<br>execute new descriptors. This should be set to "1" by privileged<br>software when it starts the context.<br>When "0", the function may choose to stop executing new<br>descriptors in anticipation that software will soon stop the<br>context. This may reduce the time that software must wait for<br>stopping the context later. |  |  |  |
|                  | 002     | pv          | PASID Valid. Indicates the cxt_pasid field is valid.                                                                                                                                                                                                                                                                                                                                                                                                                                     |  |  |  |
|                  | 005:003 | rsvd        | Shall be set to zero.                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |  |  |  |
|                  | 063:006 | cxt_ctl_ptr | Pointer to the Context Control (CXT_CTL). This points to a 64B aligned region of memory.                                                                                                                                                                                                                                                                                                                                                                                                 |  |  |  |
| u64 akey_ptr;    | 067:064 | akey_sz     | AKey Table Size. Indicates the size of the AKey table reference<br>by akey_ptr. The table size is encoded as 2**(akey_sz+12) by<br>or 2**(akey_sz+8) entries. Encodings greater than 0x8 are<br>reserved.                                                                                                                                                                                                                                                                                |  |  |  |
|                  | 075:068 | rsvd        | Shall be set to zero.                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |  |  |  |
|                  | 127:076 | akey_ptr    | Pointer to the start of an AKey table. This points to a 4Kbyte aligned region of memory.                                                                                                                                                                                                                                                                                                                                                                                                 |  |  |  |
| u32 misc0;       | 147:128 | cxt_pasid   | If pv=1, indicates the PASID value used to access the<br>Write_Index, CXT_STS, CST_BLK, Atomic Return Data, and<br>Descriptor Ring.<br>If pv=0, this field is not used and the Write_Index, CXT_STS,<br>CST_BLK and Descriptor Ring are accessed without a PASID.                                                                                                                                                                                                                        |  |  |  |
|                  | 151:148 | max_buffer  | Indicates the maximum data buffer size supported by this context. The size is encoded as 2**(max_buffer+21). Descriptor type or the size of the buffer length field within certain types of descriptors may further limit the maximum data buffer size for certain types of operations.<br>This field should not be set to exceed MMIO_CTL2.max_buffer                                                                                                                                   |  |  |  |
|                  | 159:152 | rsvd        | Shall be set to zero.                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |  |  |  |
| u32 opb_000_enb; | 191:160 |             | Each bit in this field, when set to "1", indicates that a certain descriptor operation group is enabled within this context; when "0", it is not. The encoding of the bits matches the MMIO_CAP1.opb_000_cap register. See "5.1, Descriptor Operations" and "Chapter 6, SDXI Descriptor and Operation Specification" for more details.                                                                                                                                                   |  |  |  |
| u8 rsvd 0[8];    | 255:192 |             | Shall be set to zero.                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |  |  |  |

Table 3–3: Context Level 1 Table Entry (CXT\_L1\_ENT<sup>[^3]</sup>)

# 3.2.3 Context Control (CXT\_CTL)

The Context Control contains control information for a single descriptor ring.

Software shall expose the memory containing the Context Control as readable to the SDXI function hosting the context. Refer to *"Table 3–1: Memory Structure* Summary" for further requirements on how software shall expose the structures pointed to by CXT\_CTL to the SDXI function. This structure is intended to be controlled by privileged software.

| $\bullet$ $\cdot$ $=$ $i$                                                             | _      |
|---------------------------------------------------------------------------------------|--------|
| 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 |        |
| ds_ring_ptr csa_se_qos_rsv_vl                                                         | + 0x00 |
|                                                                                       | + 0x04 |
| ds_ring_sz                                                                            | + 0x08 |
| rsv                                                                                   | + 0x0C |
| cxt_sts_ptr rsv                                                                       | + 0x10 |
|                                                                                       | + 0x14 |
| write_index_ptr rsv                                                                   | + 0x18 |
|                                                                                       | + 0x1C |
| rsv                                                                                   | + 0x20 |
|                                                                                       | + 0x24 |
|                                                                                       | + 0x28 |
|                                                                                       | + 0x2C |
|                                                                                       | + 0x30 |
|                                                                                       | + 0x34 |
|                                                                                       | + 0x38 |
|                                                                                       | + 0x3C |

#### Figure 3-5: Context Control (CXT\_CTL)

| Field                | Bits    | Subfield        | Descrip                                                                                                                                   | ption                                                                                                                                                                                           |                                                                                                                                                                                                                    |                                                               |  |  |  |
|----------------------|---------|-----------------|-------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------|--|--|--|
| u64 ds_ring_ptr;     | 000     | vl              | Valid. When 1, indicates the other bits in this data structure are valid. When 0, all other bits in this data structure shall be ignored. |                                                                                                                                                                                                 |                                                                                                                                                                                                                    |                                                               |  |  |  |
|                      | 001     | rsvd            | Shall be                                                                                                                                  | e set to zero.                                                                                                                                                                                  |                                                                                                                                                                                                                    |                                                               |  |  |  |
|                      | 003:002 | qos             | Quality implem                                                                                                                            | of service indicator. T<br>entation specific.                                                                                                                                                   | The usage of this field is                                                                                                                                                                                         |                                                               |  |  |  |
|                      |         |                 |                                                                                                                                           | Value                                                                                                                                                                                           | Definition                                                                                                                                                                                                         | ]                                                             |  |  |  |
|                      |         |                 |                                                                                                                                           | 00b                                                                                                                                                                                             | Urgent                                                                                                                                                                                                             | ]                                                             |  |  |  |
|                      |         |                 |                                                                                                                                           | 01b                                                                                                                                                                                             | High                                                                                                                                                                                                               |                                                               |  |  |  |
|                      |         |                 |                                                                                                                                           | 10b                                                                                                                                                                                             | Medium                                                                                                                                                                                                             |                                                               |  |  |  |
|                      |         |                 |                                                                                                                                           | 11b                                                                                                                                                                                             | Low                                                                                                                                                                                                                |                                                               |  |  |  |
|                      | 004     | se              | Sequer<br>set the                                                                                                                         | ntial Consistency Hint<br>S bit in most descript                                                                                                                                                | . 1=Descriptor ring is expectors.                                                                                                                                                                                  | ted to                                                        |  |  |  |
|                      | 005     | csa             | Complete<br>Status I<br>SDXI fu<br>0 = 1<br>1 =                                                                                           | etion-Status Mode Ava<br>Modes" for details. Th<br>unction takes no actio<br>Both atomic completio<br>completion-status mo<br>The context shall only<br>mode.                                   | ailability. See <i>"4.4.2, Comp</i><br>nis field is informational only<br>n on it.<br>on-status mode and simple<br>de are available to the cont<br>/ use simple completion-sta                                     | <i>letion-</i><br>/; the<br>text.<br>atus                     |  |  |  |
|                      | 063:006 | ds_ring_ptr     | Pointer<br>byte ali                                                                                                                       | to the start of the des<br>gned region of memo                                                                                                                                                  | scriptor ring. This points to a<br>ry                                                                                                                                                                              | a 64-                                                         |  |  |  |
| u32 ds_ring_sz;      | 095:064 |                 | Indicate<br>can be<br>ring in b<br>is illega<br>0xFFFF<br>Softwar<br>- Let ma<br>- Then                                                   | es the maximum unsig<br>placed in the descript<br>pytes is calculated as<br>al – a ring must have a<br>F_FFFF indicates a m<br>re shall ensure that:<br>ax_ds = 2**(MMIO_0<br>CXT_CTL.ds_ring_s | gned number of descriptors<br>tor ring. The size of the des<br>(ds_ring_sz * 64). A value<br>at least one descriptor. A va<br>naximum of (2**32)-1 descri<br>CAP0.max_ds_ring_sz + 10<br>z <= min( (2**32)-1, max_ | that<br>criptor<br>of zero<br>alue of<br>ptors.<br>0)<br>ds ) |  |  |  |
| u8 rsvd_0[4];        | 127:096 |                 | Shall be                                                                                                                                  | e set to zero.                                                                                                                                                                                  |                                                                                                                                                                                                                    |                                                               |  |  |  |
| u64 cxt_sts_ptr;     | 131:128 | rsvd            | Shall be                                                                                                                                  | e set to zero.                                                                                                                                                                                  |                                                                                                                                                                                                                    |                                                               |  |  |  |
|                      | 191:132 | cxt_sts_ptr     | Pointer<br>includes<br>region o                                                                                                           | to the Context Status<br>s the read index value<br>of memory.                                                                                                                                   | s (CXT_STS) data structure<br>e. This points to a 16B aligr                                                                                                                                                        | which<br>1ed                                                  |  |  |  |
| u64 write_index_ptr; | 194:192 | rsvd            | Shall be                                                                                                                                  | e set to zero.                                                                                                                                                                                  |                                                                                                                                                                                                                    |                                                               |  |  |  |
|                      | 255:195 | write_index_ptr | Pointer<br>aligned                                                                                                                        | to descriptor ring writ<br>region of memory.                                                                                                                                                    | te index. This points to an 8                                                                                                                                                                                      | B                                                             |  |  |  |
| u8 rsvd_1[32];       | 511:256 |                 | Shall be                                                                                                                                  | e set to zero.                                                                                                                                                                                  |                                                                                                                                                                                                                    |                                                               |  |  |  |

#### Table 3–4: Context Control (CXT\_CTL<sup>[^3]</sup>)

# 3.2.4 Context Status (CXT\_STS)

The Context Status contains status information for a single descriptor Context. Privileged software may expose it to non-privileged software as a read-only structure. When creating the context, software shall initialize the status entry to all-zero prior to making the context valid. Software shall expose the memory containing the Context Status as read-write to the SDXI function using the context address space.

| Figure 3-6: Context Status (CXT_STS) |                        |          |                  |    |     |   |   |        |     |        |  |
|--------------------------------------|------------------------|----------|------------------|----|-----|---|---|--------|-----|--------|--|
| 31 30 29 28 27 26 25 2               | 4 23 22 21 20 19 18 17 | 16 15 14 | 13 12 11 10      | 98 | 76  | 5 | 4 | 3      | 2 1 | L 0    |  |
|                                      |                        | rsv      | <mark>rsh</mark> | I  | rsv |   |   | state  | 5   | + 0x00 |  |
|                                      |                        |          |                  |    |     |   |   |        |     | + 0x04 |  |
| read_index                           |                        |          |                  |    |     |   |   | + 0x08 |     |        |  |
| +                                    |                        |          |                  |    |     |   |   | + 0x0C |     |        |  |

| Field           | Bits    | Subfield | Description                                                                                                                                                                                                                                                                                                                                                                                                     |  |  |  |  |
|-----------------|---------|----------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|--|--|
| u8 state;       | 003:000 | state    | Context State. See Table <i>"Table 3–6: CXT_STS.state Encoding"</i> for encodings. See section <i>"4.3.1"</i> for when this field may be modified.                                                                                                                                                                                                                                                              |  |  |  |  |
|                 | 007:004 | rsvd     | This reserved field may be read and written by the SDXI function when it accesses bits [3:0] of the same byte.                                                                                                                                                                                                                                                                                                  |  |  |  |  |
| u8 misc0;       | 008     | rsh      | This hint bit is written only by privileged software to indicate to other software if new descriptors should (rsh = 1) or should not (rsh = 0) be submitted to the context when the ContextState is CXTV_STOPG_SW or CXTV_STOP_SW. See "4.3.6, Context Ring Submission Hint" for more details. The SDXI function shall not use nor write this bit. Software shall write this bit with a byte-sized access only. |  |  |  |  |
|                 | 015:009 | rsvd     | Shall be set to zero.                                                                                                                                                                                                                                                                                                                                                                                           |  |  |  |  |
| u8 rsvd0_[6];   | 063:016 |          | Shall be set to zero.                                                                                                                                                                                                                                                                                                                                                                                           |  |  |  |  |
| u64 read_index; | 127:064 |          | Descriptor ring read index.                                                                                                                                                                                                                                                                                                                                                                                     |  |  |  |  |

#### Table 3–5: Context Status (CXT\_STS<sup>[^3]</sup>)

#### Table 3–6: CXT\_STS.state Encoding

| Value          | Definition                                                                                                                                                                                                                                                                 |
|----------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 0000b          | <b>CXTV_STOP_SW</b> : Stopped by software using a local-function DSC_CXT_STOP operation on the context.                                                                                                                                                                    |
| 0001b          | CXTV_RUN : Running.                                                                                                                                                                                                                                                        |
| 0010b          | <b>CXTV_STOPG_SW</b> : Stopping in response to software using a local-function DSC_CXT_STOP operation on the context.                                                                                                                                                      |
| 0100b          | <b>CXTV_STOP_FN</b> : Stopped either by a transition of the function state from GSV_ACTIVE, or a (PF-to-VF) P2V.DSC_CXT_STOP operation.                                                                                                                                    |
| 0110b          | <b>CXTV_STOPG_FN</b> : Stopping in response to the function or a P2V.DSC_CXT_STOP operation.<br>The function stopping causes are: an error detected in the function; an error in context processing or execution; or a transition of the function state out of GSV_ACTIVE. |
| 1111b          | <b>CXTV_ERR_FN</b> : Stopped in response to an error detected in the function, an error in context processing, or an error in context execution.                                                                                                                           |
| All other valu | ies reserved.                                                                                                                                                                                                                                                              |

# 3.2.5 Access Key (AKey) Table Entry (AKEY\_ENT)

The AKey table is a 4-Kbyte aligned contiguous memory structure composed of AKey table entries. The table may be any power-of-2 size from 4-Kbytes up to 1-Mbyte. Software controls the size of the table through the context level 1 entry structure.

A descriptor may reference any AKey associated with the same ring context. The AKey table entries encode all of the valid address spaces, PASIDs and interrupts available to the context.

The AKey entry's tgt\_sfunc field specifies the target function within the SDXI function group which owns data buffers and interrupts associated with the AKey table entry. The tgt\_sfunc field is set to the target function's value of MMIO\_CAP0.sfunc. For PCIe implementations, the tgt\_sfunc field is an opaque identifier for the target SDXI function which is used when accessing the data buffer and issuing MSI or MSI-X interrupts.

The tgt\_sfunc encoding of 0 is used to indicate the target resource belongs to the same function executing the descriptor. Only the 0 encoding may be used to access local resources.

A non-zero tgt\_sfunc value indicates that a remote function owns the target resource. See "3.3.1, SDXI *Function Group*" for more details.

|                                     | 3             |         | _          |      |     |   |     |    |    |    |        |
|-------------------------------------|---------------|---------|------------|------|-----|---|-----|----|----|----|--------|
| 31 30 29 28 27 26 25 24 23 22 21 20 | 19 18 17 16 1 | 5 14 13 | 12 11 10 9 | 8 7  | 6 5 | 4 | 3   | 2  | 1  | 0  |        |
| tgt_sfunc                           | rs            | 6V      | intr_n     | um   |     |   | ste | pv | iv | vl | + 0x00 |
| ph rsv                              |               |         | pasid      |      |     |   |     |    |    |    | + 0x04 |
| rsv                                 |               |         |            | stag |     |   |     |    |    |    | + 0x08 |
| rsv                                 |               |         |            | rkey |     |   |     |    |    |    | + 0x0C |

#### Figure 3-7: AKEY\_ENT

| Field          | Bits    | Subfield | Description                                                                                                                                                                                                                                                                                                                                                       |
|----------------|---------|----------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| u16 intr_num;  | 000     | vl       | Valid. When 1, indicates the other bits in this data structure are valid.<br>When 0, all other bits in this data structure shall be ignored.                                                                                                                                                                                                                      |
|                | 001     | iv       | Interrupt Valid. When 1 and tgt_sfunc = 0, the intr_num field is valid. When tgt_sfunc $\neq$ 0 this bit is reserved and must be set to 0                                                                                                                                                                                                                         |
|                | 002     | pv       | PASID Valid. When 1 and tgt_sfunc = 0, the PASID field contains valid information. When tgt_sfunc $\neq$ 0 this bit is reserved and must be set to 0                                                                                                                                                                                                              |
|                | 003     | ste      | Steering Enable. When 1 and tgt_sfunc = 0, memory requests<br>referencing this AKey table entry are enabled to include Data Steering<br>Hint (DSH) information when requested through the descriptor<br>attr.coh_ctl field.<br>When 0 and tgt_sfunc = 0, DSH is disabled for memory requests<br>referencing this AKey table entry even when requested through the |
|                |         |          | descriptor attr.coh_ctl field.<br>When tgt_sfunc ≠ 0 this bit is reserved and must be set to 0.<br>See "3.6, Data Steering Hints (DSH)" for more details.                                                                                                                                                                                                         |
|                | 014:004 | intr_num | Interrupts generated using this AKey table entry are issued using the MSI or MSI-X entry corresponding to intr_num. When tgt_sfunc ≠ 0 this field is reserved, and the Target Function's RKey entry specifies which MSI or MSI-X entry to use                                                                                                                     |
|                | 015     | rsvd     | Shall be set to zero.                                                                                                                                                                                                                                                                                                                                             |
| u16 tgt_sfunc; | 031:016 |          | tgt_sfunc, see above discussion.                                                                                                                                                                                                                                                                                                                                  |
| u32 pasid;     | 051:032 | pasid    | PASID value used for requests using this AKey table entry.                                                                                                                                                                                                                                                                                                        |
|                | 061:052 | rsvd     | Shall be set to zero.                                                                                                                                                                                                                                                                                                                                             |
|                | 063:062 | ph       | Processing Hint. When ste is "1", tgt_sfunc = 0, this field supplies information used as part of DSH.<br>When tgt_sfunc $\neq$ 0 this field is reserved and must be set to 0. The Target Function's RKey table entry is used to supply DSH information.<br>See "3.6, Data Steering Hints (DSH)" for more details.                                                 |
| u16 stag;      | 079:064 |          | Steering Tag. When ste is "1", tgt_sfunc = 0, this field supplies<br>information used as part of DSH.<br>When tgt_sfunc $\neq$ 0 this field is reserved and must be set to 0. The<br>Target Function's RKey table entry is used to supply DSH information.<br>See "3.6, Data Steering Hints (DSH)" for more details.                                              |
| u8 rsvd_0[2];  | 095:080 |          | Shall be set to zero.                                                                                                                                                                                                                                                                                                                                             |
| u16 rkey;      | 111:096 |          | Specifies the RKey value used to access another function's data buffer or interrupt. This field is only valid if tgt_sfunc is non-zero. See above discussion for more details.                                                                                                                                                                                    |
| u8 rsvd_1[2];  | 127:112 |          | Shall be set to zero.                                                                                                                                                                                                                                                                                                                                             |

Table 3–7: AKey Table Entry (AKEY\_ENT<sup>[^3]</sup>)

# 3.3 SDXI Cross-Function Access

There can be multiple interacting SDXI functions in a system if supported by the SDXI implementation. Typically, a PCIe function can only access address spaces mapped using its own Requester ID. A unique capability of an SDXI function is the optional ability to access address spaces mapped by the Requestor ID of another SDXI function.

When executing a descriptor operation, the SDXI function determines the address space for each specified data buffer or interrupt target by using the associated AKey value to reference an AKey table entry. This entry provides the tgt\_sfunc which identifies another SDXI function within the same SDXI Function Group. When the tgt\_sfunc field is non-zero, the target resource belongs to a remote function whose MMIO\_CAP0.sfunc register value matches tgt\_sfunc.

The Receiver Access Key (RKey) mechanism described in "3.3.2, Receiver Access Key (RKey) Table" provides access control over cross-function requests.

### 3.3.1 SDXI Function Group

When an SDXI Function returns MMIO\_CAP1.rkey\_cap as "1", the function supports SDXI cross-function access and the RKey protection mechanism; otherwise, these mechanisms are not supported. Note that SDXI cross-function access when supported is only possible between SDXI functions that belong to the same SDXI Function Group. An SDXI PF and its associated VFs always belong to the same SDXI function group may contain multiple PFs and VFs. A system may contain multiple disjoint SDXI function groups. Software may enumerate SDXI Function groups using the MMIO\_GRP\_ENUM register in each SDXI function; this mechanism only supports a single thread of enumeration. An example of the enumeration flow is given in *"Figure 3-8: Function Group Enumeration Example"*. Further methods of coordination and configuration between SDXI functions within an SDXI function group are outside the scope of this specification.

In a cross-function implementation of SDXI, all functions connected within an SDXI function group reflect the same MMIO\_GRP\_ENUM.probe value written into any group member's copy of this register. Provided that software first initializes all probe fields to "0", software may identify all functions within a function group by writing "1" into MMIO\_GRP\_ENUM.probe field in one PF, determining that the write has propagated, and then scanning for the same value in other SDXI functions. Software may assign an ID to each function group and record it in the MMIO\_CTL0.fn\_grp\_id field of each associated PF separately for later use; the fn\_grp\_id field does not propagate among functions. In a VF, the field is read-only and reflects the value of the associated PF.

When the MMIO\_GRP\_ENUM register is written in one PF, the value of its "probe" field shall subsequently be propagated and reflected in the MMIO\_GRP\_ENUM.probe field of all other PFs within the same function group within an implementation-specific period of time. Until the "probe" field fully propagates, reading of the the probe field in any other function within the group may return the old or new value of the field.

The SDXI function provides the MMIO\_GRP\_ENUM.busy field to determine write propagation. Software shall write the "busy" field to a 1 when writing to the MMIO\_GRP\_ENUM register. An SDXI function may ignore propagating the probe field if the busy field is written as "0"; software shall avoid and not rely upon such behavior. The SDXI function shall clear the "busy" field of that PF when the write of the "probe" field has fully propagated to all other functions within the function group. Software determines the write has propagated by looping on the value of that PF's "busy" field until it is read as "0". The "busy" field itself does not propagate.

When software writes MMIO\_GRP\_ENUM in one PF, it should not write the same register in any other SDXI function until the first write fully propagates.

The MMIO\_GRP\_ENUM register is only used to identify functions within the same SDXI function group. It is not referenced by other SDXI registers or data structures. In a VF, the field is read-only and reflects the value of the associated PF.

#### Figure 3-8: Function Group Enumeration Example

/\* SDXI Fn Group Enumeration Example Pseudo-code

```
Let PF LIST be an array of fn struct, one for each PF. Let VF_LIST be an array of fn_struct, one for each VF.
  This can be determined for each SDXI using MMIO CAP0.vf. In practice, a real OS would use something
  different. Let each fn struct have the fn's requestor id, mmio base ptr, pf flag (if pf then 1 else 0), and the SW
  assigned grp id.
  Let read & write of a fn's mmio space be done by the functions: get fn mmio(fn struct, offset) -- returns the
  location value; and set_fn_mmio(fn_struct, offset, value) -- writes the location.
  Let ID LIST contain a list of all known GRP ID values. Let clear ids(id list) clear the list and add id(id list, id)
  add to the list. Define get new id(id, id list) to return a new unique 32-bit grp id.
*/
set_grp_enum(fn, probe){
   // Good Hygiene: Wait for probe-propagating to be false
   while (get_fn_mmio(fn, MMIO_GRP_ENUM) & 1);
   set_fn_mmio(fn, MMIO_GRP_ENUM, (probe << 1) | 1);</pre>
   // Wait for probe-propagating to be false
   while (get_fn_mmio(fn, MMIO_GRP_ENUM) & 1);
}
set_grp_id(fn, id){
   id = (get_fn_mmio(fn, MMIO_CTL0) & 0xFFFF_FFFF) | (id << 32);</pre>
   set_fn_mmio(fn, MMIO_CTL0, id);
}
get_grp_id(fn){
   return get_fn_mmio(fn, MMIO_CTL0) >> 32;
}
// SO: Initialize PFs and tracking info
clear_ids(ID_LIST);
for (i = 0; i < NUMBER_OF_PF; i++)
   pf = PF_LIST[i];
   pf.grp_id = 0;
   set_grp_id(pf, 0);
   set_grp_enum(pf, 0);
}
 // S1: Init VF data and add Grp_IDs from VFs not of PFs in PF_LIST
for (i = 0; i < NUMBER_OF_VF; i++){
   vf = VF_LIST[i];
   vf.grp_id = get_grp_id(vf);
   if (vf.grp_id != 0){ add_id(ID_LIST, vf.grp_id); }
```

}

```
// S2: Probe through all PFs and assign GRP IDs
for (i = 0; i < NUMBER_OF_PF; i++){</pre>
   p\hat{f}0 = PF_LIST[i];
   if (pf0.grp_id != 0){ continue; } // pf already enumerated
   // S3: pf is not yet enumerated, assign new grp_id & probe
   pf0.grp_id = get_new_id(ID_LIST);
   set_grp_id(pf0, pf0.grp_id);
   set_grp_enum(pf0, 1);
   // S4: Probe remaining PFs for FN Grp membership
   for (j = i+1; j < NUMBER_OF_PF; j++){
      pf1 = PF_LIST[j];
      if (pf1.grp_id != 0) { continue; } // Already enumerated
         ' S5: If probe detected, must be part of the current group
      if ( get_fn_mmio(pf1, MMIO_GRP_ENUM) & Ob10 ){
          pf1.grp_id = pf0.grp_id;
         set_grp_id(pf1, pf0.grp_id);
      }
   }
   // S6: This group is probed, clear the probe bit
set_grp_enum(pf0, 0);
}
// S7: Scan for VFs of PFs in PF_LIST and record grp_id
for (i = 0; i < NUMBER_OF_VF; i++){
   vf = VF_LIST[i];
   if ( vf.grp_id == 0){ vf.grp_id = get_grp_id(vf); }
}
```

# 3.3.2 Receiver Access Key (RKey) Table

A (local) SDXI function uses RKey table entries to control remote requesting functions' access to local function resources such as memory and interrupts. Software may use RKey table entries in combination with IOMMU page tables to selectively expose data buffers and interrupts to different requesting functions.

The RKey table is a 4-Kbyte aligned contiguous memory structure composed of RKey table entries. The table may be any power-of-2 size from 4-Kbytes up to 1 Mbyte. Software controls the size of the table through MMIO\_RKEY.tbl\_sz.

A connection manager specification is proposed for development by the SNIA SDXI TWG post publication of Smart Data Accelerator Interface ("SDXI") Specification v1.0. Please refer to it for mechanisms to allocate, configure, and exchange RKeys between functions.
## 3.3.3 RKey Table Entry

Each RKey table entry is an aligned 16-byte structure with the following format.

| Figure 3-10: RKey Table Entry       |             |                           |  |      |      |     |    |   |   |   |     |    |    |    |        |        |
|-------------------------------------|-------------|---------------------------|--|------|------|-----|----|---|---|---|-----|----|----|----|--------|--------|
| 31 30 29 28 27 26 25 24 23 22 21 20 | 19 18 17 16 | 19 18 17 16 15 14 13 12 1 |  |      |      | 9   | 8  | 7 | 6 | 5 | 4   | 3  | 2  | 1  | 0      |        |
| req_sfunc                           | rsv         |                           |  | intr | r_nu | ım  |    |   |   |   | ste | pv | iv | vl | + 0x00 |        |
| ph rsv                              |             |                           |  | pas  | sid  |     |    |   |   |   |     |    |    |    | + 0x04 |        |
| rsv                                 |             |                           |  |      |      | sta | ag |   |   |   |     |    |    |    | + 0x08 |        |
|                                     |             |                           |  |      |      |     |    |   |   |   |     |    |    |    |        | + 0x0C |

| Field          | Bits    | Subfield | Description                                                                                                                                                                                                                                                                                                                                                                     |
|----------------|---------|----------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| u16 intr_num;  | 000     | VI       | Valid. When 1, indicates the other bits in this data structure are valid. When 0, all other bits in this data structure shall be ignored and remote requesting functions may not use the RKey value associated with this entry to access data buffers or issue interrupts owned by this function.                                                                               |
|                | 001     | iv       | Interrupt Valid. When 1, Requesting functions referencing this RKey table entry are permitted to generate interrupt requests via this function. When 0, interrupt requests generated by referencing this RKey table entry are aborted.                                                                                                                                          |
|                | 002     | pv       | PASID Valid. When 1, the PASID field contains valid information.                                                                                                                                                                                                                                                                                                                |
|                | 003     | ste      | Steering Enable. When 1, memory requests referencing this RKey table entry are enabled to include DSH information when requested through the descriptor attr.coh_ctl field. When 0, DSH is disabled for memory requests referencing this RKey table entry even when requested through the descriptor attr.coh_ctl field. See "3.6, Data Steering Hints (DSH)" for more details. |
|                | 014:004 | intr_num | Interrupts generated using this RKey table entry are issued using<br>the target function's MSI or MSI-X entry corresponding to<br>intr_num. When valid, intr_num may be used as part of PCIe<br>TPH. See "8.3, Mapping SDXI DSH to PCIe TLP Processing Hints<br>(PCIe TPH)" for more details.                                                                                   |
|                | 015     | rsvd     | Shall be set to 0.                                                                                                                                                                                                                                                                                                                                                              |
| u16 req_sfunc; | 031:016 |          | req_sfunc specifies the "sfunc" value of the remote requesting function expected to reference this RKey table entry.                                                                                                                                                                                                                                                            |
| u32 pasid;     | 051:032 | pasid    | PASID value used to access data buffers using this RKey entry                                                                                                                                                                                                                                                                                                                   |
|                | 061:052 | rsvd     | Shall be set to zero.                                                                                                                                                                                                                                                                                                                                                           |
|                | 063:062 | ph       | Processing Hint. When "ste" is "1", this field supplies information used as part of DSH. When "ste" is "0", this field is reserved. See <i>"3.6, Data Steering Hints (DSH)"</i> for more details.                                                                                                                                                                               |
| u16 stag;      | 079:064 |          | Steering Tag. When ste is "1", this field supplies information used as part of DSH. When "ste" is "0", this field is reserved. See "3.6, <i>Data Steering Hints (DSH)</i> " for more details.                                                                                                                                                                                   |
| u8 rsvd_0[6];  | 127:080 |          | Shall be set to zero.                                                                                                                                                                                                                                                                                                                                                           |

#### Table 3–8: RKey Table Entry (RKEY\_ENT<sup>[^3]</sup>)

## 3.3.4 Receiver Access Key (RKey) Processing

When an operation's AKey table entry specifies a "tgt\_sfunc" value of zero (i.e. the access is within a function), RKey processing for that access or interrupt is skipped. When "tgt\_sfunc" is non-zero (i.e. the access is Cross-Function), RKey processing is attempted using the following checks below.

- 1. If the Target function does not belong to the same function group as the requesting function, abort the remote access or interrupt.
- 2. Requesting function passes its own MMIO\_CAP0.sfunc value along with the AKey table entry's RKey value to the target function indicated by "tgt\_sfunc".
- 3. If the target function has MMIO\_CAP1.rkey\_cap as "0", abort the remote access or interrupt.
- 4. Target function uses the supplied RKey value as an index into its RKey table to obtain the associated RKey table entry. If the RKey table is not enabled (MMIO\_RKEY.tbl\_en is "0"), or the RKey table entry is not valid, abort the remote access or interrupt.
- 5. Target function checks if the RKey table entry req\_sfunc matches the requester "sfunc". If not, abort the remote access or interrupt.
- 6. For an interrupt operation, the target function checks if the RKey table entry has "iv" as "1". If not, abort the remote interrupt. When "iv" is "1", the intr\_num field in the RKey table entry specifies which MSI or MSI-X index in the target function is used.
- 7. For a memory access operation, if the RKey table entry has "pv" as "0", the associated memory accesses are issued by the target function with no PASID applied. When "pv" is "1", the associated memory accesses are issued by the target function with a PASID applied using the "pasid" field from the RKey table entry.
- 8. For a memory access operation, if the RKey table entry has "se" as "0", the associated memory accesses are issued with no SteeringTag or PH applied. When "se" is "1", a SteeringTag and PH are applied to the associated memory accesses using the "stag" and "ph" fields in the RKey table entry.

The requesting SDXI function will log failed remote accesses as Data Buffer errors (ERRV\_DSC\_BUF) regardless of the reason for the failure. The requesting SDXI function may use any appropriate value for the error sub-step, including 0 ('Other'). Data Buffer failures will stop the requesting context. Examples of such failures include communication errors between the requester and target, the target being off-line, access rights failures at the target, address translation errors at the target, or data access errors at the target.

The target SDXI function will not log problems with received requests such as: illegal or disabled RKey indexes; mismatched sfunc values; or attempted accesses while the function or RKey processing is disabled. The target SDXI function also will not log issues encountered while accessing data buffers on behalf of a remote requester such as address translation, poison consumption, or time out errors. Instead, these issues are logged in the requesting function.

If a target SDXI function encounters the following type of error trying to access its own RKey table then the SDXI function will log a "ERRV\_FN\_RKEY" error and continue execution: an address translation error; poisoned RKey table entry; and illegal data within an Rkey entry with the "vl" bit set to 1. If an SDXI remote request is rejected due to the RKey lookup failure, the requestor will also log the failure as a ERRV\_DSC\_BUF, and will halt the impacted context.

# 3.4 Error Log

Whenever the SDXI function detects a function-wide or context-specific error, it performs specific actions to contain the error, reports it to software through an entry in the function's Error Log message ring, and signals an error in an associated descriptor's completion status block if relevant. A summary of the type of errors, their logging, and the associated error containment actions follows.

- 1. *Invalid Context (Invalid:Cxt):* the function has evaluated a context that is in CXTV\_INVALID state. The function shall skip further operation on the context. Unless explicitly required by an action or operation to suppress or signal an error, an SDXI implementation should suppress signaling errors for Invalid:Cxt. See "4.2.1, S1: CXTV\_INVALID State" for more details.
- 2. Logging A Context Error (LogErr:Cxt): the function generates an error log entry when it detects a context error which can include DescrErr:Cxt and SignalErr:Cxt errors. After the error log entry is globally visible, the function shall signal an interrupt if MMIO\_ERR\_CTL.intr\_en is set. This action can start in any order or in parallel with DescrStatus:Cxt, SignalErr:Cxt, and StopErr:Cxt but shall complete before StopErr:Cxt does. See "



Figure 3-11: Context Descriptor Or Operation Error Flow" for an overall flow of the error handling.

3. **Updating CST\_BLK.er (DescrErr:Cxt):** If the CST\_BLK is enabled for a descriptor ("np" is "0") and a descriptor error is detected in processing or executing a descriptor, then CST\_BLK.er is set and made globally visible before the function updates CST\_BLK.signal. (Note, errors encountered in updating CST\_BLK for this action do not start any additional actions.) This action can start in any order or in parallel with LogErr:Cxt, SignalErr:Cxt, and StopErr:Cxt but shall complete before StopErr:Cxt does. See "



Figure 3-11: Context Descriptor Or Operation Error Flow" for an overall flow of the error handling.

4. **Error In Updating CST\_BLK.signal (SignalErr:Cxt):** If an error occurs when updating CST\_BLK.signal, then the SDXI function shall set CST\_BLK.er, if not already set, and make it globally visible. (Note, an error in setting CST\_BLK.er for this action does not start any additional actions.) This action will also start in any order or in parallel LogErr:Cxt and StopErr:Cxt actions, if not already started. This action shall complete before StopErr:Cxt does. See "



Figure 3-11: Context Descriptor Or Operation Error Flow" for an overall flow of the error handling.

5. **Stopping A Context Due to Error (StopErr:Cxt):** the function initiates a background context stop action for a context in the CXTV\_RUN state in response to an Invalid:Cxt (when indicated), DescrErr:Cxt, or SignalErr:Cxt. The stop action terminates processing of new descriptors and waits for existing ones to complete. Once the context is stopped, the function serializes Read\_Index and then transitions the context to CXTV\_ERR\_FN state. This action starts in any order or in parallel with LogErr:Cxt, DescrErr:Cxt, and SignalErr:Cxt but shall be completed only after them. This completion order ensures that when software sees the context in CXTV\_ERR\_FN state, all relevant error log records and CST\_BLK entries are updated and globally visible. See "



Figure 3-11: Context Descriptor Or Operation Error Flow" for an overall flow of the error handling.

- 6. *Function-Wide Error (HaltErr:Fn):* the function initiates an error halt action in response to an uncorrectable error that prevents further safe operation of the function across all of its associated contexts. Examples include the detection of invalid configuration of MMIO registers (due to privileged software) as well as internal logic errors inside the function that lead to a HItErr:Fn action. For this action the function does the following ordered steps.
  - a. In any order or in parallel: the function halts itself; and makes best effort to log a function error (LogErr:Fn) and make it globally visible. It is implementation dependent if the function performs any context-specific state changes or stop actions when halting the function; software shall not rely upon it.
  - b. When the above completes, the function changes MMIO\_STS0.fn\_gsv to GSV\_ERROR. This order ensures that when software reads the error log immediately after detecting the transition to GSV\_ERROR, it will receive all available information.
- 7. **Logging A Function Error (LogErr:Fn):** the function attempts to generate an error log entry when an error halt action (HaltErr:Fn) is initiated.



Figure 3-11: Context Descriptor Or Operation Error Flow

Note that as a consequence of the above error flow for a context error, the SDXI function shall ensure that the error's effect upon the error log, the descriptor's CST\_BLK, Read\_Index serialization, and the context's state are synchronized when the context state transitions to CXTV\_ERR\_FN -- not before. Therefore, it is important that regardless of the manner software first becomes aware of a context error, it should wait for the context to enter the CXTV\_FN\_ERROR state before examining error information.

Each SDXI function supports a single, function-wide error log message ring in memory for all the contexts of that function; the ring is configured through MMIO space. The error log and its indexes are structured similar to an SDXI descriptor ring and operate in the manner below.

- 1. MMIO\_ERR\_CFG points to the base of the error log. Each error log index corresponds to an aligned 64-byte entry in the error log ring.
  - a. For a given error log index, its address in memory is given by
    - i. log\_sz = 2\*\*(MMIO\_ERR\_CFG.sz +12);
    - ii. log\_bs = MMIO\_ERR\_CFG & ~0xFFF;
    - iii. address = log\_bs + ( (index \* 64) % log\_sz );
  - b. The indices are not expected to reach or exceed (2\*\*64) 1 in practice.
- 2. MMIO\_ERR\_WRT indicates the next available error log index that can be written by the function.
- 3. MMIO\_ERR\_RD indicates the index of the first error log entry not yet consumed by software. As software consumes error log entries, it advances the index in MMIO\_ERR\_RD. For each error log entry consumed, software takes appropriate remediation action. As the ring is finite in size, software shall consume error log entries in a timely manner lest the error log overflows.
- 4. As the function writes more error log entries, it advances the index in MMIO\_ERR\_WRT. The error log is empty when MMIO\_ERR\_WRT is equal to MMIO\_ERR\_RD.

- 5. For each error detected when the error log is enabled, the SDXI function writes a 64-byte aligned error log entry.
  - a. If there is insufficient space in the error log for the function to write an error log entry, the error log shall overflow.
  - b. When an overflow event occurs, the MMIO\_ERR\_STS.ovf bit shall be set, MMIO\_ERR\_WRT shall not advance, and error logging shall be stopped. Contexts may continue to run even when the error log is full or in the overflow state, however detailed error information for new errors will be lost.
  - c. Note that the error log can overflow even if there are available entries in the log when the number of entries in an error log sequence exceed the available entries.
- After writing an error log entry successfully, MMIO\_ERR\_WRT will be advanced by 1, MMIO\_ERR\_STS.sts will be set to "1", and an error log interrupt shall be generated if MMIO\_ERR\_CTL.intr\_en is set to "1". If MSI or MSI-X are enabled, vector 0 will be used for the error log interrupt.

When the function logs a context descriptor error (LogErr:Cxt), it sets ERRLOG\_HD\_ENT.cv to "1". During descriptor processing and before execution, errors may be detected and logged at different processing steps for a single descriptor resulting in multiple error logs. Only one error may be logged for errors occurring during the execution phase of the descriptor operation. Refer to *"5.3, Descriptor Processing"* for more details.

Even though descriptors may reference data buffers in different address spaces and may detect errors when accessing those buffers, the errors are logged with the function hosting the context.

Note that the function may simultaneously consume and execute multiple descriptors from multiple contexts. The order in which errors are detected and their error log entries are written is implementation-dependent. However, each error log entry sequence shall be written by the function in its entirety indivisibly with respect to other error log entries.

Errors may be reported on DMA write operations. While DMA writes are posted in PCI-Express and do not return any status, write response status may be available in implementations that do not connect using a physical PCIe link.

## 3.4.1 Error Log Header Entry (ERRLOG\_HD\_ENT)

The SDXI function reports the details of a specific error event by writing an error log header entry (ERRLOG\_HD\_ENT) to the error log. Error log header entries are formatted as described in *"Table 3-9: Error Log Header Entry (ERRLOG\_HD\_ENT[^3])"* and shown in *"Figure 3-12: Error Log Header Entry (ERRLOG\_ENT)"*. (The entry format is similar to an SDXI descriptor but is made distinct by a "type" field value of "0x7F7".) The SDXI function constructs the error log entry in the following way.

- 1. The SDXI function shall set the "vI" field to "1" and the "type" field to "0x7F7".
- 2. The SDXI function shall set the "re" field, which indicates whether the error was serious enough to stop a context or the entire SDXI function.
- 3. The function shall set the "step" field to allow system software to quickly distinguish between errors resulting from the producer and errors likely resulting from system software, the SDXI function, memory, or other hardware components. The "step" field encodings are shown in *"Table 3-10: (Flagged) Processing Step"*.
- 4. The function should set the "sub\_step" field to the appropriate value whenever this information is known; however, the function may report "0" ("other") otherwise.
- 5. When relevant (see table 3-10), the function should identify the failing context number by setting the "cxt\_num" field and setting the "cv" field to "1".
- 6. When relevant (see table 3-10), the function should identify the failing descriptor index by setting the "dsc\_index" field and setting the "div" field to "1".
- 7. When relevant (see table 3-10), the function should identify one of the failing buffers or AKEYs by setting the "buf" field and setting the "bv" field to "1".
- 8. To assist in offline debug analysis, the function may provide an error class ("err\_class") and vendor specific information ("vendor") applicable to the detected error. The code value returned is implementation dependent and not intended for use by system software for online remediation. The error class codes form a hierarchy as described in *"Table 3 11: Error Class Hierarchy"*. Although, an SDXI function implementation may choose to always return a code of 'Generic Error' (0), it is recommended that the function make best effort to return something more meaningful.

| 31 30 29 28 | 27 26    | 6 25 | 24 2  | 3 22 | 2 21 | 1 20 | ) 19 | 9 1 | 18 3 | 17 | 16  | 15 1  | 4 1 | 3 1 | 12 | 11 | 10  | 9    | 8    | 7    | 6 | 5   | 4 | 3                 | 2      | 2 1   | 0   | )   |        |
|-------------|----------|------|-------|------|------|------|------|-----|------|----|-----|-------|-----|-----|----|----|-----|------|------|------|---|-----|---|-------------------|--------|-------|-----|-----|--------|
| rsv         | rsv type |      |       |      |      |      |      | rsv |      |    |     | ste   | ер  |     |    |    |     |      | ٢sv  | /    |   |     | v | <mark>' </mark> + | + 0x00 |       |     |     |        |
|             |          | C    | xt_nu | ım   |      |      |      |     |      |    |     | rsv   | r   | е   |    | S  | ub_ | _ste | р    | rsv  | , | but | F | rsv               | b      | v div | / C | v + | + 0x04 |
|             |          |      |       |      |      |      |      |     |      | ds | c_i | index |     |     |    |    |     |      |      |      |   |     |   |                   |        |       |     | +   | + 0x08 |
|             |          |      |       |      |      |      |      |     |      |    |     |       |     |     |    |    |     |      |      |      |   |     |   |                   |        |       |     | +   | - 0x0C |
|             |          |      |       |      |      |      |      |     |      |    | rs  | SV .  |     |     |    |    |     |      |      |      |   |     |   |                   |        |       |     | +   | + 0x10 |
|             |          |      |       |      |      |      |      |     |      |    |     |       |     |     |    |    |     |      |      |      |   |     |   |                   |        |       |     | +   | ⊦ 0x14 |
|             |          |      |       |      |      |      |      |     |      |    |     |       |     |     |    |    |     |      |      |      |   |     |   |                   |        |       |     | +   | ⊦ 0x18 |
|             |          |      |       |      |      |      |      |     |      |    |     |       |     |     |    |    |     |      |      |      |   |     |   |                   |        |       |     | +   | + 0x1C |
|             |          |      |       |      |      |      |      |     |      |    |     |       |     |     |    |    |     |      |      |      |   |     |   |                   |        |       |     | +   | ⊦ 0x20 |
|             |          |      |       |      |      |      |      |     |      |    |     |       |     |     |    |    |     |      |      |      |   |     |   |                   |        |       |     | +   | ⊦ 0x24 |
|             |          |      |       |      |      |      |      |     |      |    |     |       |     |     |    |    |     |      |      |      |   |     |   |                   |        |       |     | +   | + 0x28 |
|             |          |      | rsv   |      |      |      |      |     |      |    |     |       |     |     |    |    |     | (    | err_ | clas | S |     |   |                   |        |       |     | +   | + 0x2C |
|             |          |      |       |      |      |      |      |     |      | ٧  | en  | dor   |     |     |    |    |     |      |      |      |   |     |   |                   |        |       |     | +   | + 0x30 |
|             |          |      |       |      |      |      |      |     |      |    |     |       |     |     |    |    |     |      |      |      |   |     |   |                   |        |       |     | +   | + 0x34 |
|             |          |      |       |      |      |      |      |     |      |    |     |       |     |     |    |    |     |      |      |      |   |     |   |                   |        |       |     | +   | + 0x38 |
|             |          |      |       |      |      |      |      |     |      |    |     |       |     |     |    |    |     |      |      |      |   |     |   |                   |        |       |     | +   | ⊦ 0x3C |

Figure 3-12: Error Log Header Entry (ERRLOG\_ENT)

| Field          | Bits    | Subfield   | Description                                                                                                                                                                                                                                                                                                                                                                |
|----------------|---------|------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| u32 opcode;    | 000     | vl = 1     | Valid. When 1, indicates the other bits in this data structure are valid. When 0, all other bits in this data structure shall be ignored.                                                                                                                                                                                                                                  |
|                | 007:001 | rsvd       | Shall be set to "0"                                                                                                                                                                                                                                                                                                                                                        |
|                | 013:008 | step       | Identifies the SDXI function processing step that encountered an error. See<br>"Table 3-10: (Flagged) Processing Step"                                                                                                                                                                                                                                                     |
|                | 015:014 | rsvd       | Shall be set to 0.                                                                                                                                                                                                                                                                                                                                                         |
|                | 026:016 | type=0x7F7 | Descriptor type 0x7F7 = ERRLOG_ENT. See section "6.1.1".                                                                                                                                                                                                                                                                                                                   |
|                | 031:027 | rsvd       | Shall be set to 0.                                                                                                                                                                                                                                                                                                                                                         |
| u16 misc0;     | 32      | cv         | Indicates if the cxt_num field is valid. See See "Table 3-10: (Flagged)<br>Processing Step" for when "cv" must be set.                                                                                                                                                                                                                                                     |
|                | 33      | div        | Indicates if the dsc_index field is valid. See See "Table 3-10: (Flagged)<br>Processing Step" for when "div" must be set.                                                                                                                                                                                                                                                  |
|                | 34      | bv         | Buffer Valid: indicates if the "buf" field is valid. See See "Table 3-10:<br>(Flagged) Processing Step" for when "bv" must be set.                                                                                                                                                                                                                                         |
|                | 35      | rsvd       | Shall be set to 0.                                                                                                                                                                                                                                                                                                                                                         |
|                | 38:36   | buf        | For SDXI descriptors with multiple Buffers or Akey indexes, identifies which<br>instance was involved in the error. If an SDXI function implementation<br>detects more than one akey/buffer failure while processing a descriptor, the<br>implementation shall choose one of them to log.<br>0 = Buffer0 or Akey0<br>1 = Buffer1 or Akey1<br>2-7 = reserved for future use |
|                | 39      | rsvd       | Shall be set to 0.                                                                                                                                                                                                                                                                                                                                                         |
|                | 43:40   | sub_step   | Identifies the specific sub-step that failed<br>0 = Other (or Internal Error) (May be used when sub-step is unknown)<br>1 = Address Translation Failure<br>2 = Data Access Failure (i.e. Read/Write/Atomic Access)<br>3 = Data Validation Failure (i.e. data content is invalid)<br>-15 = reserved for future use                                                          |
|                | 46:44   | re         | <ul> <li>SDXI Function Reaction to the Error</li> <li>Informative Entry, nothing stopped</li> <li>SDXI Context Stopped ("cv" and "cxt_num" shall be valid).</li> <li>SDXI Function Stopped</li> <li>3-7 = Reserved for future use</li> </ul>                                                                                                                               |
|                | 47      | rsvd       | Shall be set to 0.                                                                                                                                                                                                                                                                                                                                                         |
| u16 cxt_num;   | 063:048 |            | Context number associated with the error log entry when cv = "1".                                                                                                                                                                                                                                                                                                          |
| u64 dsc_index; | 127:064 |            | Descriptor index of descriptor that failed when "div" ="1".                                                                                                                                                                                                                                                                                                                |
| u8 rsvd_0[28]; | 351:128 |            | Shall be set to 0.                                                                                                                                                                                                                                                                                                                                                         |
| u16 err_class; | 367:352 |            | Provides an error classification code for offline debug analysis. See discussion above.                                                                                                                                                                                                                                                                                    |
| u8 rsvd 1[2];  | 383:368 |            | Shall be set to 0.                                                                                                                                                                                                                                                                                                                                                         |
| u32 vendor[4]; | 511:384 |            | These fields are available for SDXI functions to record additional vendor defined debugging information relating to the error, such as telemetry relating to internal errors.                                                                                                                                                                                              |

Table 3-9: Error Log Header Entry (ERRLOG\_HD\_ENT<sup>[^3]</sup>)

| Table | 3-10: | (Flagged) | Processing | Step |
|-------|-------|-----------|------------|------|
|-------|-------|-----------|------------|------|

| Encod | ding (Enc) & Types                                                        | Re  | q'd Fi | elds | CDVI Eurotian Action (Decrement)                                 |  |  |  |
|-------|---------------------------------------------------------------------------|-----|--------|------|------------------------------------------------------------------|--|--|--|
| Enc   | Processing Step                                                           | с٧  | div    | bv   | SDAI Function Action (Response)                                  |  |  |  |
| 0     | reserved                                                                  | n/a | n/a    | n/a  | n/a                                                              |  |  |  |
| 1     | Internal Error (ERRV_INT)                                                 | Х   | 0      | 0    | Function Stop or Context Stop ('cv' shall be 1 if Context Stop). |  |  |  |
| 2     | Context Level 2 Table Entry – Translate,<br>Read, Validate. (ERRV_CXT_L2) | 1   | 0      | 0    | Function Stop or Context Stop.                                   |  |  |  |
| 3     | Context Level 1 Table Entry – Translate,<br>Read, Validate. (ERRV_CXT_L1) | 1   | 0      | 0    | Function Stop or Context Stop.                                   |  |  |  |
| 4     | Context Control – Translate, Read,<br>Validate. (ERRV_CXT_CTL)            | 1   | 0      | 0    | Function Stop or Context Stop.                                   |  |  |  |
| 5     | Context Status – Translate, Access,<br>Validate. (ERRV_CXT_STS)           | 1   | 0      | 0    | Function Stop or Context Stop.                                   |  |  |  |
| 6     | Write_Index – Translate, Read, Validate.<br>(ERRV_WRT_IDX)                | 1   | 0      | 0    | Context Stop.                                                    |  |  |  |
| 7     | Descriptor Entry – Translate, Access,<br>Validate. (ERRV_DSC_GEN)         | 1   | 1      | 0    | Context Stop.                                                    |  |  |  |
| 8     | Descriptor CST_BLK – Translate, Access,<br>Validate. (ERRV_DSC_CSB)       | 1   | 1      | 0    | Context Stop.                                                    |  |  |  |
| 9     | Atomic Return Data – Translate, Access.<br>(ERRV_ATOMIC)                  | 1   | 1      | 0    | Context Stop.                                                    |  |  |  |
| 10    | Descriptor: Data Buffer – Translate,<br>Access. (ERRV_DSC_BUF)            | 1   | 1      | 1    | Context Stop.                                                    |  |  |  |
| 11    | Descriptor AKey Lookup – Translate,<br>Access, Validate. (ERRV_DSC_AKEY)  | 1   | 1      | 1    | Context Stop.                                                    |  |  |  |
| 12    | Function RKey Lookup – Translate, Read, Validate. (ERRV_FN_RKEY)          | 0   | 0      | 0    | Informative Entry, nothing stopped                               |  |  |  |
| 13-63 | reserved                                                                  | n/a | n/a    | n/a  | n/a                                                              |  |  |  |

#### Table 3-11: Error Class Hierarchy

| Bit layo  | Bit layout and definitions for err_class |                                  |                                |                                                                      |  |  |  |  |  |  |
|-----------|------------------------------------------|----------------------------------|--------------------------------|----------------------------------------------------------------------|--|--|--|--|--|--|
| [15:12]   | [11:08]                                  | [07:04]                          | [03:00]                        |                                                                      |  |  |  |  |  |  |
| 0x0000    | Generic                                  | SDXI Error                       |                                |                                                                      |  |  |  |  |  |  |
|           | 0x1000                                   | 000 SDXI Internal Function Error |                                |                                                                      |  |  |  |  |  |  |
|           | 0x2000                                   | Generic Log                      | neric Logical Error            |                                                                      |  |  |  |  |  |  |
|           |                                          | 0x2100                           | Unsuppo                        | Insupported Field Encoding                                           |  |  |  |  |  |  |
|           |                                          | 0x2200                           | Non-zero                       | Non-zero reserved field used                                         |  |  |  |  |  |  |
|           |                                          | 0x2300                           | Specifica                      | ation, Implementation, or instance limit exceeded                    |  |  |  |  |  |  |
|           |                                          |                                  | 0x2310                         | max_buffer limit exceeded                                            |  |  |  |  |  |  |
|           |                                          |                                  | 0x2320                         | illegal or invalid AKey Index                                        |  |  |  |  |  |  |
|           |                                          |                                  | 0x2330                         | illegal or invalid context index                                     |  |  |  |  |  |  |
|           |                                          |                                  | 0x2340                         | illegal or invalid descriptor ring size                              |  |  |  |  |  |  |
|           |                                          |                                  | 0x2350                         | illegal values for Read_Index or Write_Index (overflow or underflow) |  |  |  |  |  |  |
|           |                                          |                                  | 0x2360                         | illegal PASID (implementation limit or format)                       |  |  |  |  |  |  |
|           |                                          |                                  | 0x2370                         | illegal intr_num (implementation limit or format)                    |  |  |  |  |  |  |
|           |                                          | 0x2400                           | Unsuppo                        | orted descriptor type/subtype encoding                               |  |  |  |  |  |  |
|           |                                          | 0x2500                           | Time-ou                        | t waiting for descriptor vI bit to become set                        |  |  |  |  |  |  |
|           | 0x3000                                   | Generic Me                       | mory Acc                       | ess Error                                                            |  |  |  |  |  |  |
|           |                                          | 0x3100                           | Memory                         | Access Permission Error                                              |  |  |  |  |  |  |
|           |                                          |                                  | 0x3110                         | PCIe Unsupported Request Error                                       |  |  |  |  |  |  |
|           |                                          |                                  | 0x3120                         | PCIe Completer Abort Error                                           |  |  |  |  |  |  |
|           |                                          | 0x3200                           | Memory                         | Access Data Poison Error                                             |  |  |  |  |  |  |
|           |                                          | 0x3300                           | Memory                         | Access Time-out                                                      |  |  |  |  |  |  |
|           |                                          | 0x3400                           | Memory                         | Translation Error                                                    |  |  |  |  |  |  |
|           |                                          |                                  | 0x3410                         | PCIe Translation Completion Unsupported Request Error                |  |  |  |  |  |  |
|           |                                          |                                  | 0x3420                         | PCIe Translation Completion Completer Abort Error                    |  |  |  |  |  |  |
|           |                                          | 0x3500                           | Page Request Error or Time-out |                                                                      |  |  |  |  |  |  |
|           |                                          |                                  | 0x3510                         | PCIe PRG Invalid Request Response Error                              |  |  |  |  |  |  |
|           |                                          |                                  | 0x3520                         | PCIe PRG Failure Response Error                                      |  |  |  |  |  |  |
| All other | encoding                                 | g reserved                       |                                |                                                                      |  |  |  |  |  |  |

## 3.4.2 Error Log Initialization

To initialize or re-initialize the error log, the following procedure shall be used by software.

- 1. Clear MMIO\_ERR\_CFG.en to "0".
- 2. Clear MMIO\_ERR\_STS to "0".
- 3. To hold the error log, allocate a 4-KB aligned contiguous memory buffer whose size is a power-oftwo of at least 4-KB. Initialize the buffer to zero.
- 4. Set MMIO\_ERR\_CTL.intr\_en to "1" if interrupts on errors are desired.
- 5. Clear MMIO\_ERR\_WRT and MMIO\_ERR\_RD to "0".
- 6. Program MMIO\_ERR\_CFG:
  - a. Program MMIO\_ERR\_CFG.ptr to point to the start of the error log memory region.
  - b. Program MMIO\_ERR\_CFG.sz to the appropriate size of the error log.
  - c. Set MMIO\_ERR\_CFG.en to "1" to enable the error log mechanism.

#### 3.4.3 Error Log Processing by Software

Software may use the following set of steps to process the error log.

- 1. Check MMIO\_ERR\_STS and perform any required remediation.
- 2. If MMIO\_ERR\_STS.sts is "1", then compute read\_index; otherwise exit this set of steps.
  - read\_index = MMIO\_ERR\_RD;
- 3. Clear MMIO\_ERR\_STS to "0". This clears all error log overflow, error, and status flags. If using an edge triggered interrupt mechanism, the system should be ready to accept another interrupt without loss before this MMIO write to MMIO\_ERR\_STS.
- 4. Compute write\_index. The MMIO read of MMIO\_ERR\_WRT must follow step 3 to ensure that either software is working with the latest value of MMIO\_ERR\_WRT or another interrupt will occur after software completes this entire set of steps.
  - write\_index = MMIO\_ERR\_WRT;
- 5. If read\_index == write\_index exit this set of steps.
- 6. If read\_index < write\_index, then proceed to the next step; otherwise go to step 11.
- 7. Compute the following:
  - log\_sz = 2\*\*(MMIO\_ERR\_CFG.sz +12);
  - log\_bs = MMIO\_ERR\_CFG & ~0xFFF;
- 8. Process the error log entry at this address:
  - address = log\_bs + ( (read\_index \* 64) % log\_sz );
- 9. Advance read\_index.
- 10. Loop back to step 6.
- 11. Write read\_index to MMIO\_ERR\_RD.

# 3.5 Administrative Context (Context 0)

Administrative operations are used by privileged software to manage the SDXI function and are only supported in Context 0 ("Administrative" Context) within each function. Software shall ensure context 0 is the first context started and the last stopped to ensure proper operation. When context 0 goes into an undefined or error state, software shall stop and restart the function to ensure correct operation.

An SDXI function may only cache private copies of all LVL\_L2 data (see "4.3.1, SDXI Memory-Based Data-Structure Hierarchy and Caching") associated with context 0 when the context is at the CXTV\_RUN state.

An Administrative Context only supports AdminGrp, ConnectGrp and other specified administrative operations; no other operation groups including the DmaBaseGrp operations are supported. A VF Administrative Context may be used to manage contexts associated with that VF. A PF Administrative Context may be used to manage contexts hosted by the PF or contexts hosted by any VF associated with the PF. If a VF encounters a failure while processing a P2V administrative operation, the error will be logged by the VF.

# 3.6 Data Steering Hints (DSH)

An SDXI Function may allow for the inclusion of DSH on data buffer accesses. One potential use of DSH is to provide a cache injection hint that allows the data buffer to be injected into a CPU cache. The actual encoding, propagation, and use of DSH information is platform specific and outside the scope of this specification.

The mapping of DSH information onto device architecture mechanisms is specific to those architectures. For details on how DSH information maps onto PCIe, please refer to "8.3, Mapping SDXI DSH to PCIe TLP Processing Hints (PCIe TPH)".

The context producer may request the inclusion of DSH information on a data buffer access by setting the data buffer's "attr.coh\_ctl" field to "10b" in the descriptor.

DSH information is only included when all of the following conditions are true:

- All required bus/device mechanisms to support DSH are present and enabled
- The data buffer's "attr.coh\_ctl" field is "10b"
- If the data buffer's "AKEY\_ENT.sfunc" field is zero, the "AKEY\_ENT.ste" field is set to 1
- If the data buffer's "AKEY\_ENT.sfunc" field is non-zero, the data buffer's "RKEY\_ENT.ste" field is set to 1

A 2-bit ph field and 16-bit stag field are provided through the data buffer's AKey or RKey table entry. SDXI implementations are free to apply the requested DSH information on all, some, or none of the accesses to the referenced data buffer.

Software is responsible for ensuring that the target of a request containing DSH information supports receiving DSH information. Software is also responsible for ensuring that the bus infrastructure between the requesting SDXI Function and the data buffer target supports the propagation of DSH information.

The reaction of a target to receiving unexpected DSH information or encodings is outside the scope of this specification.

## 4.1 SDXI Function State

An SDXI function operates serially within a number of basic states indicated by the MMIO\_STS0.fn\_gsv register and shown in the figure below. Software may use the MMIO\_CTL0.fn\_gsr register to help transition the function between the various states.





| MMIO_S              | MMIO_STS0.fn_gsv values |  |  |  |  |  |  |  |
|---------------------|-------------------------|--|--|--|--|--|--|--|
| 0b000               | GSV_STOP                |  |  |  |  |  |  |  |
| 0b001               | GSV_INIT                |  |  |  |  |  |  |  |
| 0b010               | GSV_ACTIVE              |  |  |  |  |  |  |  |
| 0b011               | GSV_STOPG_SF            |  |  |  |  |  |  |  |
| 0b100               | GSV_STOPG_HD            |  |  |  |  |  |  |  |
| 0b101               | GSV_ERROR               |  |  |  |  |  |  |  |
| All Others Reserved |                         |  |  |  |  |  |  |  |

| MMIO_CTL0.fn_gsr |              |  |  |  |  |  |  |  |
|------------------|--------------|--|--|--|--|--|--|--|
| 0b00             | GSRV_RESET   |  |  |  |  |  |  |  |
| 0b01             | GSRV_STOP_SF |  |  |  |  |  |  |  |
| 0b10             | GSRV_STOP_HD |  |  |  |  |  |  |  |
| 0b11             | GSRV_ACTIVE  |  |  |  |  |  |  |  |

### 4.1.1 G0: GSV\_STOP State

The function is idle and not enabled to process any memory data structures or perform DMA operations. All DMA operations from remote functions targeting this function's local address spaces are aborted.

From the GSV\_STOP state, software may set MMIO\_CTL0.fn\_gsr to GSRV\_ACTIVE to start the function. The function will attempt to transition through the GSV\_INIT state into the GSV\_ACTIVE state. Software shall program MMIO\_CXT\_L2 prior to setting MMIO\_CTL0.fn\_gsr to GSRV\_ACTIVE. Writing any other value to MMIO\_CTL0.fn\_gsr is ignored.

#### 4.1.2 G1: GSV\_INIT State

Upon receiving the GSRV\_ACTIVE request while in the GSV\_STOP state, the function shall transition into the GSV\_INIT state to perform any necessary self-initialization. All DMA operations from remote functions targeting this function's local address spaces are aborted in this state.

If initialization is successful, the function transitions to the GSV\_ACTIVE state. As part of the initialization, the SDXI function may validate the contents of the following MMIO registers.

• MMIO\_CTL0 and MMIO\_CTL2.

The SDXI function initializes the contents of the following MMIO registers.

• MMIO\_STS0.

While in this state, if an error occurs during the initialization that prevents the processing of all contexts or software sets MMIO\_CTL0.fn\_gsr to any value other than GSRV\_ACTIVE, the function shall initiate a HaltErr:Fn action, which includes transitioning MMIO\_STS0.fn\_gsv to the GSV\_ERROR state. Illegally writing GSRV\_RESET to MMIO\_CTL0.fn\_gsr may cause the SDXI function to immediately begin transitioning out of the GSV\_ERROR state upon the completion of the HaltErr:Fn action.

### 4.1.3 G2: GSV\_ACTIVE State

In the GSV\_ACTIVE state, the function is enabled to process contexts and descriptors. From this state, privileged software may request the function to stop by writing MMIO\_CTL0.fn\_gsr to GSRV\_STOP\_HD to request a hard stop or to GSRV\_STOP\_SF to request a soft stop.

While in this state, if an error occurs that prevents the processing of all contexts or software sets MMIO\_CTL0.fn\_gsr to GSRV\_RESET, the function shall initiate a HaltErr:Fn action, which includes transitioning MMIO\_STS0.fn\_gsv to the GSV\_ERROR state. If HaltErr:Fn was triggered due to SW writing GSRV\_RESET to MMIO\_CTL0.fn\_gsr, then once HaltErr:Fn completes, the function may immediately begin transitioning out of GSV\_ERROR to GSV\_STOP.

## 4.1.4 G3: GSV\_STOPG\_SF ("Soft Stopping") State

In the GSV\_STOPG\_SF state, the function shall stop processing new descriptors and wait until all outstanding descriptors complete naturally or until an implementation-defined timeout period has expired at which point any remaining descriptors are aborted. The function shall log errors as necessary for each aborted descriptor. See "4.3.5, Function and Context Stop Actions".

When all contexts have reached their final state, and all previously received MMIO doorbells are fully evaluated (i.e. there are no outstanding data structure reads due to a prior doorbell), the function transitions to the GSV\_STOP state.

While in this state, if an error occurs that prevents one or more contexts from stopping, the function shall initiate a HaltErr:Fn action, which includes transitioning MMIO\_STS.fn\_gsv to the GSV\_ERROR state. Descriptor errors including aborts alone do not drive a transition to the GSV\_ERROR state.

Writing GSRV\_STOP\_HD to MMIO\_CTL0.fn\_gsr transitions MMIO\_STS.fn\_gsv to the GSV\_STOPG\_HD state. Writing any other value to MMIO\_CTL0.fn\_gsr is ignored.

# 4.1.5 G4: GSV\_STOPG\_HD ("Hard Stopping") State

In the GSV\_STOPG\_HD state, the function stops processing new descriptors and waits less patiently for all outstanding descriptors to complete; and aborts outstanding descriptors more aggressively to speed up the completion wait. The function shall log errors as necessary for each aborted descriptor. See "4.3.5, *Function and Context Stop Actions*". Used as a last resort to stop the function.

When all contexts have reached their final state, and all previously received MMIO doorbells are fully evaluated (i.e. there are no outstanding data structure reads due to a prior doorbell), the function transitions to the GSV\_STOP state.

While in this state, if an error occurs that prevents one or more contexts from stopping, the function shall initiate a HaltErr:Fn action, which includes transitioning MMIO\_STS0.fn\_gsv to the GSV\_ERROR state. Writing any value to MMIO\_CTL0.fn\_gsv is ignored.

### 4.1.6 G5: GSV\_ERROR State

If an uncorrectable error prevents further safe operation of a function across all of its associated contexts, the function shall initiate a HaltErr:Fn action, which includes transitioning MMIO\_STS0.fn\_gsv to the GSV\_ERROR state. If MMIO\_CTL0.fn\_err\_intr\_en is "1", an interrupt to software is generated upon entering this state.

All DMA operations from remote functions targeting this function's local address spaces are aborted in this state. It is implementation-dependent if the function cleans up any outstanding DMA operations in this state.

It is implementation dependent if the function performs any context-specific state changes or stop actions when it enters this state; software shall not rely upon it. Software may return the function to the GSV\_STOP state by performing one these actions.

- 1. Set MMIO\_CTL0.fn\_gsr to GSRV\_RESET and wait until MMIO\_STS0.fn\_gsv is GSV\_STOP. It is implementation dependent what the contents of context-specific state are after a GSRV\_RESET is requested.
- 2. Perform a function-level reset at the device level.

Writing any value other than GSRV\_RESET to MMIO\_CTL0.fn\_gsr is ignored.

## 4.1.7 Function Reset and Outstanding DMA Requests

When a device-level reset is being performed on an SDXI function, there may be outstanding DMA requests from this specific function to both local and remote address spaces. This function may also receive DMA from other SDXI functions targeting this function's local address spaces. Those remote functions may continue to issue new DMA requests attempting to access this function's local address spaces while this function is resetting.

An SDXI function shall terminate all processing (descriptor ingest, descriptor execution, incoming RKeybased access, error reporting, etc.) and expeditiously clear (complete or cleanly abort) any outstanding external interactions. When complete, the function shall be stopped and MMIO\_STS0.fn\_gsv shall equal GSV\_STOP.

It is privileged software's responsibility to notify the remote software contexts that the target function is being reset. It shall also ensure either orderly or unorderly removal of all associated connections prior to re-enabling the function that was reset.

### 4.1.8 Activation of the SDXI Function by Software

Software shall activate the SDXI function from the GSV\_STOP state by performing the following steps.

- 1. Configure Capabilities:
  - a. Read MMIO\_CAP0 and MMIO\_CAP1 to discover the supported SDXI features. If restoring saved function content, verify that the function's capabilities match the requirements of the saved function content.
  - b. Write MMIO\_CTL2 to configure the SDXI features set exposed to driver software.
- 2. Context Level 2 Table Setup:
  - a. Allocate an aligned 4KB region of memory for the Context Level 2 Table
  - b. Initialize all Context Level 2 Table entries to "0" or restore them from the saved function content and adjust accordingly.
  - c. Program the MMIO\_CXT\_L2 register to point to the Context Level 2 Table location in memory.
    - If guest virtual addressing is required, set MMIO\_CTL0.fn\_pasid and MMIO\_CTL0.fn\_pasid\_vl to appropriate values.
- 3. Context Level 1 Table Setup:
  - a. Allocate an aligned 4KB region of memory for the first Context Level 1 Table (contexts 0 to 127). Other Context Level 1 Tables may also be allocated at this step.
  - b. Initialize these Context Level 1 Table entries to "0" or restore them the saved function content and adjust accordingly.
  - c. Initialize the Context level 2 table to point to the Context Level 1 tables allocated above.
- 4. Administrative Context:
  - a. Recommended: if not restoring a saved administrative context, software should create the function's administrative context (context 0) and all associated structures.
  - b. The context's CXT\_STS.state value should be set to "CXTV\_RUN" to allow the context to be jump started, see step 10b.
- 5. Mailbox: If present, initialize the mailbox registers
- 6. If restoring saved function and context state, restore and adjust the state as appropriate.
- 7. Error Log: Initialize or restore the Error Log. Refer to "3.4.2, Error Log Initialization" for more details.
- Software may also need to configure and enable additional PCIe standards-based features such as enabling PCIe "AtomicOp Requester Enable" (PCIe Device Control 2 Register), MSI/MSI-X, ATS, PRI, and TPH based on the desired use cases.
- 9. Set MMIO\_CTL0.fn\_gsr to GSRV\_ACTIVE.
  - a. Wait until MMIO\_STS0.fn\_gsv is GSV\_ACTIVE or GSV\_ERROR before performing any other actions on the function.
  - b. If the function state becomes GSV\_ERROR, software shall use the procedures described in *"4.1.6, G5: GSV\_ERROR State"* to transition the function to GSV\_STOP.
- 10. Once the function is at GSV\_ACTIVE state, software may start the created and restored contexts using one of the methods below.
  - a. From the PF, the contexts of a VF can be started using a P2V.DSC\_CXT\_START\_RS operation.
  - b. From the local function, context 0 can be "jump started" (See "4.3.4, Starting A Context and Context Signaling"); then a LOC.DSC\_CXT\_START\_RS operation can be issued from that context to start the remaining ones.

#### 4.1.9 Stopping of the SDXI Function by Software

Software shall transition the SDXI function from GSV\_ACTIVE to GSV\_STOP state by performing the following steps.

- 1. Set MMIO\_CTL0.fn\_gsr to GSRV\_STOP\_[SF, HD].
  - a. This will cause the function to enter GSV\_STOPG\_[SF, HD] and prevent any new operations from being processed in all contexts of the function.
- 2. Wait until MMIO\_STS0.fn\_gsv is GSV\_STOP or GSV\_ERROR.
  - a. If the function is at GSV\_STOPG\_SF and is taking longer than acceptable to stop, software can Set MMIO\_CTL0.fn\_gsr to GSRV\_STOP\_HD to change the completion policy to hard-stop; note, this may cause more outstanding descriptors to be aborted.
- 3. If the function state becomes GSV\_ERROR, software shall use the procedures described in "4.1.6, G5: GSV\_ERROR State" to transition the function to GSV\_STOP.
- 4. Once the function is at GSV\_STOP, software may save the set of all current function contents including the error log, and optionally restore a different set of function contents.

# 4.2 SDXI Context State

An SDXI context operates serially within a number of basic states determined by the CXT\_STS.state location in memory, the context valid bits (CXT\_L2\_ENT.vI, CXT\_L1\_ENT.vI, and CXT\_CTL.vI), and the function state.





| CXT_STS.s                 | tate                                                                                                                                                                                                                                            |            | Key in {Set}       | The value of "key" is within the values belonging to "Set"         |  |  |  |  |  |  |
|---------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|--------------------|--------------------------------------------------------------------|--|--|--|--|--|--|
| 0b0000                    | CXTV_STOP_SW                                                                                                                                                                                                                                    |            | {Gsv_Run}          | The set containing the values GSV_ACTIVE and GSV_STOPG_[SF, HD]    |  |  |  |  |  |  |
| 0b0001                    | CXTV RUN                                                                                                                                                                                                                                        |            | Cs                 | CXT_STS.state                                                      |  |  |  |  |  |  |
| 0b0010                    | CXTV_STOPG_SW                                                                                                                                                                                                                                   |            | fn_gsv             | Target Function's MMIO_STS0.fn_gsv                                 |  |  |  |  |  |  |
| 060100                    |                                                                                                                                                                                                                                                 |            | SW:                | Software Initiated State Transition                                |  |  |  |  |  |  |
| 00100                     | UNIV_SIUF_FIN                                                                                                                                                                                                                                   |            | FN:                | Function Initiated State Transition                                |  |  |  |  |  |  |
| 0b0110                    | CXTV_STOPG_FN                                                                                                                                                                                                                                   |            | nnn                | Software Initated State Transition causing CXTV_UNDEF.             |  |  |  |  |  |  |
| 0b1111                    | CXTV_ERR_FN                                                                                                                                                                                                                                     | XTV_ERR_FN |                    |                                                                    |  |  |  |  |  |  |
| !value                    | Ivalue For booleans, "I1" = "0" and "I0" = "1". For non-booleans, maps to any other allowed attribute value that is not "value".                                                                                                                |            |                    |                                                                    |  |  |  |  |  |  |
| key = value               | e Indicates that a state a                                                                                                                                                                                                                      | ttribu     | te ("key") is "va  | lue". When used in a state transition, the statement must be true. |  |  |  |  |  |  |
| key → valu                | e A transition of a state a                                                                                                                                                                                                                     | ttribu     | ute ("key") to "va | alue".                                                             |  |  |  |  |  |  |
| Cv                        | Cv Aggregate of (CXT_L2_ENT.vl and CXT_L1_ENT.vl and CXT_CTL.vl). Cv = 1 means all valid bits are "1". Cv = 0 means at least one valid bit is "0". Cv $\rightarrow$ n, means one or more valid bits have been changed such that Cv = n.         |            |                    |                                                                    |  |  |  |  |  |  |
| Fi*                       | Fi* Implementations may transition into CXTV_RUN.EXEC state without receiving a doorbell write by reading Write_Index and Read_Index from memory. Regardless, Software shall always issue doorbell writes in order to ensure correct operation. |            |                    |                                                                    |  |  |  |  |  |  |
| Not shown:<br>determining | Not shown: Regardless if an Administrative Descriptor is issued by the PF or the Local function (target), the target function's Gsv is used in determining its state transitions                                                                |            |                    |                                                                    |  |  |  |  |  |  |

## 4.2.1 S1: CXTV\_INVALID State

The SDXI function is in GSV\_ACTIVE state and managing the context; but the context is invalid because one or more of the following attributes are not "1": CXT\_L2\_ENT.vI, CXT\_L1\_ENT.vI, and CXT\_CTL.vI. In this state, the function does not process descriptors and does not respond to doorbell writes for the context. This state can be entered from the following.

- CXTV\_STOP\_[SW, FN] or CXTV\_ERR\_FN when software changes one or more of the context valid bits to "0".
- When the function transitions to GSV\_ACTIVE and one or more of the context valid bits are "0".

This state can transition to the following.

- CXTV\_RUN, CXTV\_ERR\_FN, or CXTV\_STOP\_[SW, FN] based on the value of CXT\_STS.state when the function is at GSV\_ACTIVE and software changes one or more of the context valid bits such that all are at "1".
- CXTV\_UNDEF when the function is at GSV\_ACTIVE, software changes one or more of the context valid bits such that all are at "1", and CXT\_STS.state is not CXTV\_RUN, CXTV\_ERR\_FN, or CXTV\_STOP\_[SW, FN].

Unless explicitly required by an action or operation to suppress or signal an error, an implementation of an SDXI function should suppress signaling errors on a context in CXT\_INVALID state. However, the implementation may not be able to suppress the signaling of these errors in all cases. As a result, software shall expect and dispose of unexpected errors on a context in CXT\_INVALID state.

## 4.2.2 S2: CXTV\_STOP\_[SW, FN] States

In these states the function does not process descriptors and does not respond to doorbell writes for the context. The target context shall be stopped at a descriptor boundary. This state can be entered from the following.

- CXTV\_STOPG\_[SW, FN] by the function when it detects that no error occurred in context processing and context execution.
- CXTV\_ERR\_FN when software changes CXT\_STS.state to CXTV\_STOP\_[SW, FN]. (It is recommended that software resolve the error before making this change.)
- CXTV\_INVALID when the function is at GSV\_ACTIVE, CXT\_STS.state is CXTV\_STOP\_[SW, FN], and all context valid bits are "1".

If context 0 (Administrative context) enters this state, the function shall invalidate and exclude all privately cached copies of LVL\_L2 data associated with context 0 until the context is once again at CXTV\_RUN. (See "4.3.1, SDXI Memory-Based Data-Structure Hierarchy and Caching")

This state can transition to the following.

- CXTV\_RUN when the function is at GSV\_ACTIVE and software performs either a successful DSC\_CXT\_START\_[NM, RS] operation or a change of CXT\_STS.state to "CXTV\_RUN (see "4.3.4, Starting A Context and Context Signaling").
- CXTV\_INVALID when software changes one or more of the context valid bits to "0".
- CXTV\_UNDEF when software changes CXT\_STS.state to values other than CXTV\_STOP\_[SW, FN] and CXTV\_RUN.

#### 4.2.2.1 CXTV\_STOP\_SW State

The context was stopped by software using a local-function DSC\_CXT\_STOP operation on the context.

#### 4.2.2.2 CXTV\_STOP\_FN State

The context was stopped either by a transition of the function state from GSV\_ACTIVE or a P2V.DSC\_CXT\_STOP operation.

#### 4.2.3 S3: CXTV\_ERR\_FN

This is the context error state; the function does not process descriptors and does not respond to doorbell writes for the context. The target context shall be stopped at a descriptor boundary. In response, software shall diagnose and correct any underlying error conditions. Once the error has been resolved, software may transition the target context to CXTV\_STOP\_[SW, FN]. This state can be entered from the following.

- CXTV\_STOPG\_[SW, FN] when the function detects an error in context processing or context execution.
- CXTV\_INVALID when the function is at GSV\_ACTIVE, CXT\_STS.state is CXTV\_ERR\_FN, and all context valid bits are "1".

If context 0 (Administrative context) enters this state, the function shall invalidate and exclude all privately cached copies of LVL\_L2 data associated with context 0 until the context is once again at CXTV\_RUN. (See "4.3.1, SDXI Memory-Based Data-Structure Hierarchy and Caching")

This state can transition to the following.

- CXTV\_STOP\_[SW, FN] when software changes CXT\_STS.state to CXTV\_STOP\_[SW, FN].
- CXTV\_INVALID when software changes one or more of the context valid bits to "0".
- CXTV\_UNDEF when software changes CXT\_STS.state to values other than CXTV\_ERR\_FN and CXTV\_STOP\_[SW, FN].

## 4.2.4 S4: CXTV\_RUN.[RDY, EXEC] States

In this state, the context has been enabled to process descriptors and responds to doorbell writes. This state can be entered from the following.

- CXTV\_STOP\_[SW, FN] when the function is at GSV\_ACTIVE and software performs either a successful DSC\_CXT\_START\_[NM, RS] operation or a change of CXT\_STS.state to CXTV\_RUN.
- CXTV\_INVALID when the function is at GSV\_ACTIVE, CXT\_STS.state is CXTV\_RUN, and all context valid bits are "1".

Entry to this state always enters the CXTV\_RUN.RDY substate.

The CXTV\_RUN state can transition to the following.

- CXTV\_STOPG\_FN when the function detects an error in context processing or context execution.
- CXTV\_STOPG\_FN when the function transitions to GSV\_STOPG.
- CXTV\_STOPG\_FN by a software-initiated, PF-to-VF-function, administrative DSC\_CXT\_STOP operation.
- CXTV\_STOPG\_SW by a software-initiated, local-function, administrative DSC\_CXT\_STOP operation.
- CXTV\_UNDEF when software changes CXT\_STS.state to any value or one or more of the context valid bits to "0".

#### 4.2.4.1 S4a: CXTV\_RUN.RDY State

In this substate of CXTV\_RUN, the context is enabled ("ready") to process descriptors, all previous context descriptors have been completed, and the function is evaluating whether new descriptors are available. The context may remain in this state until the function determines that there are available descriptors.

The SDXI function may transition the context to the CXTV\_RUN.EXEC (S4b) state to process available descriptors when the following conditions are true.

- 1. The context is in CXTV\_RUN.RDY.
- 2. The context's Doorbell register is updated externally or internally with a doorbell\_value as described in "4.3.3, Doorbell Register and Context Signaling"
- 3. The doorbell\_value is greater than Read\_Index and the doorbell\_value is less than or equal to Write\_Index.

#### 4.2.4.2 S4b: CXTV\_RUN.EXEC State

In this substate, the context is actively processing new descriptors. The function may return the context to CXTV\_RUN.RDY state when the last known doorbell\_value equals Write\_Index.

## 4.2.5 S5: CXTV\_STOPG\_[SW, FN] States

In these states, the function stops processing new descriptors for the context and waits for outstanding context descriptors to complete based on the completion wait policy.

- **Soft-Stop (SF) Wait**: The function shall wait until all outstanding descriptors complete naturally or until an implementation-defined timeout period has expired at which point any remaining descriptors are aborted.
- Hard-Stop (HD) Wait: The function waits less patiently for all outstanding descriptors to complete; and may abort outstanding descriptors more aggressively to speed up the completion wait. This policy can be applied by an initial stop action on the context or by later stop actions on the same context. Once applied, the completion policy cannot be relaxed while in the context is in this state.

Refer to CXTV\_STOPG\_SW and CXTV\_STOPG\_FN for more details.

#### 4.2.5.1 CXTV\_STOPG\_SW State

This state can be entered from the following.

• CXTV\_RUN by a software-initiated, local-function, administrative DSC\_CXT\_STOP operation.

This state can transition to the following.

- CXTV\_STOP\_SW by the function when it detects that no error occurred in context processing and context execution.
- CXTV\_ERR\_FN when the function detects an error in context processing or context execution.
- CXTV\_UNDEF when software changes CXT\_STS.state to any value or one or more of the context valid bits to "0".

#### 4.2.5.2 CXTV\_STOPG\_FN State

This state can be entered from the following.

- CXTV\_RUN when the function detects an error in context processing or context execution.
- CXTV\_RUN when the function transitions to GSV\_STOPG.
- CXTV\_RUN by a software-initiated, PF-to-VF-function, administrative DSC\_CXT\_STOP operation.

This state can transition to the following.

- CXTV\_STOP\_FN by the function when it detects that no error occurred in context processing and context execution.
- CXTV\_ERR\_FN when the function detects an error in context processing or context execution.
- CXTV\_UNDEF when software changes CXT\_STS.state to any value or one or more of the context valid bits to "0".

## 4.2.6 Context-Undefined Operation (CXTV\_UNDEF)

When privileged software directly and illegally modifies a context's CXT\_STS.state or context valid bits (CXT\_L2\_ENT.vl, CXT\_L1\_ENT.vl, and CXT\_CTL.vl) in memory, the context operation is architecturally undefined. Because the SDXI function may cache context control state and may still be executing outstanding operations issued before the illegal modification, the inconsistent or stale context control state may result in unexpected context operation. This can include the use of stale data, logged context errors, and an unexpected change of CXT\_STS.state due to context errors or previous outstanding administrative operations on the context.

Note that the effects of such undefined context operation are limited to only the results that may be produced by any valid context state. Furthermore, no other context nor the SDXI function shall be affected when a particular context has undefined operation.

Although correctly written privileged software shall never cause context-undefined operation, there is no method for software to detect it after the fact. In addition, for the benefit of performant SDXI implementations, the SDXI architecture does not require nor recommend that an SDXI function attempt to detect context-undefined operation caused by incorrect privileged software.

Context-undefined operation is exited when the context is stopped by any operation including DSC\_CXT\_STOP operations, function stop operations, and StopErr:Cxt (context or descriptor error actions).

# 4.3 Function and Context Operations

### 4.3.1 SDXI Memory-Based Data-Structure Hierarchy and Caching

SDXI-defined memory-based data structures are organized into a hierarchy of levels for the purposes of function and software management. The validity of a data structure is based on its explicit "valid (vI)" field in memory, if present, and its inherited validity derived from the validity of all higher levels in the hierarchy that point to it. A structure lacks inherited validity if any of these higher levels are invalid. (For example, a context level 1 entry can only be considered valid if its explicit CXT\_L1\_ENT.vI is "1", its associated higher level CXT\_L2\_ENT.vI is "1", and the SDXI function is at GSV\_ACTIVE or GSV\_STOPG\_[SF, HD].)

The SDXI function may cache these memory-based data structures using private caches that are not coherent with respect to processor caches. Data in both the valid and invalid states may be cached in this manner across all contexts within the function.

The memory-based data-structure hierarchy (smallest to largest) and its suitability for caching is shown below.

- 1. **Lowest-Level, Non-Cached Data:** the Write\_Index value and descriptors not located between Read\_Index and Write\_Index in the context descriptor ring. The validity of these structures is inherited from the validity of their associated CXT\_CTL.
- 2. Lowest Level, Cached Data: May be cached by the SDXI implementation.
  - a. LVL\_CXT\_STS: Single Context Status (CXT\_STS) entries and their fields. May be cached only when CXT\_STS.state is CXTV\_RUN or CXTV\_STOPG\_[SW, FN]. The validity of the structure is inherited from the validity of its CXT\_CTL.
  - b. **LVL\_AKEY**: Single AKey Table Entries. The validity of this structure is derived from its explicit "valid" field and the validity of its associated CXT\_L1\_ENT.
  - c. **LVL\_RKEY**: Single RKey Table Entries. The validity of this structure is derived from its explicit "valid" field and the validity of LVL\_FN.
  - d. **LVL\_DESCR:** descriptors with the valid (vI) field set to "1" that are located between Read\_Index and Write\_Index. The validity of such descriptors is further qualified by the validity of the associated CXT\_CTL.
- 3. LVL\_CXT\_CTL: Single Context Control (CXT\_CTL) entries and all subsidiary structures. This includes LVL\_CXT\_STS and LVL\_DESCR. These structures may be cached. The validity of this level is derived from the explicit CXT\_CTL.vI field and the validity of its associated CXT\_L1\_ENT.
- 4. LVL\_L1: Single CXT\_L1\_ENT entries and all subsidiary structures. This includes the LVL\_CXT\_CTL and all LVL\_AKEY associated with the CXT\_L1\_ENT. These structures may be cached. The validity of this level is derived from the explicit CXT\_L1\_ENT.vl field and the validity of its associated CXT\_L2\_ENT.
- 5. **LVL\_L2**: Single CXT\_L2\_ENT entries and all subsidiary structures. This includes all LVL\_L1 associated with the CXT\_L2\_ENT. These structures may be cached. The validity of this level is derived from the explicit CXT\_L2\_ENT.vl field and the validity of its associated LVL\_FN.
- LVL\_FN: All memory-based fields of the function. This includes all LVL\_L2 and LVL\_RKEY. These structures may be cached as described above. Explicitly valid when the SDXI function is at GSV\_ACTIVE or GSV\_STOPG\_[SF, HD].

#### 4.3.1.1 SDXI Update Operations for Modified Data Structures

When software has modified a memory-based data structure that may be privately cached by the function, software shall appropriately signal the function to update its internal copies of that data structure, if any. The SDXI signaling operations and mechanisms used by software for this purpose are described in the list below. The software procedures using these operations are described in *"4.3.1, SDXI Memory-Based Data-Structure Hierarchy and Caching"* 

- 1. DSC\_UPD\_[CXT, AKEY, RKEY, FN] descriptor operations. These signal the function to start the updating of all internal copies of a memory-based data structure and its subsidiary structures. The operations are scoped to match the hierarchy of memory-based data-structures described in "4.3.1, SDXI Memory-Based Data-Structure Hierarchy and Caching".
  - a. The SDXI implementation may perform the update by re-acquiring a copy of the data from system memory or by simply invalidating all internal copies of the data in its private caches. Software shall not rely upon how the update is performed.
  - b. Note that a successful completion of a DSC\_UPD operation only indicates that background update actions shall be started; it does not indicate that the update actions have completed nor that their effects are visible. Software shall use the DSC\_SYNC operation to ensure that the background update actions have completed.
- 2. DSC\_SYNC.[CXT, AKEY, RKEY, FN] descriptor operation. This operation synchronizes with the completion of the appropriately scoped background update actions initiated by previous DSC\_UPD operations. Once the function completes a DSC\_SYNC operation, the function shall have performed all associated updates.
- Context Stop. Whenever a context is stopped, the SDXI function shall invalidate all internal copies of the context's CXT\_STS and ensure that the function excludes re-acquiring a copy of CXT\_STS until the context is once again at CXTV\_RUN.
- 4. A local DSC\_CXT\_STOP operation with cxt\_start = 0 and cxt\_end = 0 shall invalidate all LVL\_L2 data associated with context 0 and ensure that the function excludes re-acquiring a copy of the LVL\_L2 data until context 0 is once again at CXTV\_RUN.

Note, that for any requested level of the memory-based data-structure hierarchy to update, an SDXI implementation may update (or invalidate and exclude) any associated higher level of the hierarchy.

#### 4.3.1.2 Software Procedure For Modifying Memory-Based Data Structures

When software modifies a memory-based data structure that can be privately cached by the SDXI function, it shall follow the procedures described in this section to ensure that the modification is performed correctly.

#### Procedure For Modifying A Memory-Based Data Structure:

- 1. If the data structure is not allowed to be cached by the SDXI function, software may modify it without restriction and no additional steps are required.
- 2. If a data structure does not currently possess inherited validity, software may modify it without restriction. Software then proceeds to step 5 in this flow.
  - a. If a hierarchy of structures are being transitioned from invalid to valid, software should update the hierarchy from the smallest scope to the largest to benefit from this fact.
- 3. If a data structure with an explicit valid field has inherited validity and the modification will transition the explicit field from "invalid" to "valid", software shall ensure that only the last (terminal) write to the structure causes the transition. Furthermore, software shall ensure that all earlier writes to the structure, if any, are made globally visible before such a terminal write. Software then proceeds to step 5 in this flow.
  - a. Example: for a valid context, transitioning an unused AKEY entry from invalid to valid.
- 4. The following steps apply to a data structure that is considered valid -- it has inherited validity and its explicit valid field, if present, is true. (See "4.3.1, SDXI Memory-Based Data-Structure Hierarchy and Caching").
  - a. If the data structure is encompassed by LVL\_CXT\_CTL and the CXT\_CTL is explicitly valid, software shall only modify the structure when the associated CXT\_STS.state is CXTV\_STOP\_[SW, FN] or CXTV\_ERR\_FN. Software then proceeds to step 5 in this flow.
    - i. Example: modifying CXT\_CTL.cxt\_sts\_ptr for a valid context.
  - b. For all other data structures, if the-structure lacks an explicit valid field, software shall only modify it after software has properly invalidated the data structure that points to it (the hierarchy level above it). Software then proceeds to step 5 in this flow.
    - i. Example: modifying CXT\_L1\_ENT.cxt\_ctl\_ptr for a valid context requires that CXT\_L1\_ENT.vl be set to invalid.
  - c. For all other data structures, if the structure has an explicit valid field set to "valid", and software ensures that the structure is not actively in use by any context nor the SDXI function, software shall ensure that the first write to the structure transitions it to an invalid state before further modification. Furthermore, software shall ensure that this write shall be made globally visible before any subsequent write to the structure. Software then proceeds to step 5 in this flow. Further notes below.
    - i. A software mechanism must exist for how software ensures that a data structure outside of LVL\_CXT\_CTL is not in use. That mechanism is outside the scope of this specification.
    - ii. Example: invalidating a valid CXT\_L1\_ENT requires that software ensure that it is not in use before the invalidation.
    - iii. Example: invalidating a valid AKEY\_ENT requires that software ensure that it is not in use before the invalidation.

#### Update and Synchronization Of A Modified Memory-Based Data Structure:

- 5. Software shall issue a DSC\_UPD operation targeting the modified data-structure hierarchy level. Software may target a higher level for the update, but shall never target a lower level below the actual modified data structure.
  - a. Example: For a valid CXT\_L2\_ENT, modifying a member CXT\_L1\_ENT requires software to issue at least a DSC\_CXT\_UPD.L1 operation, but DSC\_CXT\_UPD.L2 is also allowed.
  - b. Example: For a valid CXT\_L1\_ENT, modifying a member AKEY\_ENT requires software to issue at least a DSC\_AKEY\_UPD operation, but DSC\_CXT\_UPD.L1 and DSC\_CXT\_UPD.L2 are also allowed.
- 6. Software shall then issue a DSC\_SYNC operation targeting the modified data-structure hierarchy level and wait for its completion. Software may target a higher level for the update but, shall never target a lower level below the actual modified data structure.
  - a. Example: For a valid CXT\_L2\_ENT, modifying a member CXT\_L1\_ENT requires software to issue at least a DSC\_SYNC.CXT operation, but DSC\_SYNC.FN is also allowed.
  - b. Example: For a valid CXT\_L1\_ENT, modifying a member AKEY\_ENT requires software to issue at least a DSC\_SYNC.AKEY operation, but DSC\_SYNC.CXT and DSC\_SYNC.FN are also allowed.

## 4.3.2 Check Valid Context

When an SDXI function is starting or stopping a context, the function shall check that the key memory data structures of the context are read accessible, and enabled or valid. This check is referred to as the "ChkValid:Cxt" action. When the check fails in the flow below, no further steps in the flow are performed, and the fail signature returned shall be either LogErr:Cxt or Invalid:Cxt (see "3.4, Error Log" for details.)

- 1. The function ensures the following; if any are not true, then the check fails with LogErr:Cxt and these set of steps are exited.
  - a. MMIO\_CTL2.max\_cxt is less than or equal to MMIO\_CAP1.max\_cxt.
  - b. The context number is less than or equal to MMIO\_CTL2.max\_cxt.
- 2. The function shall ensure that it is not using stale values of the following context structures:
  - a. Context Status (which includes ContextState and Read\_Index) and Write\_Index.
- The function shall access and validate the following list of structures. The function shall use naturally-aligned atomic accesses to read these structures from memory if not cached. If the function cannot access a specified structure, then the check fails with LogErr:Cxt and these set of steps are exited.
  - a. Access the valid bit for the context level 2 table entry (CXT\_L2\_ENT.vI) that corresponds to the context number; if CXT\_L2\_ENT.vI is "0" then the check fails with Invalid:Cxt and these set of steps are exited.
  - b. If CXT\_L2\_ENT.vI is "1", access CXT\_L2\_ENT.LvI\_1\_ptr and use it to access the valid bit for the context level 1 table entry (CXT\_L1\_ENT.vI) that corresponds to the context number. If CXT\_L1\_ENT.vI is "0" then the check fails with Invalid:Cxt and these set of steps are exited.
  - c. If CXT\_L1\_ENT.vl is "1", access CXT\_L1\_ENT.pv, CXT\_L1\_ENT.cxt\_pasid, and CXT\_L1\_ENT.cxt\_ctl\_ptr. Use these to access the valid bit for the Context Control (CXT\_CTL.vl). If CXT\_CTL.vl is "0" then the check fails with Invalid:Cxt and these set of steps are exited.
  - d. If CXT\_CTL.vl is "1", verify access to these memory locations:
    - i. The starting memory address pointed to by CXT\_CTL.ds\_ring\_ptr,
    - ii. The starting memory address pointed to by CXT\_CTL.cxt\_sts\_ptr,
    - iii. The starting memory address pointed to by write\_index\_ptr.
    - iv. If CXT\_CTL.vl is "1", read CXT\_STS.state; if it is a reserved encoding then the check fails with LogErr:Cxt and these set of steps are exited.

If the check fails, the function shall invalidate and exclude the caching of the context's Context Status (CXT STS) and take other operation-specific actions as required.

## 4.3.3 Doorbell Register and Context Signaling

As new descriptors are written to a descriptor ring, software updates the context's Write\_Index location in system memory as appropriate. Because the SDXI function operates asynchronously with respect to a context and the Write\_Index location is in system memory, software shall directly signal the function after updating Write\_Index and the descriptor ring to ensure new descriptors are evaluated. Software signals the function by ensuring the update of the final Write\_Index value, the doorbell\_value, to a per-context, MMIO-mapped, architectural Doorbell register when the context is in CXTV\_RUN. (See "9.7, Doorbell Sections and Registers".) The Doorbell register may be updated by software by the following mechanisms.

- Externally updated when the context is in CXTV\_RUN and software writes a doorbell\_value to the context's Doorbell register.
- Internally updated when the context is transitioned to the CXTV\_RUN state by software issuing for the target context a DSC\_CXT\_START\_[NM, RS] operation that specifies a doorbell\_value.

The Doorbell register may also be internally updated by the function when the context is in CXTV\_RUN and an SDXI implementation optionally chooses to read Write\_Index and Read\_Index from memory. Regardless of the SDXI implementation, software shall always issue Doorbell register writes when updating Write\_Index in order to ensure correct operation.

The SDXI function evaluates a doorbell\_value by the following ordered steps.

- 1. If the context is not in CXTV\_RUN state at the time the Doorbell register is implicitly or explicitly written, the doorbell\_value is ignored and the function logs no error and takes no action based on it; no further steps are done. Otherwise, proceed to the next step.
- 2. If the doorbell\_value is supplied by a DSC\_CXT\_START\_[NM, RS] operation and the value is 0xFFFF\_FFFF\_FFFF\_FFFF, then the function shall discard this value and fetch the doorbell\_value from the context's Write Index location.
- 3. If the doorbell\_value is supplied by any other method, then it is implementation-dependent if the function uses the supplied doorbell\_value or discards it and fetches doorbell\_value from the context's Write Index location.
- 4. If the doorbell value is less than or equal to previous valid doorbell\_value values received since the context was started, the function ignores the doorbell\_value and takes no action. (This determination is reset every time the context is started.)
- 5. If the doorbell\_value is less than or equal to the context's Read\_Index value, it is implementationdependent whether the function accesses the context's descriptor ring.
- 6. If the doorbell\_value is greater than the context's Read\_Index value, the function will examine the context's descriptor ring starting at Read\_Index for descriptors to execute.
- 7. It is implementation-dependent whether the function uses cached values of context data structures in determining the location of the Read\_Index and Write\_Index locations in memory.
- 8. It is implementation-dependent whether the function logs an error if any part of the context needed to access the Read\_Index and Write\_Index locations is invalid or inaccessible.

There is no time requirement as to when the function acts on a supplied doorbell\_value supplied by the above methods.

Software shall make no assumptions as to the implementation of the Doorbell register.

## 4.3.4 Starting A Context and Context Signaling

A context must be in the CXTV\_RUN state for an SDXI function to evaluate it for new descriptors to execute. Furthermore, because of the asynchronous nature of an SDXI function, the function must be signaled when a CXTV\_RUN context has new descriptors to execute. Each of these is an independent mechanism.

- Starting a Context: this is the mechanism of transitioning a valid context's CXT\_STS.state to CXTV\_RUN. Starting a context does not ensure that the function evaluates it for new descriptors to execute.
- Context Signaling: this is the mechanism of signaling the function that a context should be evaluated for new descriptors to execute using the specified doorbell\_value. The signaling does not imply that the context is valid or at CXTV\_RUN; the SDXI function will determine this independently and take appropriate action.

Building on the above concepts, there are several methods for software to start a target context within a target function and ensure that new descriptors are evaluated thereafter in a timely manner. Each of these may be combined with context signaling to ensure that new descriptors are evaluated by the function

- 1. Software issues a DSC\_CXT\_START\_[NM, RS] operation from the function's administrative context to start a context in the same function using an appropriate doorbell\_value. (See "6.6.3, AdminGrp DSC\_CXT\_START\_[NM, RS] Operations") This is the preferred method for all contexts except context 0, the administrative context.
- 2. From the PF, software issues a DSC\_CXT\_START\_[NM, RS] operation targeting another function's context with an appropriate doorbell\_value. This method is intended primarily for hypervisor usage.
- 3. "Jump Start 1": set the target context's CXT\_STS.state in memory to CXTV\_RUN when in the appropriate state (See "4.3.1, SDXI Memory-Based Data-Structure Hierarchy and Caching") and then write an appropriate doorbell\_value to the context's Doorbell register. (See "4.3.3, Doorbell Register and Context Signaling") This is the recommended way within a function to start context 0, the administrative context.

Using the DSC\_CXT\_START\_[NM, RS] operation ensures that the function is not using stale values of key context structures. (See "6.6.3, AdminGrp DSC\_CXT\_START\_[NM, RS] Operations")

Using a "Jump Start" method requires that software explicitly ensure there are no in-flight doorbell writes before changing CXT\_STS.state to CXTV\_RUN, and that there is no stale context data before writing the doorbell register. Stale context data for a valid context may be avoided by using DSC\_UPD and DSC\_SYNC operations against the function or specific context (see "6.6.9, AdminGrp DSC\_SYNC Operation"). In the case of context 0, stopping the SDXI function ensures stale context and in-flight doorbells are flushed (see "3.5, Administrative Context (Context 0)"). When a function transitions from GSV\_STOP to GSV\_ACTIVE state, the function shall have no stale context data cached for that context. Software shall ensure that the function's context 0, the administrative context, is started before operating on other function contexts.

## 4.3.5 Function and Context Stop Actions

An SDXI function uses context stop actions to stop processing of new descriptors and to wait for previously issued descriptors to complete for targeted contexts. A stop action shall complete in a timely manner so that further administrative operations on the targeted contexts may be performed. The following SDXI context-stop actions are defined below.

- LOC.DSC\_CXT\_STOP.[SF, HD]: This is a software-initiated, local-function, administrative DSC\_CXT\_STOP operation. The operation's Hard Stop (HS) bit indicates "SF" when HS = 0, and "HD" when HS = 1. The function shall invalidate and exclude the caching of the Context Status (LVL\_CXT\_STS). See "4.3.1, SDXI Memory-Based Data-Structure Hierarchy and Caching".
- P2V.DSC\_CXT\_STOP.[SF, HD]: This is a software-initiated, PF-to-VF-function, administrative DSC\_CXT\_STOP operation. The operation's Hard Stop (HS) bit indicates "SF" when HS = 0, and "HD" when HS = 1. The function shall invalidate and exclude the caching of the Context Status (LVL\_CXT\_STS). See "4.3.1, SDXI Memory-Based Data-Structure Hierarchy and Caching".
- 3. **StopErr:Cxt**: The function detects an error: in processing or executing a descriptor; or in context operation. In this case the function shall initiate a background context stop action (StopErr:Cxt) and log a descriptor error (LogErr:Cxt). If the error relates to a specific descriptor, the function shall signal an error in the descriptor's completion status block (Signal\_Err\_Csb). The function shall invalidate and exclude the caching of the Context Status (LVL\_CXT\_STS).
- 4. **Stop[SF, HD]:Fn**: The function performs a stop action in response to a software-initiated stop of the function. (Software initiates stopping the function by writing GSRV\_STOP\_[SF, HD] to MMIO\_CTL0.fn\_gsr.) The function shall invalidate all cached memory-based data (LVL\_FN). The function state will change based on this action; it is discussed in *"4.1.1, G0: GSV\_STOP State"*.

**Note:** HaltErr:Fn is a function-initiated halt and abort of all function activity in response to a functionwide error; it is not a stop-action. See "4.1.6, G5: GSV\_ERROR State" for more details.

Stop actions apply one of the below completion-wait policies to each target context.

- **Soft-Stop (SF) Wait**: The function shall wait until all outstanding descriptors complete naturally or until an implementation-defined timeout period has expired at which point any remaining descriptors are aborted. The DSC\_CXT\_STOP.SF, StopSF:Fn, and StopErr:Cxt actions apply this policy
- Hard-Stop (HD) Wait: The function waits less patiently for all outstanding descriptors to complete; and may abort outstanding descriptors more aggressively to speed up the completion wait. Used as a last resort to stop the context. The DSC\_CXT\_STOP.HD and StopHD:Fn actions apply this policy.

When an outstanding descriptor is aborted because of a context stop action: its completion status indicates an error; an error is logged; and CXT\_STS.state shall be set to CXTV\_ERR\_FN.

Because of the asynchronous nature of the SDXI function, the function may be performing multiple background stop actions concurrently against the same context or across different contexts. Some examples follow.

- Software may initiate a subsequent stop operation with hard-stop completion if a previous stop operation with soft-stop completion is not making acceptable forward progress.
- All of the following can be concurrent on the same context: a StopErr:Cxt initiated by the function; a LOC.DSC\_CXT\_STOP operation submitted by software administering a VF; and a Stop:Fn action requested by a hypervisor managing the function.

Unless these actions are explicitly synchronized by software, the SDXI function enforces no ordering among them. However, for a single context, the function ensures the following.

- If the context is already at CXTV\_ERR\_FN or CXTV\_STOP\_[SW, FN], subsequent stop actions are ignored.
- If the context is already at CXTV\_STOPG\_[SW, FN], future stopping actions may only strengthen the completion wait policy to hard-stop.
- Only one stop action can transition a context from CXTV\_RUN to CXTV\_STOPG\_[SW, FN].

For each context targeted by a given stop-action operation, the function executes the following ordered set of steps below; the steps are also shown in *"Figure 4-3: Context Stop Action Flow"*.

- K0. The function records the requested completion-wait policy (req\_wait) specified by the stop action.
  - a. The function records req\_wait as "SF", if the stop action is DSC\_CXT\_STOP.SF, StopSF:Fn, or StopErr:Cxt.
  - b. The function records req\_wait as "HD", if the stop action is DSC\_CXT\_STOP.HD or StopHD:Fn.
- K1. The function shall perform a Check Valid Context action (ChkValid:Cxt) described in "4.3.2, Check Valid Context".
- K2. If the check fails, the function shall end this stop action on the context and take further action described below.
  - a. If the stop action is Stop[SF, HD]:Fn, take no further action.
  - b. If the stop action is a StopErr:Cxt, initiate a HaltErr:Fn -- because the function should not initiate a StopErr:Cxt on a context that fails ChkValid:Cxt.
  - c. If the stop action is a DSC\_CXT\_STOP with a ChkValid:Cxt failure signature of LogErr:Cxt, then log the appropriate error.
  - d. If the stop action is a DSC\_CXT\_STOP with a ChkValid:Cxt failure signature of Invalid:Cxt, take no further action.
- K3. If the context is already at CXTV\_STOP\_[SW, FN] or CXTV\_ERR\_FN, the function shall end this stop action on the context.
- K4. If the context is not yet at CXTV\_STOPG\_[SW, FN] (which means that another stop-action has not yet reached this context), the function shall proceed to step K6. Otherwise, the function shall continue to the following step.
- K5. The function shall set the current completion-wait policy (curr\_wait) to the requested completionwait policy (req\_wait) as described below, and end this stop action on the context.
  - a. If req\_wait is "HD" (hard completion wait), the function shall ensure that curr\_wait is set to "HD".
  - b. If req\_wait is "SF" (soft completion wait), the function shall not change curr\_wait. (Thus, once a hard completion wait policy is applied, it cannot be overridden.)
- K6. At this point, only one stop action shall be operating on the context. The function shall atomically write a "stopping" value to CXT\_STS.state as described below, and then continue to the following step.
  - a. The function shall atomically write CXTV\_STOPG\_SW, if the stop action is a LOC.DSC\_CXT\_STOP.[SF, HD].
  - b. The function shall atomically write CXTV\_STOPG\_FN, if the stop action is a P2V.DSC\_CXT\_STOP.[SF, HD], Stop[SF, HD]:Fn, or StopErr:Cxt.
- K7. The function shall terminate context execution at a descriptor boundary and initialize the context's current completion-wait policy (curr\_wait) to the requested completion-wait policy (req\_wait).
- K8. The function then performs the stopping of the context using the current completion-wait policy (curr\_wait). While the function is performing this step, any change of the context's curr\_wait to

"HD" by other stop actions on the same context shall take effect during this step at an implementation-defined point.

- K9. The function returns to step K8 until one of the following mutually exclusive conditions occur.
  - a. The context is stopped: the function goes to step K10.
  - b. The function detects an implementation failure in stopping the context: the function initiates a HaltErr:Fn action and shall then complete this stop action on the context.
- K10. The function shall invalidate and exclude all internal copies of the following data structures and ensure that the function excludes re-acquiring a copy of these data structures until the context is once again at CXTV\_RUN. If successful, the function goes to step K11. If unable to invalidate and exclude copies of the specified data structures due to an implementation error, the function initiates a HaltErr:Fn action and shall then complete this stop action on the context.
  - a. The context's CXT\_STS.
  - b. The associated LVL\_L2 for context 0.
- K11. If the stop action is StopErr:Cxt or a context error was detected during the stopping of the context, the function goes to step K13.
- K12. The function shall atomically write a "stopped" value to CXT\_STS.state as described below, and then complete this stop action on the context.
  - a. The function shall atomically write CXTV\_STOP\_SW, if the current CXT\_STS.state is CXTV\_STOPG\_SW.
  - b. The function shall atomically write CXTV\_STOP\_FN, if the current CXT\_STS.state is CXTV\_STOPG\_FN.
- K13. The function shall log the appropriate error (LogErr:Cxt) and then atomically write "CXTV\_ERR\_FN" to CXT\_STS.state. The function shall then complete this stop action on the context.

Although not shown in the above per-context stop steps, when the function performs a Stop[SF, HD]:Fn action, it shall invalidate all privately-cached memory-based data (LVL\_FN) when it completes the operation.

There may be a long latency from when a context stop action is initiated by privileged software to when the context is actually stopped by the function due to many descriptors that may still be in execution. If privileged software knows in advance that it will later stop the context, it may hint to the function earlier that it should start "draining" (stop executing new descriptors) the context by clearing the context's CXT\_L1\_ENT.ka (Keep Active Hint) to "0". This may reduce the time that software must wait for stopping the context later.

Figure 4-3: Context Stop Action Flow


## 4.3.6 Context Ring Submission Hint

When privileged software is momentarily stopping a context to change privileged state, it may want to hint to other software that new descriptors may still be submitted to a context during the interim. This hint may minimize the impact to software using the context while the context is stopping or stopped. To ensure a consistent software implementation, SDXI defines the ring submission hint (rsh) in the Context Status (CXT\_STS.rsh). This bit is solely used by software and has no impact on the SDXI function, which never uses nor writes it.

The ring submission hint is intended to only be valid to software when the CXT\_STS.state has a value of CXTV\_STOPG\_SW or CXTV\_STOP\_SW; it should be ignored otherwise by software. Software shall write CXT\_STS.rsh with a byte-sized access only; this avoids conflicting writes by the function to CXT\_STS.state, which is in a different byte.

- When CXT\_STS.rsh = 1, software may continue to submit new descriptors to the context.
- When CXT\_STS.rsh = 0, software is encouraged to not submit new descriptors. See the below table for intended operation.

| CXT_STS.rsh<br>(bit 8) | CXT_STS.state<br>(bits 3:0) | Definition                                                                                          |
|------------------------|-----------------------------|-----------------------------------------------------------------------------------------------------|
| 0                      | 0b0000                      | Context is in CXTV_STOP_SW more permanently. Software is encouraged to not submit new descriptors.  |
| 0                      | 0b0010                      | Context is in CXTV_STOPG_SW more permanently. Software is encouraged to not submit new descriptors. |
| 1                      | 0b0000                      | Context is in CXTV_STOP_SW temporarily. Software may submit new descriptors during the interim.     |
| 1                      | 0b0010                      | Context is in CXTV_STOPG_SW temporarily. Software may submit new descriptors during the interim.    |
| Х                      | 0b0001                      | Context is in CXTV_RUN. Software may submit new descriptors.                                        |
| Х                      | Any Other Value             | Software may not submit descriptors.                                                                |

| Table 4–1: | CXT | _STS.rsh | Mappings |
|------------|-----|----------|----------|
|------------|-----|----------|----------|

Software may use this procedure to determine if it should submit new descriptors for a context:

```
// Let csx = CXT_STS[8:0]
// Let rsh = csx & 0b100000000
// Let cse = csx & 0b1111
check0 = ( cse == CXTV_STOP_SW || cse == CXTV_STOPG_SW );
if ( rsh && check0 || cse == CXTV_RUN ){
    submit_work();
}
```

Privileged software may use many approaches to configuring CXT\_STS.rsh. A simple approach is for privileged software to set CXT\_STS.rsh to "1" before starting the context for normal usage and to leave it unchanged afterwards until the context is no longer available for normal usage. Thus, CXT\_STS.rsh never needs to be changed during the small interims when privileged software is modifying privileged context state.

## 4.4 Atomic Operation Support

The usage of atomic operations by the SDXI function depends on both the SDXI function supporting those operations ("function-supported atomic operations") and the capability and configuration of the processor and platform interface connections to the SDXI function to support those operations ("interface-supported atomic operations"). Privileged software shall verify both before exposing an atomic operation to an SDXI context. The following section gives more detail.

An SDXI implementation may support none, some, or all of the following function-supported atomic operations; each set of operations has an enumeration bit in the SDXI MMIO register space.

- 1. Atomic accesses to the the completion status block (CST\_BLK) associated with an SDXI descriptor operation; enumerated by MMIO\_CAP0.cs\_cap. See "4.4.1, Completion-Status Capabilities".
- 2. The full set of descriptors in the atomic operations group; enumerated by MMIO\_CAP1.opb\_000\_cap[3].
- 3. The minimal set of descriptors in the atomic operations group; enumerated by MMIO\_CAP1.opb\_000\_cap[5].

An SDXI implementation may connect to its hosting hardware platform and other SDXI functions through one or more connection interfaces; examples include but are not limited to PCIe, CXL, and proprietary interfaces. Interface-supported atomic operations to and from the SDXI function may not exist or may not be configured. Platform and interface-specific mechanisms are required to determine if the capability is available and enabled. For example, see "*8.4, PCIe Atomic Capabilities Discovery and Enablement*" for details of how privileged software can determine this for an SDXI function using a PCIe interface.

Privileged software shall verify **both** of the following before exposing an atomic operation to an SDXI context: interface-supported atomic operations are supported and enabled between the SDXI function and any relevant target; both the SDXI function and any relevant target have the relevant function-supported atomic operations.

### 4.4.1 Completion-Status Capabilities

Each SDXI descriptor operation may specify a completion status block ("CST\_BLK") whose fields are used by the SDXI function to update (modify) the descriptor operation's completion status and report relevant errors. The producer of the descriptor initializes the CST\_BLK before submitting the descriptor. (See *"6.1.2, Completion Status Block"*.)

The SDXI function shall support modifying the CST\_BLK with respect to other accesses either atomically or non-atomically; it may also support both capabilities and allow the producer to choose between the two using a descriptor control flag described in *"4.4.2, Completion-Status Modes"*. The SDXI function reports these capabilities in the MMIO\_CAP0.cs\_cap field as shown in the below table The details of the capabilities follow in the list below.

| MMIO_CAP0.cs_cap | Completion Status Capability                                             |
|------------------|--------------------------------------------------------------------------|
| 00b              | Atomic completion-status capability supported.                           |
| 01b              | Reserved                                                                 |
| 10b              | Both atomic and non-atomic completion-status capabilities are supported. |
| 11b              | Non-atomic Completion-Status capability supported.                       |

### Table 4–2 Completion-Status Capabilities

1. Function: Atomic-Completion-Status Capability: the SDXI function supports atomic modification of the CST\_BLK. when MMIO\_CAP0.cs\_cap is "00b" or "10b".

- The function shall perform modification of the CST BLK and decrementing of the a. CST BLK.signal field atomically with respect to other accesses of the same. The capability depends on interface-supported atomic operations.
- An atomic decrement of 0 in CST BLK signal results in 0xFFFF FFFF FFFF FFFF. b.
- If the function only supports the atomic completion-status capability but does not have access C. to interface-supported atomics, then privileged software must not enable context execution for the SDXI function.
- Function: Non-Atomic Completion-Status Capability: the SDXI function supports modification of the 2. CST BLK without using atomic operations when MMIO CAP0.cs cap is "10b" or "11b".
  - The function shall perform the following without using atomic operations: modification of the a. CST BLK; and transitioning the CST BLK.signal to a terminal value of "0". The function shall not depend on interface-supported atomic operations to implement the capability. The function may not choose to evaluate the initial value of CST\_BLK.signal.
  - If the producer does not ensure that the initial value of CST BLK.signal is "1", the function b. may return any terminal result in CST BLK.signal; however, it will not generate an error.

#### 4.4.2 **Completion-Status Modes**

Based on the completion-status capabilities described in section "4.4.1.Completion-Status Capabilities". privileged software determines the related, but not identical, completion-status mode availability for each context. It exposes the availability of these modes through CXT CTL.csa ("completion-status mode availability"); note, however, that this field is informational only and not enforced by the SDXI function. Privileged software shall also communicate and negotiate the available modes to the producer for a given context. This communication is outside the scope of the SDXI specification, but a suggested mechanism is shown in "8.4, PCIe Atomic Capabilities Discovery and Enablement". The following discussion describes the availability modes and how they are used.

| CXT_CTL.csa | Definition                                                                                         |
|-------------|----------------------------------------------------------------------------------------------------|
| 0b          | Both atomic completion-status mode and simple completion-status mode are available to the context. |
| 1b          | The context shall only use simple completion-status mode.                                          |

| Table 4–3 | Completion-Status Mode Availability |
|-----------|-------------------------------------|
| Definit   | lan                                 |

| Table 4–4 | <b>Completion-Status</b> | Mode | Requirement |
|-----------|--------------------------|------|-------------|
|           | oumpletion-otatus        | mouc | Neguiement  |

| Descriptor "csr" field | Definition                                                  |
|------------------------|-------------------------------------------------------------|
| 0b                     | The descriptor requires atomic completion-status mode.      |
| 1b                     | The descriptor requires only simple completion-status mode. |

3.

- 1. Atomic Completion-Status Mode: the producer configures and requires descriptor completion using the atomic completion-status capability.
  - a. Privileged software indicates the availability of this mode to a context by setting CXT\_CTL.csa to "0"; but shall only do so when the function has both atomic completion-status capability (MMIO\_CAP0.cs\_cap is "00b" or "10b") and access to interface-supported atomic operations. If the mode is not available, but the context requires the mode, privileged software cannot enable execution of the context.
  - b. When the mode is exposed to a context, the producer shall set a descriptor's "csr" ("completion status requirement") field to "0" if the atomic completion-status capability is required for the CST\_BLK.
  - c. There is no function-enforced terminal value for CST\_BLK.signal in this mode; however, it is recommended that the producer use "0" as the terminal value.
  - d. When using this mode for a descriptor, the producer may update the CST\_BLK atomically even after the associated descriptor has been issued.
  - e. If an SDXI function does not support the atomic completion-status capability, the function shall abort and log an error for a descriptor with the "csr" field set to "0".
  - f. When atomic CST\_BLK access is not required, this mode is not recommended; use the simple completion-status mode instead for greater compatibility across all SDXI implementations.
- 2. Simple Completion-Status Mode: the producer configures and requires descriptor completion in a manner that complies with the non-atomic completion-status capability, but which may be satisfied by either completion capability.
  - a. This mode is always available for a context and is supported by both completion capabilities. Privileged software may recommend that the context use only this mode by setting CXT\_CTL.csa to "1". Privileged software must expose this mode when only non-atomic completion-status capability is supported by the SDXI function.
  - b. The producer uses this mode by setting a descriptor's "csr" ("completion status requirement") field to "1" to indicate that atomic completion-status of the CST\_BLK is not required. Furthermore, the producer shall set the initial value of CST\_BLK.signal to "1" before submitting the descriptor. Lastly, the producer detects completion when CST\_BLK.signal has the terminal value of "0".
  - c. The producer shall not rely upon atomic access of the CST\_BLK nor the manner in which the SDXI function modifies it.
  - d. The SDXI function shall use the non-atomic completion-status capability for this mode when available; otherwise, it shall use the atomic completion-status capability. This mode is the most compatible mode across all SDXI implementations.
  - e. This mode is preferred when both modes are available to the context (CXT\_CTL.csa is "0") and a descriptor does not require atomic CST\_BLK access.



Figure 5-1: An SDXI Function Context

An SDXI function provides programmed-data acceleration by reading and executing a series of memorybased, naturally-aligned, 64-byte "descriptors". Each descriptor's "opcode" field encodes a requested operation and the remainder of the descriptor specifies additional parameters. (Descriptor operations are further discussed in *"5.1, Descriptor Operations"*). Descriptors are placed in a circular ring buffer that starts at a specified address (CXT\_CTL.ds\_ring\_ptr). The ring is contiguous at the translation level configured for the SDXI function. The ring is configured to contain a given number of descriptor entries (CXT\_CTL.ds\_ring\_sz). The number of bytes allocated in memory for the ring is: (CXT\_CTL.ds\_ring\_sz \* 64).

The ring and all its related system memory data structures comprise a "context". SDXI uses a 2-level hierarchy of context tables (Context Table Level 2, Context Table Level 1) to enumerate the components of the context. The concatenation of the offsets used to enumerate a context in both Context tables yields the 16-bit "context\_number" that is associated with the context. Note that the Context Tables point to a context but are not themselves part of the context. An SDXI function can support multiple concurrent contexts. Contexts are classified into two types: an "unprivileged" type used directly by user applications for data movement; and "administrative" contexts that can be used by privileged software to control all contexts supported by a SDXI function.

An SDXI function may optionally generate DMA requests with PASID when accessing non-context data structures, as well as Context Control Entries and AKey Table Entries. This is controlled through the MMIO\_CTL0.fn\_pasid\_vI and MMIO\_CTL0.fn\_pasid fields.

A SDXI function may optionally generate DMA requests with PASID when accessing a context's descriptor ring, Write\_Index, CXT\_STS, and Completion Status Block structures. This is controlled through the CXT\_L1\_ENT.pv and CXT\_L1\_ENT.ctx\_pasid fields.

A SDXI function may optionally generate DMA requests with PASID when accessing data buffers. This is controlled through the associated AKEY\_ENT.pv and AKEY\_ENT.pasid fields.

A circular ring requires "start" and "end" indicators. Rather than using memory pointers to track these, a SDXI descriptor ring uses 64-bit \*unsigned\* logical indices to indicate the start (Read\_Index) and end (Write\_Index) of the descriptor ring. This simplifies various calculations for SW and HW alike. The logical indices need only be mapped onto descriptor ring addresses when writing or reading a ring entry at a given Index.

• An SDXI descriptor ring shall be contiguous within the address space used to access it. This is determined by the function's PCI requester ID and optional PASID.

CXT\_CTL.ds\_ring\_ptr indicates the start address of the ring within this address space.

The logical Read\_Index and Write\_Index indices map to ring addresses when writing or reading a ring entry at a given Index. An index N, maps to an address in the ring by:

• ring\_entry\_address = CXT\_CTL.ds\_ring\_ptr + ((N % CXT\_CTL.ds\_ring\_sz) << 6)

The indices are not expected to reach or exceed (2\*\*64)-1 in practice

After software increments Write\_Index and adds entries to the ring, it shall write the context's Doorbell register with the Write\_Index value; there is no architectural guarantee that new entries on the ring are processed without this important write.

If software wants to add N entries to the descriptor ring, it must ensure that:

• Write\_Index + N - Read\_Index <= CXT\_CTL.ds\_ring\_sz

For an enabled context, Read\_Index is constantly being updated by the SDXI function as new descriptors are processed. Any value read by software of the Read\_Index could be stale, but this is not a problem. Since Read\_Index always increases and never wraps in the normal case, a calculation using a stale value of Read\_Index will only cause a more conservative understanding of the number of available indices/entries by software. It will never allow enqueing more entries than the ring can hold.

If Write\_Index == Read\_Index, then the SDXI function does not process more entries of that context until Write\_Index and the context's doorbell value are updated beyond Read\_Index. The SDXI function will stop processing entries when Read\_Index == Write\_Index or Read\_Index points to an invalid entry. The SDXI function shall never process the entry pointed to by Write\_Index or any entry that is logically past Write\_Index, regardless of the value of the descriptor's Valid field.

For a valid context, software shall only modify valid descriptors located between the context's Read\_Index and Write\_Index when the associated CXT\_STS.state is CXTV\_STOP\_[SW, FN] or CXTV\_ERR\_FN.

## 5.1 Descriptor Operations

Each descriptor operation has an "opcode" field encoding and a specific format for the bytes that comprise it. The "opcode" field is subdivided into smaller subfields including "type" and "subtype". For the purposes of standardized enumeration and implementation, SDXI arranges sets of related operations into an operation group; each operation within a group has a shared "type" encoding combined with an unique "subtype" encoding. For example, all the operations of the DmaBaseGroup (DSC\_DMAB) have a "type" encoding of "0x001".

The enumeration and enabling for most operation groups are provided by a set of operation-group ("op\_grp") fields described below.

- 1. Enumerated support of the operation group by the SDXI function through MMIO\_CAP1.opb\_000\_cap.
- 2. Availability of the operation group for all SDXI function contexts configured by privileged software through MMIO\_CTL2.opb\_000\_avl.
- 3. Enabling of the operation group for an individual context configured by privileged software through CXT\_L1.opb\_000\_enb.

Using these fields, SDXI places each descriptor operation group into one of the following categories.

- 1. Mandatory, not-enumerable: all SDXI functions are required to implement the group which is always available and enabled; there are no assigned operation-group fields. Examples are DmaBaseGrp and AdminGrp.
- 2. Optional: an SDXI function may implement the operation group; if it does, it shall support the assigned operation-group fields. An example is the AtomicGrp.

When enumeration is defined and true for an operation group at a given bit position "x" in the operationgroup fields, an SDXI context may use the operation if the following below condition is true; if not, the SDXI function logs an error and stops execution of the context -- i.e., initiates a StopErr:Cxt action.

• MMIO\_CAP1.opb\_000\_cap[x] & MMIO\_CTL2.opb\_000\_avl[x] & CXT\_L1.opb\_000\_enb[x]

Privileged Software shall not set an operation-group bit in MMIO\_CTL2.opb\_000\_avl and CXT\_L1.opb\_000\_enb if the corresponding bit in MMIO\_CAP1.opb\_000\_cap is "0". The SDXI function may ignore such bits and software shall not rely upon their behavior.

The definition of descriptor opcodes, their formats, and their function operation-group fields are given in *"Chapter 6, SDXI Descriptor and Operation Specification"*.

## 5.2 Enqueuing one or more Descriptors

The following procedure may be used to enqueue one or more descriptors into a descriptor ring:

- 1. Check for sufficient space in the descriptor ring by reading the Read\_Index, Write\_Index and ring size.
- 2. Reserve space in the descriptor ring by adding a value to the Write Index in memory corresponding to the number of descriptors to be enqueued. In a multi-producer scenario, Write\_Index can only be safely updated using an atomic compare-and-swap operation. This operation also provides the required single-threaded guarantees with respect to the update of Write\_Index.
- 3. Let "PWI" refer to the pre-incremented Write\_Index. Then Descriptors may be written into memory starting at:

CXT\_CTL.ds\_ring\_ptr + (PWI % CXT\_CTL.ds\_ring\_sz )\*64.

Software shall ensure that the Valid field in a descriptor header is set to 0 (Invalid) until the whole descriptor is written. The function may start processing the whole descriptor immediately once the valid field is set to 1. The function may read the descriptor one or more times prior to the doorbell being written.

4. Once all descriptors are written to memory, write the updated Write\_Index value to the context's Doorbell register.

#### Figure 5-2: Example x86-64 SDXI Enqueue Code

/\* Example code for enqueueing descriptors on an SDXI ring using gcc 9.3 (and later).

The code should be correct for multiple CPU architectures as it relies solely on: the cpu-independent gcc singlethreaded guarantees for instruction re-ordering; gcc atomic built-ins which control compiler instruction-reordering and proper emitting of CPU memory barriers and instructions; and the behavior of the volatile type keyword to ensure that gcc does not optimize away or cache critical SDXI index locations. (Note: the code has been compiled at -O3 and the assembly examined on both x86-64 and ARM64 for correctness wrt these issues.)

Using a fence (\_\_atomic\_thread\_fence(\_\_ATOMIC\_ACQ\_REL)) at key points in the code ensures for any given thread that: the compiler doesn"t re-order instructions due to optimizations; and the compiler emits the minimum but necessary CPU memory barriers to ensure the required read and write ordering within the thread.

For example, on x86-64 with optimizations enabled, the fence compiles to nothing due to the TSO memory model; however, on ARM64 this compiles to "dmb ish". Using \_\_ATOMIC\_SEQ\_CST instead for the fence penalizes the emitted code in some cases -- e.g. mfence on x86-64 which is too strong.

In a multi-producer scenario, Write\_Index can only be safely updated using \_\_atomic\_compare\_exchange\_n(). This operation also provides the required single-threaded guarantees wrt the update of Write\_Index. Using \_ATOMIC\_SEQ\_CST here seems like the right generic choice which gcc compiles to the correct CPU instructions such as "lock cmpxchg" on x86\_64 and "ldaxr, cmp, bne, stlxr" on ARM64.

\*/

\*/

```
#define SDXI_DESCR_SIZE 64
#define SDXI_DS_NUM_QW (SDXI_DESCR_SIZE / sizeof(uint64_t))
#define _ALIGN64 __attribute__ ((aligned (64)))
#define SDXI_MULTI_PRODUCER 1 // Define to 0 if single-producer.
```

/\* Shows a method for writing an array of descriptors to the SDXI ring given a ring index.

Method: Skip writing the first QW of the first new entry to be added to the ring, but write all the other entry data as normal. This ensures that we don"t trigger the SDXI function into reading incomplete entries as we are writing them. Finally, write the first QW of the first new entry to the ring so the SDXI function can start processing the new entries.

This is safe provided that the zeroth bit of the LSB (VALID bit) of the first new ring entry is "0" \*before\* Write\_Index is advanced past it (since an SDXI function will not read past an invalid ring entry). This requirement is ensured by SW and the SDXI function in the following way: SW shall always initialize at least every 64th byte of the ring to zero before first use of the ring; and subsequently, the SDXI function shall always clear the zeroth bit of the LSB of every ring entry the function processes.

```
int update_ring(
   uint64_t * const enq_entries,
                                       // Ptr to entries to enqueue
   uint64_t enq_num,
                                       // Number of entries to enqueue
   uint64_t * ring_base,
                                       // Ptr to ring location
                                       // (Ring Size in bytes)/64
   uint64_t ring_size,
   uint64_t index )
                                       // Starting ring index to update
{
   for (uint64_t i = 0; i < eng_num; i++)
      uint64_t * ringp = ring_base + ((index + i) % ring_size) * SDXI_DS_NUM_QW;
      uint64_t * entryp = enq_entries + (i * SDXI_DS_NUM_QW);
      for (uint64_t j = 1; j < SDXI_DS_NUM_QW; j++ ){</pre>
         *(ringp + j) = *(entryp + j);
      }
   }
   // Now write the first OW of the new entries to the ring.
   __atomic_thread_fence(__ATOMIC_ACQ_REL);
   for (uint64_t i = 0; i < enq_num; i++)
      uint64_t * ringp = ring_base + ((index + i) % ring_size) * SDXI_DS_NUM_QW;
      uint64_t * entryp = enq_entries + (i * SDXI_DS_NUM_QW);
      *ringp = *entryp;
   }
   return 0;
}
```

/\* Enqueues entries onto an SDXI Ring.

We assume that the OS and/or the application have already setup the SDXI function's context and can pass the locations and sizes of the ring, Write\_index, Read\_Index, and Door\_Bell to this routine. Once Write\_Index can be safely advanced for the desired number of ring entries, this method does so and \*then\* afterwards populates the new ring entries.

```
*/
int sdxi_enqueue(
  uint64_t * const enq_entries,
                                             // Ptr to entries to enqueue
  uint64_t enq_num,
                                             // Number of entries to enqueue
  uint64_t volatile * const Door_Bell)
                                             // Ptr to Ring Doorbell location
{
  uint64_t read_idx, old_write_idx, new_idx;
   // SW should do intelligent retry & back-off; while loop shown for simplicity
  while (true){
     // Get Read_Index before Write_Index to always get consistent values
     read_idx = *Read_Index;
       _atomic_thread_fence(__ATOMIC_ACQ_REL);
     old_write_idx = *Write_Index;
     if (read_idx > old_write_idx){
        // Only happens if Write_Index wraps or ring has bad setup
        return 1;
     }
     new_idx = old_write_idx + eng_num;
     if (new_idx - read_idx > ring_size) {
                                         // Not enough free entries. try again
        continue:
     if ( SDXI_MULTI_PRODUCER ){
        // Try to atomically update Write_Index before other threads with
        // compare_exchange operation. This built-in also ensures the required
        // single-threaded guarantees wrt the update of Write_Index.
        bool success =
           ___atomic_compare_exchange_n( Write_Index, &old_write_idx, new_idx,
                                       false, __ATOMIC_SEQ_CST, __ATOMIC_SEQ_CST );
        if (success) break; // Updated Write_Index, no need to try again.
     }
     else {
         // Single-Producer case
        *Write_Index = new_idx;
         __atomic_thread_fence(__ATOMIC_ACQ_REL);
                 // Always successful for single-producer
        break;
     }
      // Couldn"t update Write_Index, try again.
  }
  // Write_Index is now advanced. Let's write out entries to the ring.
  update_ring(eng_entries, eng_num, ring_base, ring_size, old_write_idx);
   // Door_Bell write required; only needs ordering wrt update of Write_Index.
  *Door_Bell = new_idx;
  return 0;
}
```

### 5.2.1 Multi-Producer Enqueue

Multiple producer entities may use the enqueue procedure described in the prior section in parallel without the use of locks or other higher-level serialization methods applied to the descriptor ring. While a context's Write\_Index value in memory must be monotonically increasing, the SDXI function may see doorbell writes to a context whose values are not monotonically increasing. The function shall discard such doorbell writes.

## 5.3 Descriptor Processing

This section describes the specific behavior required of an SDXI function as it processes a descriptor through the phases of parsing, execution, and completion. With respect to each other, descriptors within the same context are parsed in order, but may execute and complete out-of-order or in parallel.

**Descriptor Parsing Main Loop:** 

- 1. The function derives values for the context's memory-resident CXT\_STS.state, Read\_Index, Write\_Index, and the descriptor at Read\_Index.
  - a. For each reference to a memory-resident value in this parsing loop, the function may source the value from either an internal function cache or the architected locations in memory as appropriate unless explicitly stated to the contrary.
  - b. If during this parsing loop, errors occur when the function reads or writes these values to memory, the function shall log an appropriate error and end this parsing loop by proceeding to step 11.
  - c. When the function serializes a value to memory, it shall ensure that any internal source of the value matches its memory-resident value; if not, the function shall update the value in memory.
- 2. The function shall examine the context's CXT\_STS.state and then perform the following actions:
  - a. If the context is not at CXTV\_RUN, the function shall end this parsing loop by proceeding to step 11.
  - b. If the context is at CXTV\_RUN, the function may serialize the Read\_Index value in memory as appropriate so software can make good forward progress.
- 3. If the function receives a doorbell\_value for the context or fencing was required on the current descriptor and has completed; the function's context parsing shall go to step 4.
  - a. The parsing may also go to step 4 for implementation-specific reasons.
  - b. Otherwise, the function's descriptor parsing for the context goes back to step 1.
- 4. The function shall perform checks on the context's Read\_Index and Write\_Index values as follows.
  - a. If Write\_Index is less than Read\_Index, the function shall log an error and end this parsing loop by proceeding to step 11.
  - b. If Write\_Index minus Read\_Index is greater than the context's descriptor ring size (ds\_ring\_sz), the function shall log an error and end this parsing loop by proceeding to step 11.
  - c. If Read\_Index is less than Write\_Index, the function's descriptor parsing may proceed to step 5.
  - d. Otherwise, the parsing shall go back to step 1.
- 5. The function shall poll the valid bit of the descriptor at Read\_Index until it is valid or until an implementation-specific timeout period has been reached. If timeout happens, the function shall go to step 11 and log a timeout error (Step 07h, cv=1, div=1, bf=0, sub\_step=3, re=1, err\_class=0x2500).
- 6. The function shall check the validity of the descriptor's type, subtype, chain, fence, and sequential fields for the current context. If any are invalid, the function shall log an appropriate error and end this parsing loop by proceeding to step 11. The type and subtype fields specify the descriptor operation that the function shall check for validity by the following steps.

- a. If the function does not support the operation, then the operation is invalid.
- b. If the operation is a supported, AdminGrp descriptor operation, but the current context number is not "0" (Admin Context), then the operation is invalid.
- c. If the operation is a supported operation from a supported optional descriptor group, then there is a corresponding bit field, "x", in the op\_grp (operation group) fields. If the following condition is not true, then the operation is invalid:
- d. MMIO\_CAP1.opb\_000\_cap[x] & MMIO\_CTL2.opb\_000\_avl[x] & CXT\_L1.opb\_000\_enb[x]
- e. Otherwise, the operation is valid.
- 7. The function shall check the fence bit in the descriptor header. If it is set, the function shall poll for previously executing descriptors of the context to complete until an implementation-specific timeout period has been reached.
  - a. If timeout happens, the function shall go back to step 1. If the previously executing descriptors complete with error, the function shall end this parsing loop by proceeding to step 11.
  - b. If these descriptors complete successfully and CXT\_STS.state is CXTV\_RUN, the parsing continues to the next step; otherwise parsing retruns to step 1.
- 8. The function shall ensure that the descriptor's valid field in memory is set to invalid. If the write succeeds, the function goes to step 9.
- 9. The function shall perform the following in any order or in parallel.
  - a. The function shall initiate the background execution of the current descriptor using the execution flow later in this section. The execution may be asynchronous to this parsing loop, thus the function does not need to wait for the execution to complete before parsing new descriptors.
  - b. The function shall increment the derived value of Read\_Index. Note, Read\_Index is not required to be serialized to memory at this time. However, it is recommended that an SDXI function implementation serialize Read\_Index frequently enough to allow software to perform simple load balancing between contexts.)
- 10. The function shall continue descriptor parsing for the context at step 1.

### Descriptor Parsing End:

- 11. If no error has occurred in parsing the descriptor, then descriptor parsing ceases and the function proceeds to the "Execution & Completion Phase".
- 12. If an error has occurred in parsing the descriptor, the function shall initiate a LogErr:Cxt and StopErr:Cxt action on the context in any order or in parallel. Descriptor parsing ceases and no execution occurs. Note, both the descriptor's valid field and the descriptor's completion status block are not modified in this case.

### Execution & Completion Phase:

- 1. The function shall execute the current descriptor operation including accessing any referenced AKeys, RKeys, and data buffers. DSH may be applied to data buffer accesses; see "3.6, Data Steering Hints (DSH)" for more details. The execution is constrained by the following.
  - a. If the Sequential Consistency (S) flag is set in the descriptor, all writes from prior descriptor operations in the same context shall be made globally visible prior to making writes from the current descriptor operation globally visible, regardless of address.
- 2. The function shall consider the execution of the current descriptor finished when all background actions belonging to the descriptor are no longer executing. For each action belonging to the current descriptor, one of the following shall be true.
  - a. The action has been fully performed.
  - b. The action has encountered an error and was stopped before proceeding further.
  - c. The action was stopped or not even begun because of an error in another action belonging to the same descriptor or the entire descriptor was aborted due to the context stopping.
- 3. When the execution of the current descriptor is finished, the function proceeds to step 4.
- 4. If no descriptor execution errors occurred, and the descriptor's "np" field is "1", then the descriptor has completed.
- 5. If no descriptor execution errors occurred, and the descriptor's "np" field is "0", the function shall update the CST\_BLK.signal field. If no error occurs in updating the field, then the descriptor has completed. If an error occurs in updating the field, the function starts the SignalErr:Cxt action, which shall set CST\_BLK.er and start the StopErr:Cxt and LogErr:Cxt actions in any order or in parallel. The descriptor is considered complete when the StopErr:Cxt action completes. (See "3.4, Error Log" for details.)
- 6. If a descriptor execution error occurred and the descriptor's "np" field is "1", then the function shall start the StopErr:Cxt action and LogErr:Cxt action in any order or in parallel. The descriptor is considered complete when the StopErr:Cxt action completes. (See "3.4, Error Log" for details.)
- 7. If a descriptor execution error occurred and the descriptor's "np" field is "0", then the function shall start the StopErr:Cxt action, LogErr:Cxt action, and DescrErr:Cxt action (which shall set CST\_BLK.er and update the CST\_BLK,signal fields) in any order or in parallel. The descriptor is considered complete when the StopErr:Cxt action completes. (See "3.4, Error Log" for details.)

## 5.4 Descriptor Ordering and Parallel Execution

An SDXI function may prefetch any number of descriptors from a descriptor ring up to the location prior to that indicated by the Write\_Index value in memory. Dynamic modification of valid descriptors located between Read\_Index and Write\_Index-1 is not supported.

The Fence (F) flag may be used to constrain parallel execution. When the Fence flag is set, the function shall wait for all prior descriptor operations to complete before executing the fenced descriptor operation. When the Fence flag is clear, the descriptor operation may be executed while prior descriptor operations are outstanding.

The Sequential Consistency (S) flag may also affect parallel execution. See section "5.6, Memory Consistency Model" for more details.

There are no ordering or consistency requirements for descriptors belonging to different contexts.

## 5.5 Descriptor Completion

When each descriptor operation has completed, the function will appropriately update the referenced CST\_BLK completion signal in memory. Descriptors may complete out of order. It is up to software to assign completion signal locations and initial values as appropriate. Some applications may require precise tracking of descriptors while others may track groups of descriptors.

When a descriptor encounters an error during processing and causes its context to stop, there may be multiple prior and/or subsequent descriptors from the same context being processed in parallel. These may complete independently with or without error.

## 5.6 Memory Consistency Model

Unless otherwise specified, the order in which the SDXI function reads and writes buffers and other data structures associated with executing an SDXI descriptor is undefined. Software shall not rely upon the ordering of the data transfer within a single descriptor and thus shall not overlap the source buffer, destination buffer, Atomic Return Data, or completion status block.

The SDXI function shall ensure that when clearing the descriptor header Valid bit, it shall be made globally visible ahead of making any writes associated with the descriptor operation globally visible and ahead of making any writes to the descriptor's Completion Status globally visible.

All writes associated with a descriptor operation shall be made globally visible prior to making changes to the descriptor's Completion Status globally visible. Within the Completion Status, changes to Error Recorded (ER) field shall be made globally visible prior to making updates to the descriptor's completion signal globally visible.

If the Sequential Consistency (S) flag is set, all writes from prior descriptor operations in the same context shall be made globally visible prior to making writes from the current descriptor operation globally visible, regardless of address.

If the Sequential Consistency flag is clear, writes associated with the current descriptor operation may be reordered ahead of, and made globally visible prior to, writes associated with prior descriptor operations that have not yet been made globally visible.

Unlike the fence flag, the sequential consistency flag allows multiple descriptors to be executed in parallel as long as the rules around global visibility are maintained.

Developer Note: This note is regarding the use of Sequential Consistency Flag and PCIe Relaxed Ordering. In general, data buffer writes associated with a single descriptor operation may be issued over the PCIe bus in any order.

If the S flag is clear, all writes associated with the descriptor operation may be issued as Relaxed Ordered (RO=1) writes. Additionally, writes associated with the descriptor operation may be issued ahead of writes associated with prior descriptor operations.

If the S flag is set, all writes associated with a descriptor operation need to be issued as non-relaxed (RO=0) over the PCIe bus, and only after writes associated with all prior descriptor operations have been issued on the bus.

## 5.7 Descriptor Chaining

The descriptor chaining mechanism may be used for either Extended Descriptors or as part of the Error Log. See *"5.7.1, Extended Descriptors"* and *"3.4, Error Log"* for more details.

A descriptor chain is formed from 2 or more adjacent 64-byte descriptor entries. Each entry in the chain is referred to as a link. Each link is formatted as a standard SDXI descriptor with an SDXI header and footer. Each link except for the last shall set the Chain (C) bit to 1 in their respective headers. The last link in the chain is indicated by a descriptor with the Chain bit clear.

Software shall reserve enough space in the descriptor ring to enqueue the entire chain. The individual descriptor entries forming the chain may be written into the descriptor ring in arbitrary order.

When processing a descriptor chain, an SDXI function shall increment Read\_Index by 1 and clear the header Valid bit for each link in the chain.

### 5.7.1 Extended Descriptors

Extended descriptors may be utilized by operations that require more than the 52 bytes of operand space provided by a single SDXI descriptor. Additional operand space may be created by chaining 2 or more descriptor entries. Each operation definition shall indicate whether the use of an extended descriptor is required.

Other than the Chain bit, extended descriptors utilize the same descriptor header for each link in the chain.

Only the last link in the chain contains a valid descriptor footer with the pointer to the completion status block. The descriptor footers for all other links in the chain are reserved. Only one CST\_BLK completion signal update is performed for the entire extended descriptor.

Error log entries are generated for the extended descriptor as a whole.

An SDXI function processing an extended descriptor shall wait until all links are valid before executing the descriptor operation.

### 5.8 Descriptor Driven Interrupts

Interrupts generated by descriptors do not set any sort of internal MMIO status as may be present in traditional devices. Any interrupt status is expected to be maintained in memory.

For these interrupts, the function does not explicitly know when interrupt service is complete. These interrupts must be edge triggered as the function does not know when to de-assert a level-sensitive interrupt. Additionally, if MSI or MSI-X Pending bits are implemented, the Pending state will not be change from Set to Clear while the associated vector is masked.

An SDXI function provides programmed-data acceleration by reading and executing a series of memorybased, naturally-aligned, 64-byte "descriptors". Each descriptor's "opcode" field encodes a requested operation and the remainder of the descriptor specifies additional parameters. The format of each supported descriptor is defined in this chapter. The operation of the descriptor ring and the processing, execution, and completion of descriptors is discussed in *"Chapter 5, SDXI Descriptor Ring Operation"*.

For the purposes of standardized enumeration and implementation, SDXI arranges sets of related descriptor operations into an operation group; each operation within a group has a shared "type" encoding combined with an unique "subtype" encoding. The enumeration and enabling for most operation groups are provided by a set of operation-group ("op\_grp") fields implemented in MMIO\_CAP1.opb\_000\_cap, MMIO\_CTL2.opb\_000\_avI, CXT\_L1.opb\_000\_enb. The mechanism by which these fields are used is described in *"5.1, Descriptor Operations"*; their encoding for specific operation groups are defined in *"Table 6–1: SDXI Operation* Groups".

| Grou  | ip Fields |   | Operation Group Definitions                  |                      |                                                         |  |  |
|-------|-----------|---|----------------------------------------------|----------------------|---------------------------------------------------------|--|--|
| Bits  | Mapping   |   | Operation Group                              | Туре                 | Required for SDXI Function?                             |  |  |
| 0     | Reserved  |   | (Reserved)                                   | 0x000                | Reserved                                                |  |  |
| 1     | Reserved  |   | DMA Base Operation<br>Group (DmaBaseGrp)     | 0x001                | Mandatory, not-enumerable                               |  |  |
| 2     | Reserved  |   | Administrative Operation<br>Group (AdminGrp) | 0x002                | Mandatory, not-enumerable; supported in context 0 only. |  |  |
| 3     | Maps to   | ⊅ | Full Atomic Operation<br>Group (AtomicGrp)   | 0x003                | Optional, Requires Enablement                           |  |  |
| 4     | Maps to   | ⊅ | Interrupt Operation Group<br>(IntrGrp)       | 0x004                | Optional, Requires Enablement                           |  |  |
| 5     | Maps to   | ⊅ | Minimal Atomic Operation<br>Group            | 0x003<br>(not 0x005) | Optional, Requires Enablement                           |  |  |
| 27-6  | Maps to   | ₽ | Reserved                                     | 0x01B - 0x006        | Reserved                                                |  |  |
| 31:28 | Reserved  |   | Vendor-Defined                               | 0x01F - 0x01C        | Optional, Requires Enablement                           |  |  |
|       |           | - | Reserved                                     | 0x7F6 – 0x020        | Reserved                                                |  |  |
|       |           |   | Reserved for Error Log                       | 0x7F7                | Mandatory, not-enumerable                               |  |  |
|       |           |   | Reserved                                     | 0x7F8 – 0x7FF        | Reserved                                                |  |  |

### Table 6–1: SDXI Operation Groups

**Operation-**

| Operation Group                        | Туре          | Subtype | Operation        |
|----------------------------------------|---------------|---------|------------------|
| (Reserved)                             | 0x000         |         |                  |
| DMA Base Operation Group               | 0x001         | 0x01    | DMAB_NOP         |
| (DmaBaseGrp)                           |               | 0x02    | DMAB_WRT_IMM     |
|                                        |               | 0x03    | DMAB_COPY        |
|                                        |               | 0x04    | DMAB_REPCOPY     |
| Administrative Operation Group         | 0x002         | 0x00    | DSC_FN_UPD       |
| (AdminGrp)                             |               | 0x01    | DSC_CXT_UPD      |
|                                        |               | 0x02    | DSC_AKEY_UPD     |
|                                        |               | 0x03    | DSC_CXT_START_NM |
|                                        |               | 0x04    | DSC_CXT_STOP     |
|                                        |               | 0x05    | DSC_ADM_INTR     |
|                                        |               | 0x06    | DSC_SYNC         |
|                                        |               | 0x07    | DSC_RKEY_UPD     |
|                                        |               | 0x08    | DSC_CXT_START_RS |
| Full Atomic Operation Group            | 0x003         | 0x00    | Reserved         |
| (AtomicGrp)                            |               | 0x01    | DSC_ATM_SWAP1    |
|                                        |               | 0x02    | DSC_ATM_UADD1    |
|                                        |               | 0x03    | DSC_ATM_USUB     |
|                                        |               | 0x04    | Reserved         |
|                                        |               | 0x05    | DSC_ATM_AND      |
|                                        |               | 0x06    | DSC_ATM_OR       |
|                                        |               | 0x07    | DSC_ATM_XOR      |
|                                        |               | 0x08    | DSC_ATM_SMIN     |
|                                        |               | 0x09    | DSC_ATM_SMAX     |
|                                        |               | 0x0A    | DSC_ATM_UMIN     |
|                                        |               | 0x0B    | DSC_ATM_UMAX     |
|                                        |               | 0x0C    | DSC_ATM_UCLAMPI  |
|                                        |               | 0x0D    | DSC_ATM_UCLAMPD  |
|                                        |               | 0x0E    | DSC_ATM_CMPSWAP1 |
| Interrupt Operation Group<br>(IntrGrp) | 0x004         | 0x00    | DSC_INTR         |
| Reserved                               | 0x005         |         | Reserved         |
| Reserved                               | 0x01B - 0x006 |         | Reserved         |
| Vendor-Defined Operation               | 0x01F - 0x01C |         |                  |
| Reserved                               | 0x7F6 – 0x020 |         |                  |
| Reserved for Error Log                 | 0x7F7         |         |                  |
| Reserved                               | 0x7F8 – 0x7FF |         | Reserved         |

 Table 6–2:
 SDXI Operation Groups, Types, and Subtypes

1. Also supported when Minimal Atomic Operations are present.

## 6.1 Descriptor Format for SDXI Operations

### 6.1.1 Common Header and Footer

All SDXI operations use the following common header and footer encodings in their descriptor format.

| 31 30 29 28 27 | 26 25 24 23 22 21 20 19 18 17 16 | 15 14 13 12 11 10 9 8 | 76  | 54  | 3  | 2  | 1 0   |        |
|----------------|----------------------------------|-----------------------|-----|-----|----|----|-------|--------|
| rsv            | type                             | subtype               | rsv | CSI | ch | fe | se vl | + 0x00 |
|                | opera                            | ation                 |     |     |    |    |       | + 0x04 |
|                |                                  |                       |     |     |    |    |       | + 0x08 |
|                |                                  |                       |     |     |    |    |       | + 0x0C |
|                |                                  |                       |     |     |    |    |       | + 0x10 |
|                |                                  |                       |     |     |    |    |       | + 0x14 |
|                |                                  |                       |     |     |    |    |       | + 0x18 |
|                |                                  |                       |     |     |    |    |       | + 0x1C |
|                |                                  |                       |     |     |    |    |       | + 0x20 |
|                |                                  |                       |     |     |    |    |       | + 0x24 |
|                |                                  |                       |     |     |    |    |       | + 0x28 |
|                |                                  |                       |     |     |    |    |       | + 0x2C |
|                |                                  |                       |     |     |    |    |       | + 0x30 |
|                |                                  |                       |     |     |    |    |       | + 0x34 |
|                | csb                              | ptr                   |     |     | rs | SV | np    | + 0x38 |
|                |                                  |                       |     |     |    |    |       | + 0x3C |

#### Figure 6-1: Common Header and Footer

| Field             | Bits    | Subfield | Description                                                                                                                                                                                                                                                                                                                                                                                                                                           |
|-------------------|---------|----------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| u32 opcode;       | 000     | VI       | <ul> <li>Valid.</li> <li>1 = Descriptor is valid. All fields may be processed.</li> <li>0 = Descriptor is invalid.</li> <li>All other fields within the descriptor shall be ignored. The function shall poll the descriptor until it becomes valid or until an implementation defined timeout period has expired at which point an error is logged.</li> </ul>                                                                                        |
|                   | 001     | se       | Sequential Consistency<br>1 = Operation writes are sequentially consistent<br>0 = Operation writes are not required to be sequentially consistent<br>See "5.6, Memory Consistency Model" for more details.                                                                                                                                                                                                                                            |
|                   | 002     | fe       | <ul> <li>Fence. See "5.4, Descriptor Ordering and Parallel Execution" for more details.</li> <li>1 = All prior descriptor operations shall complete before executing this descriptor's operation</li> <li>0 = Execution of this descriptor's operation is permitted prior to the completion of prior descriptor operations</li> </ul>                                                                                                                 |
|                   | 003     | ch       | Chain. See <i>"5.7, Descriptor Chaining"</i> for more details.<br>1 = Start or middle of a set of chained descriptors.<br>0 = End of chain, or non-chained descriptor.                                                                                                                                                                                                                                                                                |
|                   | 004     | csr      | Completion Status Mode Requirement for the descriptor. See<br><i>"4.4.2, Completion-Status Modes"</i> for details.<br>0 = Atomic completion-status mode shall be used.<br>1 = Simple completion-status mode shall be used.                                                                                                                                                                                                                            |
|                   | 007:005 | rsvd     | Shall be set to 0                                                                                                                                                                                                                                                                                                                                                                                                                                     |
|                   | 015:008 | subtype  | Specifies an operation within the Operation Group.                                                                                                                                                                                                                                                                                                                                                                                                    |
|                   | 026:016 | type     | Specifies the Operation Group.                                                                                                                                                                                                                                                                                                                                                                                                                        |
|                   | 031:027 | rsvd     | Shall be set to 0                                                                                                                                                                                                                                                                                                                                                                                                                                     |
| u8 operation[52]; | 447:032 |          | This region is defined separately for each operation.                                                                                                                                                                                                                                                                                                                                                                                                 |
| u64 csb_ptr;      | 448     | np       | no_pointer.<br>When 0, the SDXI function shall update the completion status<br>blocked pointed to by csb_ptr.<br>When 1, there is no completion status block associated with the<br>descriptor to update and the csb_ptr shall be ignored.<br>This mechanism avoids the overhead of completion-block-status<br>processing for a series of descriptor requests that are ordered<br>before a terminal one which will specify a completion status block. |
|                   | 452:449 | rsvd     | Shall be set to 0                                                                                                                                                                                                                                                                                                                                                                                                                                     |
|                   | 511:453 | csb_ptr  | Completion Status Block pointer.<br>A pointer to a 32B aligned region of memory containing the<br>Completion Status block. An invalid pointer may result in an error.                                                                                                                                                                                                                                                                                 |

### 6.1.2 Completion Status Block

When a descriptor's "np" ("no\_pointer") field is "0", the descriptor's "csb\_ptr" field points to a completion status block ("CST\_BLK") whose fields are used by the SDXI function to update (modify) the descriptor operation's completion status and report relevant errors. (The manner in which the SDXI function updates the CST\_BLK during descriptor processing is discussed in *"5.3, Descriptor Processing"*.) The producer of the descriptor initializes the CST\_BLK before submitting the descriptor.

As discussed in "4.4.2, Completion-Status Modes", the producer may require or be required to use one of two producer completion modification modes for the context with respect to the atomic modification of the CST\_BLK. The availability of the modes is reported by privileged software through the CXT\_CTL.csa ("completion-status mode availability") field and may be reported to the context by means outside the scope of the SDXI specification. The producer complies to and specifies the mode to the SDXI function by setting the descriptor's "csr" ("completion status requirement") field.

| <u>31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0</u> |        |
|----------------------------------------------------------------------------------------------|--------|
| signal                                                                                       | + 0x00 |
|                                                                                              | + 0x04 |
| er rsv                                                                                       | + 0x08 |
| rsv                                                                                          | + 0x0C |
|                                                                                              | + 0x10 |
|                                                                                              | + 0x14 |
|                                                                                              | + 0x18 |
|                                                                                              | + 0x1C |

#### Figure 6-2: Completion Status Block

The completion signal field within the CST\_BLK is used by the producer to track the completion of one or more descriptors. It is updated by the SDXI function as appropriate for the completion-status mode indicated by the descriptor's "csr" field. When the signal reaches an appropriate terminal value, the producer may conclude that the associated descriptors have finished and may rely upon the other CST\_BLK fields to determine success, failure, or other associated information.

| Field          | Bits    | Subfield | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                              |
|----------------|---------|----------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| u64 signal;    | 063:000 |          | Completion Signal value.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |
| u32 flags;     | 094:064 | rsvd     | Shall be set to 0                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |
|                | 095     | er       | Error Recorded (er).<br>By initializing this field to "0" before issuing descriptors associated with<br>the CST_BLK, software can rely upon this behavior: when "1", at least<br>one completed descriptor has encountered an error; when "0", all<br>completed descriptors have not encountered an error.<br>During the processing of an associated descriptor, an SDXI function shall<br>operate on this field as follows: if the descriptor succeeds, the value of<br>the field shall be preserved; and<br>if the descriptor encounters an error, the value of the field shall be "1". |
| u8 rsvd_0[20]; | 255:096 |          | Shall be set to 0                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                        |

#### Table 6–4: CST\_BLK<sup>[^3]</sup> (Completion Status Block)

Multiple descriptors may point to the same single CST\_BLK when atomic completion-status mode is available and used for all associated descriptors. The producer shall refrain from this usage when the mode is not available. During this usage, if any descriptor associated with the CST\_BLK does not specify

atomic completion-status mode, the SDXI function may return indeterminate contents of the CST\_BLK; the producer shall avoid and not rely upon this behavior.

For each descriptor error reported through CST\_BLK, the SDXI function shall ensure that the "er" ("error recorded") field is set to "1" before modifying the completion signal value. When multiple descriptors are associated with a single CST\_BLK, the SDXI function may set the "er" field to "1" for errors encountered on one or more of the descriptors.

### 6.1.3 Attribute Field

The attribute (Attr) field controls features related to accessing a data buffer. Within a descriptor definition, a postfix may be added to the field name to uniquely identify a specific data buffer (ex. Attr (Src), Attr (Dst)).

Subsequently defined descriptors may contain data buffer pointers which have a corresponding Attr field.

| Bits | Field            | Description                                                                                                                                                                                                                                                                                                                                                                                                               |  |  |
|------|------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|
| 1:0  | CoherencyControl | <ul> <li>00b : The associated memory location is accessed as I/O coherent.</li> <li>01b : The associated memory location is accessed as non-coherent.</li> <li>10b : The associated memory location is accessed as I/O coherent. DSH information is requested to be included when accessing this memory location. See "3.6, Data Steering Hints (DSH)" for more details.</li> <li>All other encodings reserved</li> </ul> |  |  |
| 2    | rsvd             | Shall be set to 0                                                                                                                                                                                                                                                                                                                                                                                                         |  |  |
| 3    | MMIO             | <ul> <li>1b: The referenced memory buffer may be located in either system memory or MMIO space.</li> <li>0b: The reference memory buffer is located in system memory and is not located in an MMIO space. This setting may optimize performance in some implementations.</li> </ul>                                                                                                                                       |  |  |

| <b>Fable</b> | 6-5: | Attribute | Field |
|--------------|------|-----------|-------|
|              |      | /         |       |

| Figure 6-3: AttributeForma |
|----------------------------|
|----------------------------|

| 0    |      |               |
|------|------|---------------|
| 3    | 2    | 1:0           |
| MMIO | rsvd | Coherency Ctl |

## 6.2 DMA Base Operations Group (DmaBaseGrp)

All SDXI implementations are required to support the DmaBaseGrp operations.

### 6.2.1 DmaBaseGrp: DSC\_DMAB\_NOP

The DmaBaseGrp's Nop operation performs no functional DMA operation. A descriptor specifying the operation is processed and ordered the same as other descriptors, obeys all relevant bits in the header, and generates the completion signal.

| -                 |                         |             |                          |       |        |       |      |
|-------------------|-------------------------|-------------|--------------------------|-------|--------|-------|------|
|                   | Figure 6-4: DSC         | _DMAB_N     | <b>IOP Descriptor Fo</b> | ormat |        |       |      |
| 31 30 29 28 27 26 | 25 24 23 22 21 20 19 18 | 17 16 15 14 | 13 12 11 10 9 8          | 7 6 5 | 4 3    | 2 1   | 0    |
| rsv               | type = 0x001            |             | subtype = 0x01           | rsv   | csr ch | fe se | vl - |
|                   |                         | rsv         |                          |       |        |       | -    |
|                   |                         |             |                          |       |        |       | -    |
|                   |                         |             |                          |       |        |       | 4    |
|                   |                         |             |                          |       |        |       |      |
|                   |                         |             |                          |       |        |       | -    |
|                   |                         |             |                          |       |        |       |      |
|                   |                         |             |                          |       |        |       |      |
|                   |                         |             |                          |       |        |       |      |
|                   |                         |             |                          |       |        |       |      |
|                   |                         |             |                          |       |        |       | 1    |
|                   |                         |             |                          |       |        |       | -    |
|                   |                         |             |                          |       |        |       | -    |
|                   |                         |             |                          |       |        |       | -    |
|                   |                         |             |                          |       | -      |       | -    |
|                   |                         | csb_ptr     |                          |       | rs۱    | /     | np - |
|                   |                         |             |                          |       |        |       | H    |
|                   |                         |             |                          |       |        |       |      |

#### Table 6–6: DSC\_DMAB\_NOP<sup>[^7]</sup> Descriptor Format

| Field          | Bits    | Subfield     | Description                                                     |  |  |
|----------------|---------|--------------|-----------------------------------------------------------------|--|--|
| u32 opcode;    | 000     | vl = 1       | Valid. See section "6.1.1".                                     |  |  |
|                | 001     | se = 0       | Sequential Consistency. See section "6.1.1". Shall be set to 0. |  |  |
|                | 002     | fe = *       | Fence. See section "6.1.1".                                     |  |  |
|                | 003     | ch = 0       | Chain. See section <i>"6.1.1".</i> Shall be set to 0.           |  |  |
|                | 004     | csr          | Completion Status Mode Requirement for the descriptor.          |  |  |
|                | 007:005 | rsvd         | Shall be set to 0                                               |  |  |
|                | 015:008 | subtype=0x01 | See section "6.1.1".                                            |  |  |
|                | 026:016 | type=0x001   | See section "6.1.1".                                            |  |  |
|                | 031:027 | rsvd         | Shall be set to 0                                               |  |  |
| u8 rsvd_0[52]; | 447:032 |              | Shall be set to 0.                                              |  |  |
| u64 csb_ptr;   | 448     | np           | no_pointer. See section "6.1.1".                                |  |  |
|                | 452:449 | rsvd         | Shall be set to 0                                               |  |  |
|                | 511:453 | csb_ptr      | Completion Status Block pointer. See section "6.1.1".           |  |  |

0x00 0x04 0x08 0x0C 0x10 0x14 0x18 0x1C 0x20 0x24 0x28 0x2C 0x30 0x34 0x38 0x3C

## 6.2.2 DmaBaseGrp: DSC\_DMAB\_WRT\_IMM Operation

The DmaBaseGrp's WriteImm operation is used to write up to 32 contiguous bytes starting at a destination memory address. The data is supplied inside the descriptor. Following little-endian conventions, data byte 0 is written to the first byte of the destination address.

Because this operation can be used for sending triggers, flags, or other signals to asynchronous processes, the following minimum guarantees against torn writes are required:

- bsize = 0; One-byte immediate writes to any address is atomic
- bsize = 1; Two-byte immediate writes to any even address is atomic
- bsize = 3; Four-byte immediate writes to any address divisible by four is atomic
- bsize = 7; Eight-byte immediate writes to any address divisible by eight is atomic

The atomicity of immediate writes of other sizes, and other alignments, is implementation specific.

| 31 30 29 28 27 | 26 25 24 23 22 21 20 19 18 17 16 | 15 14 13 12 11 10 9 8 | 7 6 5 | 4 3 2 1 0       |        |
|----------------|----------------------------------|-----------------------|-------|-----------------|--------|
| rsv            | type = 0x001                     | subtype = 0x02        | rsv   | csr ch fe se vl | + 0x00 |
|                | r                                | SV                    |       | bsize           | + 0x04 |
|                | rsv                              |                       | rsv   | attr_dst        | + 0x08 |
|                | rsv                              | ak                    | ey0   |                 | + 0x0C |
|                | ado                              | dr0                   |       |                 | + 0x10 |
|                |                                  |                       |       |                 | + 0x14 |
|                | da                               | ita                   |       |                 | + 0x18 |
|                |                                  |                       |       |                 | + 0x1C |
|                |                                  |                       |       |                 | + 0x20 |
|                |                                  |                       |       |                 | + 0x24 |
|                |                                  |                       |       |                 | + 0x28 |
|                |                                  |                       |       |                 | + 0x2C |
|                |                                  |                       |       |                 | + 0x30 |
|                |                                  |                       |       |                 | + 0x34 |
|                | csb                              | _ptr                  |       | rsv np          | + 0x38 |
|                |                                  |                       |       |                 | + 0x3C |

#### Figure 6-5: DSC\_DMAB\_WRT\_IMM Descriptor Format

| Table 6–7: | DSC | DMAB | WRT | IMM <sup>[^7]</sup> | <b>Descriptor Format</b> |
|------------|-----|------|-----|---------------------|--------------------------|
|            |     |      |     |                     |                          |

| Field         | Bits    | Subfield     | Description                                            |  |  |
|---------------|---------|--------------|--------------------------------------------------------|--|--|
| u32 opcode;   | 000     | vl = 1       | Valid. See section "6.1.1".                            |  |  |
|               | 001     | se = *       | Sequential Consistency. See section "6.1.1".           |  |  |
|               | 002     | fe = *       | Fence. See section "6.1.1".                            |  |  |
|               | 003     | ch = 0       | Chain. See section "6.1.1". Shall be set to 0.         |  |  |
|               | 004     | csr          | Completion Status Mode Requirement for the descriptor. |  |  |
|               | 007:005 | rsvd         | Shall be set to 0                                      |  |  |
|               | 015:008 | subtype=0x02 | See section "6.1.1".                                   |  |  |
|               | 026:016 | type=0x001   | See section "6.1.1".                                   |  |  |
|               | 031:027 | rsvd         | Shall be set to 0                                      |  |  |
| u32 size;     | 036:032 | bsize        | Specifies the number of bytes to write minus 1.        |  |  |
|               |         |              | 0x0 = 1 Byte,0x1F = 32 Bytes                           |  |  |
|               | 063:037 | rsvd         | Shall be set to 0                                      |  |  |
| u8 attr;      | 067:064 | attr_dst     | Destination buffer attributes                          |  |  |
|               | 071:068 | rsvd         | Shall be set to 0                                      |  |  |
| u8 rsvd_0[3]; | 095:072 |              | Shall be set to 0.                                     |  |  |
| u16 akey0;    | 111:096 | akey0_dst    | Destination buffer AKey                                |  |  |
| u8 rsvd_1[2]; | 127:112 |              | Shall be set to 0.                                     |  |  |
| u64 addr0;    | 191:128 | (dst)        | Destination buffer starting address                    |  |  |
| u8 data[32];  | 447:192 |              | 32-bytes of immediate data.                            |  |  |
| u64 csb_ptr;  | 448     | np           | no_pointer. See section "6.1.1".                       |  |  |
|               | 452:449 | rsvd         | Shall be set to 0                                      |  |  |
|               | 511:453 | csb_ptr      | Completion Status Block pointer. See section "6.1.1".  |  |  |

## 6.2.3 DmaBaseGrp: DSC\_DMAB\_COPY Operation

The DmaBaseGrp's Copy operation copies a source data buffer to a destination data buffer.

It is an error (Generic Descriptor Error/Unsupported Field Encoding) for the source and destination buffers to exceed the size described by CXT\_L1\_ENT.max\_buffer.

| 31 30 29 28 27 | 26 25 24 23 22 21 20 19 18 17 16 | 15 14 13 12 11 10 9 8 | 7 6 5 4 3 2 1 0                  | ]      |
|----------------|----------------------------------|-----------------------|----------------------------------|--------|
| rsv            | type = 0x001                     | subtype = 0x03        | rsv <mark>csr ch fe se vl</mark> | + 0x00 |
|                | si                               | ze                    |                                  | + 0x04 |
|                | rsv                              |                       | attr_dst attr_src                | + 0x08 |
|                | akey1                            | ako                   | ey0                              | + 0x0C |
|                | ado                              | 1r0                   |                                  | + 0x10 |
|                |                                  |                       |                                  | + 0x14 |
|                | ado                              | ir1                   |                                  | + 0x18 |
|                |                                  |                       |                                  | + 0x1C |
|                | rs                               | V                     |                                  | + 0x20 |
|                |                                  |                       |                                  | + 0x24 |
|                |                                  |                       |                                  | + 0x28 |
|                |                                  |                       |                                  | + 0x2C |
|                |                                  |                       |                                  | + 0x30 |
|                |                                  |                       |                                  | + 0x34 |
|                | csb                              | _ptr                  | rsv np                           | + 0x38 |
|                |                                  |                       |                                  | + 0x3C |

#### Figure 6-6: DSC\_DMAB\_COPY Descriptor Format

| Table 6–8: | DSC | DMAB | <b>Descriptor Format</b> |
|------------|-----|------|--------------------------|
|            |     |      |                          |

| Field          | Bits    | Subfield     | Description                                                                                |  |  |  |  |  |
|----------------|---------|--------------|--------------------------------------------------------------------------------------------|--|--|--|--|--|
| u32 opcode;    | 000     | vl = 1       | Valid. See section "6.1.1".                                                                |  |  |  |  |  |
|                | 001     | se = *       | Sequential Consistency. See section "6.1.1".                                               |  |  |  |  |  |
|                | 002     | fe = *       | Fence. See section "6.1.1".                                                                |  |  |  |  |  |
|                | 003     | ch = 0       | Chain. See section "6.1.1". Shall be set to 0.                                             |  |  |  |  |  |
|                | 004     | csr          | Completion Status Mode Requirement for the descriptor.                                     |  |  |  |  |  |
|                | 007:005 | rsvd         | Shall be set to 0                                                                          |  |  |  |  |  |
|                | 015:008 | subtype=0x03 | See section "6.1.1".                                                                       |  |  |  |  |  |
|                | 026:016 | type=0x001   | See section "6.1.1".                                                                       |  |  |  |  |  |
|                | 031:027 | rsvd         | Shall be set to 0                                                                          |  |  |  |  |  |
| u32 size;      | 063:032 |              | Specifies the number of bytes to write minus 1.<br>0x0 = 1 Byte,, 0xFFFF_FFF = 2**32 Bytes |  |  |  |  |  |
| u8 attr;       | 067:064 | attr_src     | Source buffer attributes                                                                   |  |  |  |  |  |
|                | 071:068 | dst          | Destination buffer attributes                                                              |  |  |  |  |  |
| u8 rsvd_0[3];  | 095:072 |              | Shall be set to 0.                                                                         |  |  |  |  |  |
| u16 akey0;     | 111:096 | (src)        | Source buffer AKey                                                                         |  |  |  |  |  |
| u16 akey1;     | 127:112 | (dst)        | Destination buffer AKey                                                                    |  |  |  |  |  |
| u64 addr0;     | 191:128 | (src)        | Source buffer starting address.                                                            |  |  |  |  |  |
| u64 addr1;     | 255:192 | (dst)        | Destination buffer starting address                                                        |  |  |  |  |  |
| u8 rsvd_1[24]; | 447:256 |              | Shall be set to 0.                                                                         |  |  |  |  |  |
| u64 csb_ptr;   | 448     | np           | no_pointer. See section "6.1.1".                                                           |  |  |  |  |  |
|                | 452:449 | rsvd         | Shall be set to 0                                                                          |  |  |  |  |  |
|                | 511:453 | csb_ptr      | Completion Status Block pointer. See section "6.1.1".                                      |  |  |  |  |  |

### 6.2.4 DmaBaseGrp: DSC\_DMAB\_REPCOPY Operation

The DmaBaseGrp's Repeat Copy operation copies a single 4-KiB-aligned source data buffer to multiple adjacent locations within a 4-KiB-aligned destination buffer. This operation may be used, for example, to fill a large region of memory.

It is an error (Generic Descriptor Error/Unsupported Field Encoding) to specify a total destination size (4Kbytes \* (nsize+1) \* (num+1)) greater than the size described by CXT\_L1\_ENT.max\_buffer.

| 31 30 29 28 27 26 25 24 23 22 21 20 | 19 18 17 16 | 15 14 13 12 | 11 10 9 8 | 7 6 5 4              | 3 2 1 0     |        |  |
|-------------------------------------|-------------|-------------|-----------|----------------------|-------------|--------|--|
| rsv type = 0x00                     | )1          | subtype     | e = 0x04  | rsv <mark>csr</mark> | ch fe se vl | + 0x00 |  |
| rsv                                 | nsize       | 2           |           | rsv                  |             | + 0x04 |  |
| r:                                  | SV          |             |           | attr_dst             | attr_src    | + 0x08 |  |
| akey1                               |             |             | ake       | ey0                  |             | + 0x0C |  |
| addr0_src                           |             |             |           | rsv                  | az          | + 0x10 |  |
|                                     |             |             |           |                      |             | + 0x14 |  |
| addr1_dst                           |             |             | rsv -     |                      |             |        |  |
|                                     |             |             |           |                      |             | + 0x1C |  |
| num                                 |             |             |           | rsv                  |             | + 0x20 |  |
|                                     | rs          | V           |           |                      |             | + 0x24 |  |
|                                     |             |             |           |                      |             | + 0x28 |  |
|                                     |             |             |           |                      |             | + 0x2C |  |
|                                     |             |             |           |                      |             | + 0x30 |  |
|                                     |             |             |           |                      |             | + 0x34 |  |
|                                     | csb         | _ptr        |           |                      | rsv np      | + 0x38 |  |
|                                     |             |             |           |                      |             | + 0x3C |  |

#### Figure 6-7: DSC\_DMAB\_REPCOPY Descriptor Format

### Table 6–9: DSC\_DMAB\_REPCOPY<sup>[^7]</sup> Descriptor Format

| Field          | Bits    | Subfield     | Description                                                                                                                                                                                                                                                                                                                                                                                                               |  |  |  |  |  |
|----------------|---------|--------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|--|--|--|
| u32 opcode;    | 000     | vl = 1       | Valid. See section "6.1.1".                                                                                                                                                                                                                                                                                                                                                                                               |  |  |  |  |  |
|                | 001     | se = *       | Sequential Consistency. See section "6.1.1".                                                                                                                                                                                                                                                                                                                                                                              |  |  |  |  |  |
|                | 002     | fe = *       | Fence. See section "6.1.1".                                                                                                                                                                                                                                                                                                                                                                                               |  |  |  |  |  |
|                | 003     | ch = 0       | Chain. See section <i>"6.1.1".</i> Shall be set to 0.                                                                                                                                                                                                                                                                                                                                                                     |  |  |  |  |  |
|                | 004     | csr          | Completion Status Mode Requirement for the descriptor.                                                                                                                                                                                                                                                                                                                                                                    |  |  |  |  |  |
|                | 007:005 | rsvd         | Shall be set to 0                                                                                                                                                                                                                                                                                                                                                                                                         |  |  |  |  |  |
|                | 015:008 | subtype=0x04 | See section "6.1.1".                                                                                                                                                                                                                                                                                                                                                                                                      |  |  |  |  |  |
|                | 026:016 | type=0x001   | See section "6.1.1".                                                                                                                                                                                                                                                                                                                                                                                                      |  |  |  |  |  |
|                | 031:027 | rsvd         | Shall be set to 0                                                                                                                                                                                                                                                                                                                                                                                                         |  |  |  |  |  |
| u32 size;      | 043:032 | rsvd         | Shall be set to 0                                                                                                                                                                                                                                                                                                                                                                                                         |  |  |  |  |  |
|                | 052:044 | nsize        | The size of the source data buffer in 4-Kbyte increments minus 1.<br>0x0 = 4 Kbytes,, $0x1FF = 2$ Mbytes                                                                                                                                                                                                                                                                                                                  |  |  |  |  |  |
|                | 063:053 | rsvd         | Shall be set to 0                                                                                                                                                                                                                                                                                                                                                                                                         |  |  |  |  |  |
| u8 attr;       | 067:064 | attr_src     | Source buffer attributes                                                                                                                                                                                                                                                                                                                                                                                                  |  |  |  |  |  |
|                | 071:068 | attr_dst     | Destination buffer attributes                                                                                                                                                                                                                                                                                                                                                                                             |  |  |  |  |  |
| u8 rsvd_0[3];  | 095:072 |              | Shall be set to 0.                                                                                                                                                                                                                                                                                                                                                                                                        |  |  |  |  |  |
| u16 akey0;     | 111:096 | (src)        | Source buffer AKey                                                                                                                                                                                                                                                                                                                                                                                                        |  |  |  |  |  |
| u16 akey1;     | 127:112 | (dst)        | Destination buffer AKey                                                                                                                                                                                                                                                                                                                                                                                                   |  |  |  |  |  |
| u64 addr0;     | 128     | az           | "All Zero"<br>When az=1, this indicates that the source buffer is all zero. This<br>allows the SDXI function to optimize the transfer (even avoiding<br>reading the buffer).<br>When az=0, no such guarantee is made. When az=1, SW shall<br>always initialize the entire source buffer to zero; otherwise, the<br>contents of the destination are not defined – and no error is required<br>to be reported in this case. |  |  |  |  |  |
|                | 139:129 | rsvd         | Shall be set to 0.                                                                                                                                                                                                                                                                                                                                                                                                        |  |  |  |  |  |
|                | 191:140 | addr0_src    | 4KB aligned source buffer starting address.                                                                                                                                                                                                                                                                                                                                                                               |  |  |  |  |  |
| u64 addr1;     | 203:192 | rsvd         | Shall be set to 0.                                                                                                                                                                                                                                                                                                                                                                                                        |  |  |  |  |  |
|                | 255:204 | addr1_dst    | 4KB aligned destination buffer starting address.                                                                                                                                                                                                                                                                                                                                                                          |  |  |  |  |  |
| u32 repeat;    | 267:256 | rsvd         | Shall be set to 0.                                                                                                                                                                                                                                                                                                                                                                                                        |  |  |  |  |  |
|                | 287:268 | num          | Specifies the number of times the source buffer is copied to the destination buffer.<br>0x0 = 1 copy 0xF_FFFF = 2**20 copies.<br>The total destination size is:<br>4Kbytes * (nize+1) * (num+1).                                                                                                                                                                                                                          |  |  |  |  |  |
| u8 rsvd_1[20]; | 447:288 |              | Shall be set to 0.                                                                                                                                                                                                                                                                                                                                                                                                        |  |  |  |  |  |
| u64 csb_ptr;   | 448     | np           | no_pointer. See section "6.1.1".                                                                                                                                                                                                                                                                                                                                                                                          |  |  |  |  |  |
|                | 452:449 | rsvd         | Shall be set to 0                                                                                                                                                                                                                                                                                                                                                                                                         |  |  |  |  |  |
|                | 511:453 | csb_ptr      | Completion Status Block pointer. See section "6.1.1".                                                                                                                                                                                                                                                                                                                                                                     |  |  |  |  |  |

## 6.3 Atomic Operation Group (AtomicGrp)

This operation group performs various atomic operations. All AtomicGrp operations use the same descriptor format. An SDXI implementation may support these atomic operations or not. If supported, an SDXI Implementation may support the "Minimal" set or the "Full" set -- which includes the minimal set. If the SDXI function supports the full set, it shall not report support for the minimal set. Both sets use the same descriptor "type" and "subtype" for the minimal set.

Privileged software shall verify **both** of the following before exposing a set of atomic operations to an SDXI context. (See as discussed in "4.4, Atomic Operation Support")

- 1. Interface-supported atomic operations. The relevant platform communication interface to the SDXI function is capable and enabled to support atomic accesses to and from the SDXI function.
- 2. Function-supported atomic operations. The SDXI implementation supports the relevant set of atomic operations.

| 31 30 29 28 27 | 26 25 24 23 22 21 20 19 18 17 16 | 15 14 13 12 11 10 9 8 | 7 6 5 | 4 3 2 1 0             |        |
|----------------|----------------------------------|-----------------------|-------|-----------------------|--------|
| rsv            | type = 0x003                     | subtype               | rsv   | csr ch fe se vl       | + 0x00 |
|                | r                                | 5V                    |       | osz rsv               | + 0x04 |
|                | rsv                              |                       | rsv   | attr_dst              | + 0x08 |
|                | rsv                              | ake                   | ey0   |                       | + 0x0C |
|                | addr                             | D_dst                 |       | n rsv                 | + 0x10 |
|                |                                  |                       |       |                       | + 0x14 |
|                | ot                               | 01                    |       |                       | + 0x18 |
|                |                                  |                       |       |                       | + 0x1C |
|                | ot                               | 02                    |       |                       | + 0x20 |
|                |                                  |                       |       |                       | + 0x24 |
|                | ret_da                           | ita_ptr               |       | <mark>rsv</mark> i nr | + 0x28 |
|                |                                  |                       |       |                       | + 0x2C |
|                | rs                               | V                     |       |                       | + 0x30 |
|                |                                  |                       |       |                       | + 0x34 |
|                | csb                              | _ptr                  |       | rsv np                | + 0x38 |
|                |                                  |                       |       |                       | + 0x3C |

### Figure 6-8: DSC\_ATM Descriptor Format

### Table 6–10: DSC\_ATM<sup>[^7]</sup> Operation Descriptor Format

| Field             | Bits    | Subfield                                 | Description                                                                                                                                                                                                                                                                                                             |  |  |  |  |
|-------------------|---------|------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|--|--|
| u32 opcode;       | 000     | vl = 1                                   | Valid. See section "6.1.1".                                                                                                                                                                                                                                                                                             |  |  |  |  |
|                   | 001     | se = *                                   | Sequential Consistency. See section "6.1.1".                                                                                                                                                                                                                                                                            |  |  |  |  |
|                   | 002     | fe = *                                   | Fence. See section "6.1.1".                                                                                                                                                                                                                                                                                             |  |  |  |  |
|                   | 003     | ch = 0                                   | Chain. See section "6.1.1". Shall be set to 0.                                                                                                                                                                                                                                                                          |  |  |  |  |
|                   | 004     | csr                                      | Completion Status Mode Requirement for the descriptor.                                                                                                                                                                                                                                                                  |  |  |  |  |
|                   | 007:005 | rsvd                                     | Shall be set to 0                                                                                                                                                                                                                                                                                                       |  |  |  |  |
|                   | 015:008 | subtype = SWAP:0x1<br>UMIN:0xA, UMAX: 0x | , UADD:0x2, USUB:0x3, AND:0x5, OR:0x6, XOR:0x7, SMIN:0x8, SMAX:0x9,<br>B, UCLAMPI: 0xC, UCLAMPD:0xD, CMPSWAP:0xE                                                                                                                                                                                                        |  |  |  |  |
|                   | 026:016 | type=0x003                               | See section "6.1.1".                                                                                                                                                                                                                                                                                                    |  |  |  |  |
|                   | 031:027 | rsvd                                     | Shall be set to 0                                                                                                                                                                                                                                                                                                       |  |  |  |  |
| u32 size;         | 033:032 | rsvd                                     | Shall be set to 0                                                                                                                                                                                                                                                                                                       |  |  |  |  |
|                   | 036:034 | osz                                      | The size of the operands in 4-byte increments minus 1:<br>000b = 4 bytes, 001b = 8 bytes. All other encodings reserved                                                                                                                                                                                                  |  |  |  |  |
|                   | 063:037 | rsvd                                     | Shall be set to 0                                                                                                                                                                                                                                                                                                       |  |  |  |  |
| u8 attr;          | 067:064 | attr_dst                                 | Destination buffer attributes                                                                                                                                                                                                                                                                                           |  |  |  |  |
|                   | 071:068 | rsvd                                     | Shall be set to 0.                                                                                                                                                                                                                                                                                                      |  |  |  |  |
| u8 rsvd_0[3];     | 095:072 |                                          | Shall be set to 0.                                                                                                                                                                                                                                                                                                      |  |  |  |  |
| u16 akey0;        | 111:096 | (dst)                                    | Destination buffer AKey                                                                                                                                                                                                                                                                                                 |  |  |  |  |
| u8 rsvd_1[2];     | 127:112 |                                          | Shall be set to 0.                                                                                                                                                                                                                                                                                                      |  |  |  |  |
| u64 addr0;        | 129:128 | rsvd                                     | Shall be set to "00b" since addr0 must always be either 4-byte or 8-byte aligned.                                                                                                                                                                                                                                       |  |  |  |  |
|                   | 130     | n                                        | Shall be set to "0b" for 8-byte alignment if address0 is an 8-<br>byte operand (osz = "001b")                                                                                                                                                                                                                           |  |  |  |  |
|                   | 191:131 | addr0_dst                                | Destination buffer memory address to target for the atomic operation. The address shall be aligned to the operand size.                                                                                                                                                                                                 |  |  |  |  |
| u64 op1;          | 255:192 |                                          | Operand 1 Data. If the operand size is set to 4 bytes, the lower 32 bits of Operand1Data are utilized.                                                                                                                                                                                                                  |  |  |  |  |
| u64 op2;          | 319:256 |                                          | Operand 2 Data. If the operand size is set to 4 bytes, the lower 32 bits of op2 are utilized.                                                                                                                                                                                                                           |  |  |  |  |
| u64 ret_data_ptr; | 320     | nr                                       | No Return (nr)<br>When 0, the SDXI function shall return the original memory value<br>at address0 to the location pointed to by ret_data_ptr.<br>When 1, ret_data_ptr shall be ignored, and the original memory<br>value at address0 will not be returned. Software will set<br>ret_data_ptr to 0 when setting nr to 1. |  |  |  |  |
|                   | 321     | rsvd                                     | Shall be set to 0.                                                                                                                                                                                                                                                                                                      |  |  |  |  |
|                   | 383:322 | ret_data_ptr                             | Pointer to Atomic Return Data, a naturally aligned operand size<br>location in memory. The original memory value at address0 will<br>be written back to this location when the nr bit is clear. If the nr bit<br>is set, no data is returned.                                                                           |  |  |  |  |
| u8 rsvd_2[8];     | 447:384 |                                          | Shall be set to 0.                                                                                                                                                                                                                                                                                                      |  |  |  |  |
| u64 csb_ptr;      | 448     | np                                       | no_pointer. See section "6.1.1".                                                                                                                                                                                                                                                                                        |  |  |  |  |
|                   | 452:449 | rsvd                                     | Shall be set to 0                                                                                                                                                                                                                                                                                                       |  |  |  |  |
|                   | 511:453 | csb_ptr                                  | Completion Status Block pointer. See section "6.1.1".                                                                                                                                                                                                                                                                   |  |  |  |  |

| Sub<br>Type | Operation<br>Subset | Operation                                                                                                               |
|-------------|---------------------|-------------------------------------------------------------------------------------------------------------------------|
| 0x01        | Full, Min           | Swap (DSC_ATM_SWAP):<br>*address0 = operand1;                                                                           |
| 0x02        | Full, Min           | Unsigned Add (DSC_ATM_UADD) <sup>7</sup> :<br>*address0 += operand1;                                                    |
| 0x03        | Full                | Unsigned Subtract (DSC_ATM_USUB):<br>*address0 -= operand1;                                                             |
| 0x05        | Full                | AND (DSC_ATM_AND):<br>*address0 &= operand1;                                                                            |
| 0x06        | Full                | OR (DSC_ATM_OR):<br>*address0  = operand1;                                                                              |
| 0x07        | Full                | XOR (DSC_ATM_XOR):<br>*address0 ^= operand1;                                                                            |
| 0x08        | Full                | Signed Min (DSC_ATM_SMIN) <sup>5</sup> :<br>if (*address0 > operand1) { *address0 = operand1; }                         |
| 0x09        | Full                | Signed Max (DSC_ATM_SMAX) <sup>5</sup> :<br>if (*address0 < operand1) { *address0 = operand1; }                         |
| 0x0A        | Full                | Unsigned Min (DSC_ATM_UMIN) <sup>5</sup> :<br>if (*address0 > operand1) { *address0 = operand1; }                       |
| 0x0B        | Full                | Unsigned Max (DSC_ATM_UMAX) <sup>5</sup> :<br>if (*address0 < operand1) { *address0 = operand1; }                       |
| 0x0C        | Full                | Unsigned Increment (DSC_ATM_UINC):<br>*address0 = (*address0 >= operand1) ? 0 : (*address) + 1;                         |
| 0x0D        | Full                | Unsigned Decrement (DSC_ATM_UDEC):<br>*address0 = (*address0 == 0    *address0 > operand1) ? operand1 : (*address - 1); |
| 0x0E        | Full, Min           | Compare and Swap (DSC_ATM_CMPSWAP) <sup>5</sup> :<br>if (*address0 == operand1) { *address0 = operand2; }               |
|             | •                   | All other encodings are Reserved                                                                                        |

Table 6–11: Atomic Operations Sub-Type<sup>1,2,3,4</sup>

1. The sequence of reading and writing (or releasing) address0 by the SDXI function must be atomic with respect to operations of other agents on address0.

- 2. "\*nnn" means the memory contents pointed to by nnn.
- 3. All operations are performed at the specified operand size; all operands must be naturally aligned to that size.
- 4. When the "nr" field is zero, the SDXI function shall write the previous destination value (original memory value at address0) to the location pointed to by the ret\_data\_ptr field. if (nr == 0){ \*ret\_data\_ptr = \*address0; }
- 5. Note: Provided that the atomicity requirements are met, an SDXI implementation is permitted to write back the unchanged content of address0 when the condition is false.
- 6. The SDXI function may perform the write to the Atomic Return Data location (pointed to by the "ret\_data\_ptr" field) in any order with respect to the atomic update of the destination location. The SDXI function shall ensure that both the Atomic Return Data write and the destination update are completed and made globally visible before updating the descriptor's completion status block.
- 7. Note that DSC\_ATM\_UADD can effectively perform a subtract by supplying the two's complement of the value to subtract as the argument value to the operation since overflow is not detected.

## 6.4 IntrGrp Operation Group

## 6.4.1 IntrGrp DSC\_INTR Operation

The IntrGrp's Interrupt operation is used to generate an interrupt. The interrupt number and address space are specified in the referenced AKey.

| 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 | 15 14 13 12 11 10 9 8 | 7 6 5 4 3 2 1 0                  |        |
|-------------------------------------------------|-----------------------|----------------------------------|--------|
| rsv type = 0x004                                | subtype = 0x00        | rsv <mark>csr ch fe se vl</mark> | + 0x00 |
| 1                                               | SV                    |                                  | + 0x04 |
|                                                 |                       |                                  | + 0x08 |
| rsv                                             | ak                    | xey                              | + 0x0C |
|                                                 |                       |                                  | + 0x10 |
|                                                 |                       |                                  | + 0x14 |
|                                                 |                       |                                  | + 0x18 |
|                                                 |                       |                                  | + 0x1C |
|                                                 |                       |                                  | + 0x20 |
|                                                 |                       |                                  | + 0x24 |
|                                                 |                       |                                  | + 0x28 |
|                                                 |                       |                                  | + 0x2C |
|                                                 |                       |                                  | + 0x30 |
|                                                 |                       |                                  | + 0x34 |
| CS                                              | b_ptr                 | rsv np                           | + 0x38 |
|                                                 |                       |                                  | + 0x3C |

#### Figure 6-9: DSC\_INTR Descriptor Format

#### Table 6–12: DSC\_INTR<sup>[^7]</sup> Descriptor Format

| Field          | Bits    | Subfield     | Description                                            |
|----------------|---------|--------------|--------------------------------------------------------|
| u32 opcode;    | 000     | vl = 1       | Valid. See section "6.1.1".                            |
|                | 001     | se = *       | Sequential Consistency. See section "6.1.1".           |
|                | 002     | fe = *       | Fence. See section "6.1.1".                            |
|                | 003     | ch = 0       | Chain. See section "6.1.1". Shall be set to 0.         |
|                | 004     | csr          | Completion Status Mode Requirement for the descriptor. |
|                | 007:005 | rsvd         | Shall be set to 0                                      |
|                | 015:008 | subtype=0x00 | See section "6.1.1".                                   |
|                | 026:016 | type=0x004   | See section "6.1.1".                                   |
|                | 031:027 | rsvd         | Shall be set to 0                                      |
| u8 rsvd_0[8];  | 095:032 |              | Shall be set to 0.                                     |
| u16 akey;      | 111:096 |              | AKey used for this operation                           |
| u8 rsvd_1[42]; | 447:112 |              | Shall be set to 0.                                     |
| u64 csb_ptr;   | 448     | np           | no_pointer. See section "6.1.1".                       |
|                | 452:449 | rsvd         | Shall be set to 0                                      |
|                | 511:453 | csb_ptr      | Completion Status Block pointer. See section "6.1.1".  |

# 6.5 Vendor Defined Operation Group

Developer Note: A GUID-based DSC\_DISC discovery mechanism and function assigned opcodes scheme is planned to replace the Vendor Defined Operation Group feature. Refer to the SDXI 1.1 specification. Use of the SDXI 1.0 Vendor Defined Operation Group feature is discouraged.

The desired expansion model for new SDXI function operations is to create a new architectural operation group within this specification with architected operations. This prevents fragmentation of the architecture. But there may be rare circumstances where an SDXI function vendor has a need to create a non-standard, vendor-specific operation. Such an operation shall use the format of the DSC\_VENDOR descriptor. Using a standard format allows leveraging of the standard software ecosystem for SDXI.

| 31 30 29 28 27 26 25 24 23 22 21 20 19 | 18 17 16 | 15 14 13 | 12 11 10 | 98 | 7 | 6 5 | 5 4              | 3  | 2  | 1 0  |        |
|----------------------------------------|----------|----------|----------|----|---|-----|------------------|----|----|------|--------|
| rsv type                               |          | 9        | subtype  |    | r | rsv | <mark>csr</mark> | ch | fe | se v | + 0x00 |
|                                        | vend     | lor      |          |    |   |     |                  |    |    |      | + 0x04 |
|                                        |          |          |          |    |   |     |                  |    |    |      | + 0x08 |
|                                        |          |          |          |    |   |     |                  |    |    |      | + 0x0C |
|                                        |          |          |          |    |   |     |                  |    |    |      | + 0x10 |
|                                        |          |          |          |    |   |     |                  |    |    |      | + 0x14 |
|                                        |          |          |          |    |   |     |                  |    |    |      | + 0x18 |
|                                        |          |          |          |    |   |     |                  |    |    |      | + 0x1C |
|                                        |          |          |          |    |   |     |                  |    |    |      | + 0x20 |
|                                        |          |          |          |    |   |     |                  |    |    |      | + 0x24 |
|                                        |          |          |          |    |   |     |                  |    |    |      | + 0x28 |
|                                        |          |          |          |    |   |     |                  |    |    |      | + 0x2C |
|                                        |          |          |          |    |   |     |                  |    |    |      | + 0x30 |
|                                        |          |          |          |    |   |     |                  |    |    |      | + 0x34 |
|                                        | csb_     | _ptr     |          |    |   |     | <u> </u>         | rs | v  | n    | + 0x38 |
|                                        |          |          |          |    |   |     |                  |    |    |      | + 0x3C |

### Figure 6-10: DSC\_VENDOR Descriptor Format

### Table 6–13: DSC\_VENDOR<sup>[^3]</sup> Descriptor Format

| Field          | Bits    | Subfield | Description                                                                   |
|----------------|---------|----------|-------------------------------------------------------------------------------|
| u32 opcode;    | 000     | vl = 1   | Valid. See section "6.1.1".                                                   |
|                | 001     | se = *   | Sequential Consistency. See section "6.1.1".                                  |
|                | 002     | fe = *   | Fence. See section "6.1.1".                                                   |
|                | 003     | ch = 0   | Chain. See section "6.1.1". Shall be set to 0.                                |
|                | 004     | csr      | Completion Status Mode Requirement for the descriptor.                        |
|                | 007:005 | rsvd     | Shall be set to 0                                                             |
|                | 015:008 | subtype  | Vendor-Defined.                                                               |
|                | 026:016 | type     | Vendor-Defined but must be between 0x01F – 0x01C.                             |
|                | 031:027 | rsvd     | Shall be set to 0                                                             |
| u8 vendor[52]; | 447:032 |          | These bits are defined by the vendor based on the VendorID and SubType fields |
| u64 csb_ptr;   | 448     | np       | no_pointer. See section "6.1.1".                                              |
|                | 452:449 | rsvd     | Shall be set to 0                                                             |
|                | 511:453 | csb_ptr  | Completion Status Block pointer. See section "6.1.1".                         |

## 6.6 Administrative Operation Group (AdminGrp)

Administrative operations are used by privileged software to manage the SDXI function. A key set of such operations comprise the Administrative Operation Group (AdminGrp). Software shall issue AdminGrp operations only in Context 0 ("Administrative" Context) within each function; the SDXI function shall generate a context error if an AdminGrp operation is issued on any other context. AdminGrp operations are described below and in the following sections.

- The AdminGrp's DSC\_UPD\_[FN, CXT, AKEY, RKEY] update operations are used by privileged software to indicate changes to various memory data structures. Update operations shall be used even when transitioning data structures from the invalid to valid state. This is done to simplify software-emulated implementations.
- The AdminGrp's DSC\_SYNC operation is used as a barrier after one or more update operations. After completion of the DSC\_SYNC, the effects of prior AdminGrp updates to the structures selected by the DSC\_SYNC operation shall be seen by all new descriptors.
- The AdminGrp's DSC\_CXT\_START\_[NM, RS] and DSC\_CXT\_STOP operations may be used to control the execution state of a target context.
- The AdminGrp's DSC\_ADM\_INTR operation is used to generate interrupts from the local function hosting the administrative context.

## 6.6.1 Accessing Contexts, Akey Tables, and RKey Table by Index

An Administrative operation may operate on a context entry, an entry in an Akey table, or an entry in the RKey table; each of these entries are referenced by an index within their containing structure. For the function to operate on an indexed entry, the index must be within the specific limit for that structure. The function shall log an error when an operation specifies an index that is outside of the appropriate limits.

Many administrative operations may be applied to a range of entries within a data structure. The range is specified explicitly in the descriptor as a starting index "\_start" field and an ending index "\_end" field; implicitly the relevant index limit for the structure is also used. (For example, to operate on contexts 1 through 127 inclusive, software would set cxt\_start to "1" and cxt\_end to "127".) The range of the specified entries is operated upon as shown in the following pseudo-code.

```
Figure 6-11: Operation on Range of Entries
// Let entry_type indicate the type of referenced data-structure (CXT, AKEY, RKEY)
   if ( _start > _end
         || ( entry_type == CXT
               && ( MMIO_CTL2.max_cxt > MMIO_CAP1.max_cxt
                    || _end > MMIO_CTL2.max_cxt
                  )
            )
         || ( entry_type == AKEY
               && ( MMIO_CTL2.max_akey_sz > MMIO_CAP1.max_akey_sz
                    || CXT_L1_ENT.akey_sz > MMI0_CTL2.max_akey_sz
                    || _end >= 2**(8 + CXT_L1_ENT.akey_sz)
                  )
            )
         || ( entry_type == RKEY
               && ( MMIO_RKEY.tbl_sz > MMIO_CAP0.max_rkey_sz
                    || _end >= 2**(8 + MMIO_RKEY.tbl_sz)
                  )
            )
  ) {
         error();
   }
   // Asynchronous, Parallel Issue of Operation on Entries
   index = _start;
  while ( index <= _end ) {</pre>
     async_issue_operation( index );
      index++;
   }
```
### 6.6.2 Targeting Multiple Contexts with A Single Administrative Operation

Many administrative operations are defined such that a single operation may target the entire function or selected multiple contexts. For such an operation, the SDXI function may evaluate individual target contexts in any order and in parallel. The following describes the resulting architectural behavior.

Evaluating Contexts:

- 1. A successful completion or error result from operating on one context is unordered with respect to another targeted context.
- 2. The SDXI function may terminate an operation due to an individual context error at any point and may skip evaluation of remaining contexts. It is implementation-specific if an error detected in one context affects whether any other targeted context is evaluated.

Returning Results and Completion Status:

- 1. If the administrative operation results in an architectural state change or explicit synchronization barrier (DSC\_CXT\_START\_[NM, RS], DSC\_CXT\_STOP, and DSC\_SYNC), then the completion status is signaled in the following way. (Note, each operation shall specify which background actions-must be completed for the function to signal completion.)
  - a. If all target contexts have been operated upon successfully without error, the operation returns a successful completion status block.
  - b. If the operation encounters an error in at least one context, the operation will return an unsuccessful completion. Note, an operation shall only log one error even if the operation evaluates multiple contexts with error.
  - c. Regardless of overall success or failure, the SDXI function shall cease evaluating targeted contexts before generating a Completion Status Block and, if required, logging an error.
- 2. If the administrative operation is a data-structure update operation to any non-coherently cached copies of function state (DSC\_ADM\_UPD\_[FN, CXT, AKEY, RKEY]), then the completion status merely signals that the function has begun the update action. Completion of the operation does not imply completion of the requested update action(s). Further behavior follows for the update action(s).
  - a. Performing an update action on an invalid context shall have no effect and logs no error.
  - b. If all target contexts have been acted upon successfully without error, then no error log is generated.
  - c. If any update action encounters an error in at least one context, the update action will log an error. It is implementation-dependent if an update action may encounter an error and if multiple update action errors log one or more errors.

A number of operations are idempotent -- viz. they may be executed several times with the same input without changing the final result beyond its first execution. This may be useful if an operation targeting multiple contexts has failed and needs to be resubmitted after remediation. Consult the description of the actual operation for more details.

### 6.6.3 AdminGrp DSC\_CXT\_START\_[NM, RS] Operations

The AdminGrp's DSC\_CXT\_START\_[NM, RS] operation initiates background actions that start all valid target contexts by changing each context's CXT\_STS.state value from CXTV\_STOP\_[SW, FN] to CXTV\_RUN. The targeted contexts are described using the cxt\_start and cxt\_end fields. The operation may also optionally signal the context to run using the supplied doorbell\_value. The doorbell\_value is evaluated per the rules specified in "4.3.3, Doorbell Register and Context Signaling".

DSC\_CXT\_START\_NM is intended to be used by the privileged software that administers the target context. The operation transitions CXTV\_STOP\_SW, CXTV\_STOP\_FN, and CXTV\_RUN to CXTV\_RUN for valid contexts; it will log an error if the target context is not in one of the above states for valid contexts.

DSC\_CXT\_START\_RS is intended to be used by hypervisor and privileged software to restore a context to CXTV\_RUN that had been previously suspended to CXTV\_STOP\_FN when the function transitioned out of GSV\_ACTIVE. The operation transitions CXTV\_STOP\_FN and CXTV\_RUN to CXTV\_RUN for valid contexts; it shall skip and *not* log an error if the target context is not in one of the above states or the context is invalid. Note, that DSC\_CXT\_START\_RS shall skip contexts in the CXTV\_STOP\_SW state. For each context that the function evaluates for a start operation, the function executes the following ordered set of steps:

- 1. Performs the Context Valid check (ChkValid:Cxt) described in "4.3.2, Check Valid Context". If the check fails, the function shall not modify CXT\_STS.state, skip the remaining steps and further evaluation of the context, and take further action based on the type of start operation being executed and the ChkValid:Cxt failure signature.
  - For DSC\_CXT\_START\_NM with any ChkValid:Cxt failure signature, log an error.
  - For DSC\_CXT\_START\_RS with a ChkValid:Cxt failure signature of LogErr:Cxt, log an error.
  - For DSC\_CXT\_START\_RS with a ChkValid:Cxt failure signature of Invalid:Cxt, do not log an error.
- 2. Using the value of CXT\_STS.state obtained in the previous step, if the value of CXT\_STS.state is neither CXTV\_RUN nor CXTV\_STOP\_[SW, FN], the function shall perform the following actions based on the type of start operation being executed.
  - For DSC\_CXT\_START\_NM, log an error, do not modify CXT\_STS.state, and skip the remaining steps and further evaluation of the context.
  - For DSC\_CXT\_START\_RS, do not log an error, do not modify CXT\_STS.state, and skip the remaining steps and further evaluation of the context.
- 3. For DSC\_CXT\_START\_RS, if CXT\_STS.state is CXTV\_STOP\_SW, do not log an error, do not modify CXT\_STS.state, and skip the remaining steps and further evaluation of the context.
- 4. At this point, if CXT\_STS.state is either CXTV\_RUN or CXTV\_STOP\_[SW, FN]:
  - The function will issue an atomic write of the CXT\_STS.state to CXTV\_RUN.
  - The function will then ensure that the value of CXT\_STS.state is CXTV\_RUN. If it is not, the function shall log an error. The function may use any implementation-specific mechanism to ensure this including an atomic read back of CXT\_STS.state.
- 5. If the modification of CXT\_STS.state to CXTV\_RUN is successful, the context is started (transitioned to the CXTV\_RUN state).

If all target contexts have been started or skipped successfully without error, these operations return a successful completion; otherwise, they return failure and log errors. See *"5.5, Descriptor Completion"* for more details.

In the case of successful completion and only after the completion status block has been written, these operations are required to evaluate a successfully started context's descriptor ring for new descriptors only if a doorbell\_value is specified in the start descriptor. This is done in the descriptor by setting "dv" and placing a value in the doorbell\_value field. See *"4.3.3, Doorbell Register and Context Signaling"* for how

doorbell\_value is used. The function is not required to evaluate a started context's descriptor ring when "dv" is not set.

In the case where a start operation has failed, idempotency is ensured; the operation may be re-submitted several times with the same input without changing the final result beyond its first execution until the operation succeeds. This may be useful if an operation targeting multiple contexts has failed and needs to be resubmitted after remediation.

| 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 | 15 14 13 12 11 10 9 8 | 7 6 5 4 3 2 1 0                   |        |
|-------------------------------------------------|-----------------------|-----------------------------------|--------|
| rsv type = 0x002                                | subtype               | rsv <mark>csr ch fe+ se vl</mark> | + 0x00 |
| vf_num                                          | vf dv rsv             | rsv                               | + 0x04 |
| cxt_end                                         | cxt_                  | start                             | + 0x08 |
| r.                                              | sv                    |                                   | + 0x0C |
| db_                                             | value                 |                                   | + 0x10 |
|                                                 |                       |                                   | + 0x14 |
| r.                                              | sv                    |                                   | + 0x18 |
|                                                 |                       |                                   | + 0x1C |
|                                                 |                       |                                   | + 0x20 |
|                                                 |                       |                                   | + 0x24 |
|                                                 |                       |                                   | + 0x28 |
|                                                 |                       |                                   | + 0x2C |
|                                                 |                       |                                   | + 0x30 |
|                                                 |                       |                                   | + 0x34 |
| csk                                             | o_ptr                 | rsv np                            | + 0x38 |
|                                                 |                       |                                   | + 0x3C |

### Figure 6-12: DSC\_CXT\_START\_[NM, RS] Descriptor Format

#### Table 6–14: DSC\_CXT\_START<sup>[^7]</sup> Descriptor Format

| Field          | Bits    | Subfield      | Description                                                                                                                                                                                                    |
|----------------|---------|---------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| u32 opcode;    | 000     | vl = 1        | Valid. See section <i>"6.1.1".</i>                                                                                                                                                                             |
|                | 001     | se = *        | Sequential Consistency. See section "6.1.1". Shall be set to 0.                                                                                                                                                |
|                | 002     | fe = 1        | Fence. See section "6.1.1".                                                                                                                                                                                    |
|                | 003     | ch = 0        | Chain. See section "6.1.1". Shall be set to 0.                                                                                                                                                                 |
|                | 004     | csr           | Completion Status Mode Requirement for the descriptor.                                                                                                                                                         |
|                | 007:005 | rsvd          | Shall be set to 0                                                                                                                                                                                              |
|                | 015:008 | subtype= NM:0 | x03, RS:0x08                                                                                                                                                                                                   |
|                | 026:016 | type=0x002    | See section "6.1.1".                                                                                                                                                                                           |
|                | 031:027 | rsvd          | Shall be set to 0                                                                                                                                                                                              |
| u8 rsvd_0;     | 039:032 |               | Shall be set to 0                                                                                                                                                                                              |
| u8 vflags;     | 045:040 | rsvd          | Shall be set to 0                                                                                                                                                                                              |
|                | 46      | dv            | doorbell_value valid.<br>When set to 1, after starting the context, the function evaluates<br>the context's descriptor ring for new descriptors using the<br>doorbell_value field.                             |
|                | 47      | vf            | "vf" valid.<br>1 = Update the VF number indicated by vf_number<br>0 = Update within the local function context executing this<br>descriptor<br>(For VF administrative contexts, this field shall be set to 0.) |
| u16 vf_num;    | 063:048 |               | "vf" number.<br>Only valid for PF administrative contexts. When VF=1, this field<br>indicates the VF to update.<br>(For VF administrative contexts, this field shall be set to 0.)                             |
| u16 cxt_start; | 079:064 |               | Indicates the starting context number to operate on.                                                                                                                                                           |
| u16 cxt_end;   | 095:080 |               | Indicates the ending (last) context number to operate on.                                                                                                                                                      |
| u8 rsvd_1[4];  | 127:096 |               | Shall be set to 0                                                                                                                                                                                              |
| u64 db_value;  | 191:128 |               | 64-bit doorbell_value used when "dv" is "1".                                                                                                                                                                   |
| u8 rsvd_2[32]; | 447:192 |               | Shall be set to 0                                                                                                                                                                                              |
| u64 csb_ptr;   | 448     | np            | no_pointer. See section <i>"6.1.1".</i>                                                                                                                                                                        |
|                | 452:449 | rsvd          | Shall be set to 0                                                                                                                                                                                              |
|                | 511:453 | csb_ptr       | Completion Status Block pointer. See section "6.1.1".                                                                                                                                                          |

### 6.6.4 AdminGrp DSC\_CXT\_STOP Operation

The DSC\_CXT\_STOP operation initiates background actions that stop all target contexts from processing new descriptors and changes each target context's state to a relevant final stopped state. The target context shall be stopped at a descriptor boundary. See "4.3.5, Function and Context Stop Actions" for *m*ore details. The targeted contexts are described using the cxt\_start and cxt\_end fields. The DSC\_CXT\_STOP operation shall complete in a timely manner so that further administrative operations may be executed.

The SDXI function shall signal completion of the operation when it determines whether the required background stopping actions can be initiated without error. The operation's completion does not indicate the success nor failure of the background stopping actions for the target context(s) nor does it imply whether the stopping actions have begun or finished.

When the stopping actions finish for a valid context, the CXT\_STS.state value shall be either: CXTV\_STOPG\_[SW, FN]; or CXTV\_ERR\_FN if a background stopping action encounters an error or abort in stopping a context. If CXTV\_ERR\_FN is returned, an error shall be logged. The stopping actions initiated by this operation ignore invalid contexts (CXTV\_INVALID).

Software may determine stop completion by one of these methods:

- Submit a DSC\_SYNC.STOP operation and wait for its completion.
- Check each target context's CXT\_STS.state to determine whether the target context is at CXTV\_STOP\_[SW, FN] or CXTV\_ERR\_FN.

The DSC\_CXT\_STOP operation specifies a context completion wait policy using the Hard Stop (HS) bit in the descriptor. When HS = 0, the function shall perform a soft-stop completion wait. When HS = 1, the function shall perform a hard-stop completion wait. See *"4.3.5, Function and Context Stop Actions"* for details.

Multiple DSC\_CXT\_STOP operations may be submitted against the same context regardless of the value of its CXT\_STS.state and the progress of previous stop actions upon it. This allows software to initiate a subsequent hard-stop completion if a previous soft-stop completion is not making acceptable forward progress.

But note that if a DSC\_SYNC.STOP operation is issued between two DSC\_CXT\_STOP operations targeting the same context, the second DSC\_CXT\_STOP shall have no effect on the stopping actions initiated by the first DSC\_CXT\_STOP operation.

In the case where the DSC\_CXT\_STOP operation has failed, idempotency is ensured; the operation may be re-submitted several times with the same input without changing the final result beyond its first execution until the operation succeeds. This may be useful if an operation targeting multiple contexts has failed and needs to be resubmitted after remediation.

From the time the operation is submitted until the time that the CXT\_STS.state of a targeted valid context becomes CXTV\_STOP\_[SW, FN] or CXTV\_ERR\_FN, software shall not modify any memory-based data structure encompassed by LVL\_CXT\_CTL. Doing so may cause the context to enter undefined operation. (See "4.3.1.2, Software Procedure For Modifying Memory-Based Data Structures")

Note that this operation is fenced with respect to prior descriptor operations on the same administrative context. See "6.1.1, Common Header and Footer" for details.

| 31 30 29 28 27 | 26 25 24 23 22 21 20 19 18 17 16 | 15 14 13 12 11 10 9 8                   | 7 6 5 4 3 2 1 0                   |        |
|----------------|----------------------------------|-----------------------------------------|-----------------------------------|--------|
| rsv            | type = 0x002                     | subtype = 0x04                          | rsv <mark>csr ch</mark> fe+ se vl | + 0x00 |
|                | vf_num                           | <mark>vf</mark> rsv <mark>hs</mark> rsv | rsv                               | + 0x04 |
|                | cxt_end                          | cxt_                                    | start                             | + 0x08 |
|                | rs                               | 5V                                      |                                   | + 0x0C |
|                |                                  |                                         |                                   | + 0x10 |
|                |                                  |                                         |                                   | + 0x14 |
|                |                                  |                                         |                                   | + 0x18 |
|                |                                  |                                         |                                   | + 0x1C |
|                |                                  |                                         |                                   | + 0x20 |
|                |                                  |                                         |                                   | + 0x24 |
|                |                                  |                                         |                                   | + 0x28 |
|                |                                  |                                         |                                   | + 0x2C |
|                |                                  |                                         |                                   | + 0x30 |
|                |                                  |                                         |                                   | + 0x34 |
|                | csb                              | _ptr                                    | rsv <mark>np</mark>               | + 0x38 |
|                |                                  |                                         |                                   | + 0x3C |

#### Figure 6-13: DSC\_CXT\_STOP Descriptor Format

| Table 6–15: | DSC_CXT | _STOP <sup>[^7]</sup> Desc | riptor Format |
|-------------|---------|----------------------------|---------------|
|-------------|---------|----------------------------|---------------|

| Field          | Bits    | Subfield     | Description                                                                                  |
|----------------|---------|--------------|----------------------------------------------------------------------------------------------|
| u32 opcode;    | 000     | vl = 1       | Valid. See section "6.1.1".                                                                  |
|                | 001     | se = *       | Sequential Consistency. See section "6.1.1". Shall be set to 0.                              |
|                | 002     | fe = 1       | Fence. See section "6.1.1".                                                                  |
|                | 003     | ch = 0       | Chain. See section "6.1.1". Shall be set to 0.                                               |
|                | 004     | csr          | Completion Status Mode Requirement for the descriptor.                                       |
|                | 007:005 | rsvd         | Shall be set to 0                                                                            |
|                | 015:008 | subtype=0x04 | See section "6.1.1".                                                                         |
|                | 026:016 | type=0x002   | See section "6.1.1".                                                                         |
|                | 031:027 | rsvd         | Shall be set to 0                                                                            |
| u8 rsvd_0;     | 039:032 |              | Shall be set to 0                                                                            |
| u8 vflags;     | 044:040 | rsvd         | Shall be set to 0                                                                            |
|                | 045     | hs           | 0 = Soft-Stop Completion.                                                                    |
|                | 46      | roud         | Shell be get to 0                                                                            |
|                | 40      | Isvu         |                                                                                              |
|                | 47      | vf           | "vf" valid.<br>1 = Update the VF number indicated by vf_number                               |
|                |         |              | 0 = Update within the local function context executing this descriptor                       |
|                |         |              | (For VF administrative contexts, this field shall be set to 0.)                              |
| u16 vf_num;    | 063:048 |              | "vf" number.                                                                                 |
|                |         |              | Only valid for PF administrative contexts. When VF=1, this field indicates the VF to update. |
|                |         |              | (For VF administrative contexts, this field shall be set to 0.)                              |
| u16 cxt_start; | 079:064 |              | Indicates the starting context number to operate on.                                         |
| u16 cxt_end;   | 095:080 |              | Indicates the ending (last) context number to operate on.                                    |
| u8 rsvd_1[44]; | 447:096 |              | Shall be set to 0                                                                            |
| u64 csb_ptr;   | 448     | np           | no_pointer. See section "6.1.1".                                                             |
|                | 452:449 | rsvd         | Shall be set to 0                                                                            |
|                | 511:453 | csb_ptr      | Completion Status Block pointer. See section "6.1.1".                                        |

### 6.6.5 AdminGrp DSC\_AKEY\_UPD Operation

The AdminGrp's DSC\_AKEY\_UPD operation is used by software to indicate a change to one or more AKey table entries associated with one or more contexts of a target function. This operation signals the target SDXI function to update its internal copies, if any, of the selected AKey table entries. (See "4.3.1, SDXI Memory-Based Data-Structure Hierarchy and Caching")

The affected contexts are described using the cxt\_start and cxt\_end fields. The affected AKey table indices are selected using the akey\_start and akey\_end fields. Note that a successful completion of a DSC\_AKEY\_UPD operation only indicates that background update actions shall be started; it does not indicate that the update actions have completed nor that their effects are visible. Software shall use the DSC\_SYNC operation to ensure that the background update actions have completed.

| 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 | 15 14 13 12 11 10 9 8 | 7 6 5 4 3 2 1 0                  |        |
|-------------------------------------------------|-----------------------|----------------------------------|--------|
| rsv type = 0x002                                | subtype = 0x02        | rsv <mark>csr ch fe se vl</mark> | + 0x00 |
| vf_num                                          | vf rsv                | rsv                              | + 0x04 |
| cxt_end                                         | cxt_s                 | tart                             | + 0x08 |
| akey_end                                        | akey_                 | start -                          | + 0x0C |
| rs                                              | v                     |                                  | + 0x10 |
|                                                 |                       |                                  | + 0x14 |
|                                                 |                       |                                  | + 0x18 |
|                                                 |                       | -                                | + 0x1C |
|                                                 |                       |                                  | + 0x20 |
|                                                 |                       |                                  | + 0x24 |
|                                                 |                       |                                  | + 0x28 |
|                                                 |                       | -                                | + 0x2C |
|                                                 |                       |                                  | + 0x30 |
|                                                 |                       |                                  | + 0x34 |
| csb                                             | ptr                   | rsv <mark>np</mark>              | + 0x38 |
|                                                 |                       | -                                | + 0x3C |

#### Figure 6-14: DSC\_AKEY\_UPD Descriptor Format

#### Table 6–16: DSC\_ADM\_AKEY\_UPD<sup>[^7]</sup> Descriptor Format

| Field           | Bits    | Subfield     | Description                                                                                                                                                                                                    |  |  |  |
|-----------------|---------|--------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|--|
| u32 opcode;     | 000     | vl = 1       | Valid. See section "6.1.1".                                                                                                                                                                                    |  |  |  |
|                 | 001     | se = *       | Sequential Consistency. See section "6.1.1". Shall be set to 0.                                                                                                                                                |  |  |  |
|                 | 002     | fe = *       | Fence. See section "6.1.1".                                                                                                                                                                                    |  |  |  |
|                 | 003     | ch = 0       | Chain. See section "6.1.1". Shall be set to 0.                                                                                                                                                                 |  |  |  |
|                 | 004     | csr          | Completion Status Mode Requirement for the descriptor.                                                                                                                                                         |  |  |  |
|                 | 007:005 | rsvd         | Shall be set to 0                                                                                                                                                                                              |  |  |  |
|                 | 015:008 | subtype=0x02 | See section "6.1.1".                                                                                                                                                                                           |  |  |  |
|                 | 026:016 | type=0x002   | See section "6.1.1".                                                                                                                                                                                           |  |  |  |
|                 | 031:027 | rsvd         | Shall be set to 0                                                                                                                                                                                              |  |  |  |
| u8 rsvd_0;      | 039:032 |              | Shall be set to 0                                                                                                                                                                                              |  |  |  |
| u8 vflags;      | 046:040 | rsvd         | Shall be set to 0                                                                                                                                                                                              |  |  |  |
|                 | 47      | vf           | "vf" valid.<br>1 = Update the VF number indicated by vf_number<br>0 = Update within the local function context executing this<br>descriptor<br>(For VF administrative contexts, this field shall be set to 0.) |  |  |  |
| u16 vf_num;     | 063:048 |              | "vf" number.<br>Only valid for PF administrative contexts. When VF=1, this field<br>indicates the VF to update.<br>(For VF administrative contexts, this field shall be set to 0.)                             |  |  |  |
| u16 cxt_start;  | 079:064 |              | Indicates the starting context number to operate on.                                                                                                                                                           |  |  |  |
| u16 cxt_end;    | 095:080 |              | Indicates the ending (last) context number to operate on.                                                                                                                                                      |  |  |  |
| u16 akey_start; | 111:096 |              | Indicates the starting AKey number to operate on.                                                                                                                                                              |  |  |  |
| u16 akey_end;   | 127:112 |              | Indicates the ending (last) AKey number to operate on.                                                                                                                                                         |  |  |  |
| u8 rsvd_1[40];  | 447:128 |              | Shall be set to 0                                                                                                                                                                                              |  |  |  |
| u64 csb_ptr;    | 448     | np           | no_pointer. See section "6.1.1".                                                                                                                                                                               |  |  |  |
|                 | 452:449 | rsvd         | Shall be set to 0                                                                                                                                                                                              |  |  |  |
|                 | 511:453 | csb_ptr      | Completion Status Block pointer. See section "6.1.1".                                                                                                                                                          |  |  |  |

### 6.6.6 AdminGrp DSC\_CXT\_UPD Operation

The AdminGrp's DSC\_CXT\_UPD operation is used by software to indicate a change to memory data structures describing one or more contexts of a target function. For each context specified, the operation signals the function to start the updating of all internal copies, if any, of all memory-based data structures and subsidiary structures associated with the context at the specified data-structure hierarchy level. (See *"4.3.1, SDXI Memory-Based Data-Structure Hierarchy and Caching"*.) The targeted contexts are described using the cxt\_start and cxt\_end fields. The data-structure level (dsl) field within the descriptor determines whether the LVL\_L2, LVL\_L1, or LVL\_CXT\_CTL level associated with the context is updated. When selecting LVL\_L2, software shall specify cxt\_start as the first context number mapped to the CXT\_L2\_ENT and cxt\_end as the last function-supported context mapped to the CXT\_L2\_ENT. This is because one CXT\_L2\_ENT points to a naturally-aligned range of 128 contexts. For example, assume that the function has been configured through MMIO\_CAP1 and MMIO\_CTL2 to support contexts 0 to 257. Updating the CXT\_L2\_ENT that maps context numbers 128 to 255, requires a cxt\_start of 128 and a cxt\_end of 255. Whereas, updating the CXT\_L2\_ENT that maps context numbers 256 and 257 requires a cxt\_start of 256 and a cxt\_end of 257.

The function shall not return an error when performing an update on a context that is CXTV\_INVALID.

For any requested level of the memory-based data-structure hierarchy to update, an SDXI implementation may update any associated higher level of the hierarchy. Note that a successful completion of a DSC\_CXT\_UPD operation only indicates that background update actions shall be started; it does not indicate that the update actions have completed nor that their effects are visible. Software shall use the DSC\_SYNC operation to ensure that the background update actions have completed.

| dsl[2:0]         | Targeted Data-Structure Level (See "4.3.1, SDXI Memory-Based Data-<br>Structure Hierarchy and Caching") |
|------------------|---------------------------------------------------------------------------------------------------------|
| 000b             | Reserved                                                                                                |
| 100b             | DSC_CXT_UPD.CTL: Updating LVL_CXT_CTL fields.                                                           |
| 110b             | DSC_CXT_UPD.L1: Updating LVL_L1 fields.                                                                 |
| 111b             | DSC_CXT_UPD.L2: Updating LVL_L2 fields.                                                                 |
| All other encodi | ngs are reserved.                                                                                       |

| Table 6–17: | Targeted | Data-Structure | l evel |
|-------------|----------|----------------|--------|
|             | Targeteu | Data-Structure | LCACI  |

| Figure 6-15: DSC_CXT_UPD Descriptor Format |                                  |      |                    |      |     |     |    |      |    |      |     |        |
|--------------------------------------------|----------------------------------|------|--------------------|------|-----|-----|----|------|----|------|-----|--------|
| 31 30 29 28 27                             | 26 25 24 23 22 21 20 19 18 17 16 | 15   | 14 13 12 11 10 9 8 | 7    | 6   | 5   | 4  | 3    | 2  | 1    | 0   |        |
| rsv                                        | type = 0x002                     |      | subtype = 0x01     |      | rsv |     | cs | r ch | fe | se   | vl  | + 0x00 |
|                                            | vf_num                           | vf   | rsv                |      |     | rs۱ | /  |      | tį | gt_p | ocd | + 0x04 |
|                                            | cxt_end                          |      | cxt_               | star | t   |     |    |      |    |      |     | + 0x08 |
|                                            | rs                               | SV   |                    |      |     |     |    |      |    |      |     | + 0x0C |
|                                            |                                  |      |                    |      |     |     |    |      |    |      |     | + 0x10 |
|                                            |                                  |      |                    |      |     |     |    |      |    |      |     | + 0x14 |
|                                            |                                  |      |                    |      |     |     |    |      |    |      |     | + 0x18 |
|                                            |                                  |      |                    |      |     |     |    |      |    |      |     | + 0x1C |
|                                            |                                  |      |                    |      |     |     |    |      |    |      |     | + 0x20 |
|                                            |                                  |      |                    |      |     |     |    |      |    |      |     | + 0x24 |
|                                            |                                  |      |                    |      |     |     |    |      |    |      |     | + 0x28 |
|                                            |                                  |      |                    |      |     |     |    |      |    |      |     | + 0x2C |
|                                            |                                  |      |                    |      |     |     |    |      |    |      |     | + 0x30 |
|                                            |                                  |      |                    |      |     |     |    |      |    |      |     | + 0x34 |
|                                            | csb                              | _ptr |                    |      |     |     | ļ_ | r    | sv |      | np  | + 0x38 |
|                                            |                                  |      |                    |      |     |     |    |      |    |      |     | + 0x3C |

| Field          | Bits    | Subfield     | Description                                                                                                                                                                                                    |
|----------------|---------|--------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| u32 opcode;    | 000     | vl = 1       | Valid. See section "6.1.1".                                                                                                                                                                                    |
|                | 001     | se = *       | Sequential Consistency. See section "6.1.1". Shall be set to 0.                                                                                                                                                |
|                | 002     | fe = *       | Fence. See section "6.1.1".                                                                                                                                                                                    |
|                | 003     | ch = 0       | Chain. See section "6.1.1". Shall be set to 0.                                                                                                                                                                 |
|                | 004     | csr          | Completion Status Mode Requirement for the descriptor.                                                                                                                                                         |
|                | 007:005 | rsvd         | Shall be set to 0                                                                                                                                                                                              |
|                | 015:008 | subtype=0x01 | See section "6.1.1".                                                                                                                                                                                           |
|                | 026:016 | type=0x002   | See section "6.1.1".                                                                                                                                                                                           |
|                | 031:027 | rsvd         | Shall be set to 0                                                                                                                                                                                              |
| u8 cflags;     | 034:032 | tgt_pcd      | Targeted Private-Cached Data to Update. See <i>"Table 6–17: Targeted Data-Structure</i> Level" for details.                                                                                                    |
|                | 039:035 | rsvd         | Shall be set to 0                                                                                                                                                                                              |
| u8 vflags;     | 046:040 | rsvd         | Shall be set to 0                                                                                                                                                                                              |
|                | 47      | vf           | "vf" valid.<br>1 = Update the VF number indicated by vf_number<br>0 = Update within the local function context executing this<br>descriptor<br>(For VF administrative contexts, this field shall be set to 0.) |
| u16 vf_num;    | 063:048 |              | "vf" number.<br>Only valid for PF administrative contexts. When VF=1, this field<br>indicates the VF to update.<br>(For VF administrative contexts, this field shall be set to 0.)                             |
| u16 cxt_start; | 079:064 |              | Indicates the starting context number to operate on.                                                                                                                                                           |
| u16 cxt_end;   | 095:080 |              | Indicates the ending (last) context number to operate on.                                                                                                                                                      |
| u8 rsvd_0[44]; | 447:096 |              | Shall be set to 0                                                                                                                                                                                              |
| u64 csb_ptr;   | 448     | np           | no_pointer. See section "6.1.1".                                                                                                                                                                               |
|                | 452:449 | rsvd         | Shall be set to 0                                                                                                                                                                                              |
|                | 511:453 | csb_ptr      | Completion Status Block pointer. See section "6.1.1".                                                                                                                                                          |

#### Table 6–18: DSC\_CXT\_UPD<sup>[^7]</sup> Descriptor Format

### 6.6.7 AdminGrp: DSC\_FN\_UPD Operation

The AdminGrp's DSC\_FN\_UPD operation is used by software to indicate a non-specific change in memory data structure contents. This operation signals the target SDXI function to invalidate all non-coherently cached copies of SDXI data structures across all contexts (LVL\_FN).

Note that a successful completion of a DSC\_FN\_UPD operation only indicates that background update actions shall be started; it does not indicate that the update actions have completed nor that their effects are visible. Software shall use the DSC\_SYNC operation to ensure that the background update actions have completed.

| 31 30 29 28 27 | 26 25 24 23 22 21 20 19 18 17 16 | 15 14 13 12 11 10 9 8 | 7 6 5 4 3 2 1 0                  |        |
|----------------|----------------------------------|-----------------------|----------------------------------|--------|
| rsv            | type = 0x002                     | subtype = 0x00        | rsv <mark>csr ch fe se vl</mark> | + 0x00 |
|                | vf_num                           | vf rsv                | rsv                              | + 0x04 |
|                | rs                               | SV                    |                                  | + 0x08 |
|                |                                  |                       |                                  | + 0x0C |
|                |                                  |                       |                                  | + 0x10 |
|                |                                  |                       |                                  | + 0x14 |
|                |                                  |                       |                                  | + 0x18 |
|                |                                  |                       |                                  | + 0x1C |
|                |                                  |                       |                                  | + 0x20 |
|                |                                  |                       |                                  | + 0x24 |
|                |                                  |                       |                                  | + 0x28 |
|                |                                  |                       |                                  | + 0x2C |
|                |                                  |                       |                                  | + 0x30 |
|                |                                  |                       |                                  | + 0x34 |
|                | csb                              | _ptr                  | rsv nr                           | + 0x38 |
|                |                                  |                       |                                  | + 0x3C |

### Figure 6-16: DSC\_FN\_UPD Descriptor Format

| Field          | Bits    | Subfield     | Description                                                                                                                                                                                                    |  |
|----------------|---------|--------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|
| u32 opcode;    | 000     | vl = 1       | Valid. See section "6.1.1".                                                                                                                                                                                    |  |
|                | 001     | se = *       | Sequential Consistency. See section "6.1.1". Shall be set to 0.                                                                                                                                                |  |
|                | 002     | fe = *       | Fence. See section "6.1.1".                                                                                                                                                                                    |  |
|                | 003     | ch = 0       | Chain. See section "6.1.1". Shall be set to 0.                                                                                                                                                                 |  |
|                | 004     | csr          | Completion Status Mode Requirement for the descriptor.                                                                                                                                                         |  |
|                | 007:005 | rsvd         | Shall be set to 0                                                                                                                                                                                              |  |
|                | 015:008 | subtype=0x00 | See section "6.1.1".                                                                                                                                                                                           |  |
|                | 026:016 | type=0x002   | See section "6.1.1".                                                                                                                                                                                           |  |
|                | 031:027 | rsvd         | Shall be set to 0                                                                                                                                                                                              |  |
| u8 rsvd_0;     | 039:032 | rsvd         | Shall be set to 0                                                                                                                                                                                              |  |
| u8 vflags;     | 046:040 | rsvd         | Shall be set to 0                                                                                                                                                                                              |  |
|                | 47      | vf           | "vf" valid.<br>1 = Update the VF number indicated by vf_number<br>0 = Update within the local function context executing this<br>descriptor<br>(For VF administrative contexts, this field shall be set to 0.) |  |
| u16 vf_num;    | 063:048 |              | "vf" number.<br>Only valid for PF administrative contexts. When VF=1, this field<br>indicates the VF to update.<br>(For VF administrative contexts, this field shall be set to 0.)                             |  |
| u8 rsvd_1[48]; | 447:064 |              | Shall be set to 0                                                                                                                                                                                              |  |
| u64 csb_ptr;   | 448     | np           | no_pointer. See section "6.1.1".                                                                                                                                                                               |  |
|                | 452:449 | rsvd         | Shall be set to 0                                                                                                                                                                                              |  |
|                | 511:453 | csb_ptr      | Completion Status Block pointer. See section "6.1.1".                                                                                                                                                          |  |

 Table 6–19:
 DSC\_FN\_UPD<sup>[^7]</sup>
 Operation Descriptor Format

### 6.6.8 AdminGrp DSC\_RKEY\_UPD Operation

The AdminGrp's DSC\_RKEY\_UPD operation is used by software to indicate a change to one or more RKey table entries. This operation signals the target SDXI function to update its internal copies, if any, of the selected RKey table entries. (See "4.3.1, SDXI Memory-Based Data-Structure Hierarchy and Caching") The affected RKey table indices are selected using the rkey\_start and rkey\_end fields.

Note that a successful completion of a DSC\_RKEY\_UPD operation only indicates that background update actions shall be started; it does not indicate that the update actions have completed nor that their effects are visible. Software shall use the DSC\_SYNC operation to ensure that the background update actions have completed.

| 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 | 15 14 13 12 11 10 9 8 | 7 6 5 4 3 2 1 0                  |        |
|-------------------------------------------------|-----------------------|----------------------------------|--------|
| rsv type = 0x002                                | subtype = 0x07        | rsv <mark>csr ch fe se vl</mark> | + 0x00 |
| vf_num                                          | vf rsv                | rsv                              | + 0x04 |
| rs                                              | V                     |                                  | + 0x08 |
| rkey_end                                        | rkey_                 | start                            | + 0x0C |
| rs                                              | v                     |                                  | + 0x10 |
|                                                 |                       |                                  | + 0x14 |
|                                                 |                       |                                  | + 0x18 |
|                                                 |                       |                                  | + 0x1C |
|                                                 |                       |                                  | + 0x20 |
|                                                 |                       |                                  | + 0x24 |
|                                                 |                       |                                  | + 0x28 |
|                                                 |                       |                                  | + 0x2C |
|                                                 |                       |                                  | + 0x30 |
|                                                 |                       |                                  | + 0x34 |
| csb                                             | _ptr                  | rsv np                           | + 0x38 |
|                                                 |                       |                                  | + 0x3C |

### Figure 6-17: DSC\_RKEY\_UPD Descriptor Format

|  | Table 6–20: | DSC_RKEY | UPD <sup>[^7]</sup> Descriptor Format |
|--|-------------|----------|---------------------------------------|
|--|-------------|----------|---------------------------------------|

| Field           | Bits    | Subfield     | Description                                                                                                                                                                                                    |  |
|-----------------|---------|--------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|
| u32 opcode;     | 000     | vl = 1       | Valid. See section "6.1.1".                                                                                                                                                                                    |  |
|                 | 001     | se = *       | Sequential Consistency. See section "6.1.1". Shall be set to 0.                                                                                                                                                |  |
|                 | 002     | fe = *       | Fence. See section "6.1.1".                                                                                                                                                                                    |  |
|                 | 003     | ch = 0       | Chain. See section "6.1.1". Shall be set to 0.                                                                                                                                                                 |  |
|                 | 004     | csr          | Completion Status Mode Requirement for the descriptor.                                                                                                                                                         |  |
|                 | 007:005 | rsvd         | Shall be set to 0                                                                                                                                                                                              |  |
|                 | 015:008 | subtype=0x07 | See section "6.1.1".                                                                                                                                                                                           |  |
|                 | 026:016 | type=0x002   | See section "6.1.1".                                                                                                                                                                                           |  |
|                 | 031:027 | rsvd         | Shall be set to 0                                                                                                                                                                                              |  |
| u8 rsvd_0;      | 039:032 | rsvd         | Shall be set to 0                                                                                                                                                                                              |  |
| u8 vflags;      | 046:040 | rsvd         | Shall be set to 0                                                                                                                                                                                              |  |
|                 | 47      | vf           | "vf" valid.<br>1 = Update the VF number indicated by vf_number<br>0 = Update within the local function context executing this<br>descriptor<br>(For VF administrative contexts, this field shall be set to 0.) |  |
| u16 vf_num;     | 063:048 |              | "vf" number.<br>Only valid for PF administrative contexts. When VF=1, this field<br>indicates the VF to update.<br>(For VF administrative contexts, this field shall be set to 0.)                             |  |
| u8 rsvd_1[4];   | 095:064 | rsvd         | Shall be set to 0                                                                                                                                                                                              |  |
| u16 rkey_start; | 111:096 |              | Indicates the starting RKey number to operate on.                                                                                                                                                              |  |
| u16 rkey_end;   | 127:112 |              | Indicates the ending (last) RKey number to operate on.                                                                                                                                                         |  |
| u8 rsvd_2[40];  | 447:128 |              | Shall be set to 0                                                                                                                                                                                              |  |
| u64 csb_ptr;    | 448     | np           | no_pointer. See section "6.1.1".                                                                                                                                                                               |  |
|                 | 452:449 | rsvd         | Shall be set to 0                                                                                                                                                                                              |  |
|                 | 511:453 | csb_ptr      | Completion Status Block pointer. See section "6.1.1".                                                                                                                                                          |  |

### 6.6.9 AdminGrp DSC\_SYNC Operation

The AdminGrp's DSC\_SYNC operation acts as a filtered synchronization barrier for prior DSC\_UPD operations, DSC\_CXT\_STOP operations, and background stopping actions targeting a specific function. The filter field in the DSC\_SYNC descriptor specifies which data structures updates and background actions are ordered with respect to the synchronization barrier for the targeted function.

A DSC\_SYNC operation is issued from the administrative context of a source function and synchronizes against the specified operations and actions previously applied to a target function. When the "vf" field of the operation is "0", both the source and target are the same local function (referred to as a "LOC" operation). When the "vf" field is "1", the source function is a PF and the target function is a VF specified by the "vf\_number" field (referred to as a "P2V" operation).

The DSC\_SYNC synchronization barrier is applied as follows.

- 1. For all DSC\_SYNC filters except STOP, a DSC\_SYNC operation shall ensure synchronization with previous DSC\_UPD operations that match the filter and are issued from the same source function acting upon the same target function. Consider the following examples where the filter matches.
  - a. A DSC\_SYNC operation is ensured to synchronize with a previous DSC\_UPD when both operations have "vf" as "0" and are issued on the same function.
  - b. A DSC\_SYNC operation is ensured to synchronize with a previous DSC\_UPD when both operations have "vf" as "1", are issued on the same PF, and target the same "vf\_number".
  - c. A DSC\_SYNC operation is not ensured to synchronize with a previous DSC\_UPD when one operation is issued by the PF targeting a VF ("vf" is "1") and the other has been locally issued ("vf" is "0") on the same VF.
  - d. A DSC\_SYNC operation is not ensured to synchronize with a previous DSC\_UPD when both operations have "vf" as "1", are issued on the same PF, but target a different "vf\_number".
- 2. For a DSC\_SYNC.STOP, the operation shall ensure synchronization with the stopping of the specified contexts on the target function regardless of the source of the stopping action. Consider the following examples where the filter matches.
  - a. A DSC\_SYNC.STOP operation with "vf" as "0" is ensured to synchronize with the stopping of a context on the same function.
  - b. A DSC\_SYNC.STOP operation issued by the PF targeting a context on a VF ("vf" is "1") is ensured to synchronize with the local stopping of the same context on the same VF.
  - c. A DSC\_SYNC.STOP operation issued by the PF targeting a context on a VF ("vf" is "1") is not ensured to synchronize with the stopping of any context on the PF or a different VF.
- 3. For any specified filter and target function, an SDXI implementation may synchronize against any larger set of updates, actions, and functions that includes the specified filter and target function; software shall not rely upon this behavior.

A successful completion of the DSC\_SYNC operation indicates the successful completion of the specified data structures updates and background actions, and all previously received MMIO doorbells are fully evaluated (i.e. there are no outstanding data structure reads due to a prior doorbell). An error associated with DSC\_SYNC may indicate an error associated with the same updates and background actions.

The table below describes the filter field used by DSC\_SYNC.

| Filter  | Name              | Description                                                                                                                                                                                                                                                                                                                                                                                                                                             |
|---------|-------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 000b    | СХТ               | The operation synchronizes against context updates (DSC_CXT_UPD.L2, DSC_CXT_UPD.L1, DSC_CXT_UPD.CTL, and DSC_CXT_UPD.AKEY) for the contexts specified by the cxt_start and cxt_end fields. The key_start and key_end fields are ignored.                                                                                                                                                                                                                |
| 001b    | STOP              | The operation synchronizes against completed stop actions on the target function (DSC_CXT_STOP, StopErr:Cxt, and Stop_FN) for the contexts specified by the cxt_start and cxt_end fields. The key_start and key_end fields are ignored. Note that synchronizing against all function contexts for stopping is insufficient to determine that the function itself has transitioned to GSV_STOP; software must still check MMIO_STS0.fn_gsv for GSV_STOP. |
| 010b    | AKEY              | The operation synchronizes against AKey updates (DSC_AKEY_UPD) for the contexts specified by the cxt_start and cxt_end fields and the AKeys specified by the key_start and key_end fields.                                                                                                                                                                                                                                                              |
| 011b    | RKEY              | The operation synchronizes against RKey updates (DSC_RKEY_UPD) for the the RKeys specified by the key_start and key_end fields. The cxt_start and cxt_end fields are ignored.                                                                                                                                                                                                                                                                           |
| 100b    | Function (FN)     | The operation synchronizes against function updates (DSC_FN_UPD). The cxt_start, cxt_end, key_start, and key_end fields are ignored.                                                                                                                                                                                                                                                                                                                    |
| All oth | er values are res | erved.                                                                                                                                                                                                                                                                                                                                                                                                                                                  |

# Figure 6-18: DSC\_SYNC Descriptor Format 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0

| 51 50 25 20 27 20 25 24 25 22 21 20 | 13 10 1/ 10 13 | 14 13 12 11 10 3 0 | 7 0 3 4 3               | 2 2 2    |        |
|-------------------------------------|----------------|--------------------|-------------------------|----------|--------|
| rsv type = 0x00                     | 2              | subtype = 0x06     | rsv <mark>csr</mark> ch | fe se vl | + 0x00 |
| vf_num                              | vf             | rsv                | rsv                     | flt      | + 0x04 |
| cxt_end                             |                | cxt_               | start                   |          | + 0x08 |
| key_end                             |                | key_               | start                   |          | + 0x0C |
|                                     | rsv            |                    |                         |          | + 0x10 |
|                                     |                |                    |                         |          | + 0x14 |
|                                     |                |                    |                         |          | + 0x18 |
|                                     |                |                    |                         |          | + 0x1C |
|                                     |                |                    |                         |          | + 0x20 |
|                                     |                |                    |                         |          | + 0x24 |
|                                     |                |                    |                         |          | + 0x28 |
|                                     |                |                    |                         |          | + 0x2C |
|                                     |                |                    |                         |          | + 0x30 |
|                                     |                |                    |                         |          | + 0x34 |
|                                     | csb_ptr        |                    | rs                      | sv np    | + 0x38 |
|                                     |                |                    |                         |          | + 0x3C |

| Field          | Bits                           | Subfield     | Description                                                                                                                                                                                                                                       |
|----------------|--------------------------------|--------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| u32 opcode;    | 000                            | vl = 1       | Valid. See section "6.1.1".                                                                                                                                                                                                                       |
|                | 001                            | se = *       | Sequential Consistency. See section "6.1.1". Shall be set to 0.                                                                                                                                                                                   |
|                | 002                            | fe = *       | Fence. See section "6.1.1".                                                                                                                                                                                                                       |
|                | 003                            | ch = 0       | Chain. See section "6.1.1". Shall be set to 0.                                                                                                                                                                                                    |
|                | 004                            | csr          | Completion Status Mode Requirement for the descriptor.                                                                                                                                                                                            |
|                | 007:005                        | rsvd         | Shall be set to 0                                                                                                                                                                                                                                 |
|                | 015:008                        | subtype=0x06 | See section "6.1.1".                                                                                                                                                                                                                              |
|                | 026:016                        | type=0x002   | See section "6.1.1".                                                                                                                                                                                                                              |
|                | 031:027                        | rsvd         | Shall be set to 0                                                                                                                                                                                                                                 |
| u8 cflags;     | 034:032                        | flt          | Indicates the filter to apply.                                                                                                                                                                                                                    |
|                | 039:035                        | rsvd         | Shall be set to 0                                                                                                                                                                                                                                 |
| u8 vflags;     | 046:040 rsvd Shall be set to 0 |              | Shall be set to 0                                                                                                                                                                                                                                 |
|                | 47                             | vf           | <ul> <li>"vf" valid.</li> <li>1 = Update the VF number indicated by vf_number</li> <li>0 = Update within the local function context executing this descriptor</li> <li>(For VF administrative contexts, this field shall be set to 0.)</li> </ul> |
| u16 vf_num;    | 063:048                        |              | "vf" number.<br>Only valid for PF administrative contexts. When VF=1, this field<br>indicates the VF to update.<br>(For VF administrative contexts, this field shall be set to 0.)                                                                |
| u16 cxt_start; | 079:064                        |              | Indicates the starting context number to operate on.                                                                                                                                                                                              |
| u16 cxt_end;   | 095:080                        |              | Indicates the ending (last) context number to operate on.                                                                                                                                                                                         |
| u16 key_start; | 111:096                        |              | Indicates the starting key number to operate on.                                                                                                                                                                                                  |
| u16 key_end;   | 127:112                        |              | Indicates the ending (last) key number to operate on.                                                                                                                                                                                             |
| u8 rsvd_0[40]; | 447:128                        |              | Shall be set to 0                                                                                                                                                                                                                                 |
| u64 csb_ptr;   | 448                            | np           | no_pointer. See section "6.1.1".                                                                                                                                                                                                                  |
|                | 452:449                        | rsvd         | Shall be set to 0                                                                                                                                                                                                                                 |
|                | 511:453                        | csb_ptr      | Completion Status Block pointer. See section "6.1.1".                                                                                                                                                                                             |

#### Table 6–22: DSC\_SYNC<sup>[^7]</sup> Descriptor Format

### 6.6.10 AdminGrp DSC\_ADM\_INTR Operation

The AdminGrp's Interrupt operation is used to generate interrupts from within an administrative context. The interrupt is generated from the local function hosting the administrative context.

| 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 | 15 14 13 12 | 11 10 9 8 | 7 6 5   | 4                | 3   | 2 1   | 0  |        |
|-------------------------------------------------|-------------|-----------|---------|------------------|-----|-------|----|--------|
| rsv type = 0x002                                | subtyp      | e = 0x05  | rsv     | <mark>csr</mark> | ch  | fe se | vl | + 0x00 |
| rs                                              | SV.         |           |         |                  |     |       |    | + 0x04 |
|                                                 |             |           |         |                  |     |       |    | + 0x08 |
| rsv                                             | rsv         |           | intr_nu | m                |     |       |    | + 0x0C |
|                                                 |             |           |         |                  |     |       |    | + 0x10 |
|                                                 |             |           |         |                  |     |       |    | + 0x14 |
|                                                 |             |           |         |                  |     |       |    | + 0x18 |
|                                                 |             |           |         |                  |     |       |    | + 0x1C |
|                                                 |             |           |         |                  |     |       |    | + 0x20 |
|                                                 |             |           |         |                  |     |       |    | + 0x24 |
|                                                 |             |           |         |                  |     |       |    | + 0x28 |
|                                                 |             |           |         |                  |     |       |    | + 0x2C |
|                                                 |             |           |         |                  |     |       |    | + 0x30 |
|                                                 |             |           |         |                  |     |       |    | + 0x34 |
| csb                                             | _ptr        |           |         |                  | rsv | ,     | np | + 0x38 |
|                                                 |             |           |         |                  |     |       |    | + 0x3C |

#### Figure 6-19: DSC\_ADM\_INTR Descriptor Format

#### Table 6–23: DSC\_ADM\_INTR<sup>[^7]</sup> Descriptor Format

| Field          | Bits    | Subfield     | Description                                                     |
|----------------|---------|--------------|-----------------------------------------------------------------|
| u32 opcode;    | 000     | vl = 1       | Valid. See section "6.1.1".                                     |
|                | 001     | se = *       | Sequential Consistency. See section "6.1.1". Shall be set to 0. |
|                | 002     | fe = *       | Fence. See section "6.1.1".                                     |
|                | 003     | ch = 0       | Chain. See section "6.1.1". Shall be set to 0.                  |
|                | 004     | csr          | Completion Status Mode Requirement for the descriptor.          |
|                | 007:005 | rsvd         | Shall be set to 0                                               |
|                | 015:008 | subtype=0x05 | See section "6.1.1".                                            |
|                | 026:016 | type=0x002   | See section "6.1.1".                                            |
|                | 031:027 | rsvd         | Shall be set to 0                                               |
| u8 rsvd_0[8];  | 095:032 |              | Shall be set to 0                                               |
| u16 intr_num;  | 107:096 | intr_num     | Specifies the MSI-X entry used to generate the interrupt.       |
|                | 111:108 | rsvd         | Shall be set to 0                                               |
| u8 rsvd_1[42]; | 447:112 | "            | Shall be set to 0                                               |
| u64 csb_ptr;   | 448     | np           | no_pointer. See section "6.1.1".                                |
|                | 452:449 | rsvd         | Shall be set to 0                                               |
|                | 511:453 | csb_ptr      | Completion Status Block pointer. See section "6.1.1".           |

### 7.1 Function Level Resources

### 7.1.1 Context Level 2 Table Base (MMIO\_CXT\_L2) Modification

The simplest method is shown below; others are possible.

- 1. Stop the function using the procedure described in "4.1.9, Stopping of the SDXI Function by Software".
- 2. Write MMIO\_CXT\_L2 with the new location of the Context Level 2 Table.
- 3. Activate the function using the procedure described in "4.1.8, Activation of the SDXI Function by Software".

## 7.2 Context Level Resources

### 7.2.1 Context Level 2 Table Entry (CXT\_L2\_ENT) Modification

Software may modify a Context Level 2 Table Entry (CXT\_L2\_ENT) for a number of reasons: make the entry invalid, add a Context Level 1 Table, or delete the Context Level 1 Table. The method for any of these cases is essentially the same.

- 1. Assumptions
  - a. Let "cl2\_idx" be the index offset of the desired CXT\_L2\_ENT from the beginning of the context level 2 table; for example, the fourth entry of the context level 2 table has an index of 3.
  - b. The desired CXT\_L2\_ENT points to a context level 1 table of 128 CXT\_L1\_ENT entries and indirectly to their associated contexts. Therefore, regardless of the individual context state, it is recommended to operate on all of them.
- 2. If there are running contexts associated with the CXT\_L2\_ENT, software shall stop them before making any other changes to the CXT\_L2\_ENT.
  - a. If context 0 (the administrative context) is included in the range of contexts associated with the desired CXT\_L2\_ENT, then for the below steps "start" = 1; otherwise "start" = (cl2\_idx << 7).
  - b. Issue DSC\_CXT\_STOP with cxt\_start = start and cxt\_end = (cl2\_idx << 7) + 127.
  - c. Issue DSC\_SYNC.STOP with cxt\_start = start and cxt\_end = (cl2\_idx << 7) + 127.
  - d. Wait for the DSC\_SYNC.STOP to complete.
  - e. If context 0 is included, issue DSC\_CXT\_STOP with cxt\_start = 0 and cxt\_end = 0. Wait until CXT\_STS.state of Context 0 is STOP\_CXT\_SW.
- 3. Modify the CXT\_L2\_ENT as appropriate.
  - a. If adding a new context level 1 table, initialize the associated CXT\_L1\_ENTs, and contexts as appropriate.
- 4. If context 0 is included, it must be restarted first through a jump-start method described in "4.3.4, *Starting A Context and Context Signaling*".
- 5. Propagate and synchronize the update.
  - a. Issue DSC\_CXT\_UPD.L2 with cxt\_start = (cl2\_idx << 7) and cxt\_end = (cl2\_idx << 7) + 127;
  - Issue DSC\_SYNC.CXT\_UPD with cxt\_start = (cl2\_idx << 7) and cxt\_end = (cl2\_idx << 7) + 127.</li>
  - c. Wait for the DSC\_SYNC.CXT\_UPD to complete.
- 6. Restart all relevant contexts if appropriate. All 128 contexts can be started by issuing a DSC\_START\_NM with cxt\_start = (cl2\_idx << 7) and cxt\_end = (cl2\_idx << 7) + 127.

SDXI is based on the PCI-Express device and software architecture including the use of configuration and MMIO register spaces and standard PCI-Express capabilities including, but not limited to, PCI Power Management, MSI/MSI-X interrupts and SR-IOV.

SDXI may be implemented using a range of function types such as PCIe EP or RCiEP. It may be implemented as either a Physical Function or Virtual Function.

Each SDXI function shall support standard PCI-Express defined reset mechanisms such as fundamental reset and hot-reset.

If SR-IOV is supported, PFs and VFs shall support Function Level Reset (FLR).

# 8.1 SDXI Function Configuration Space Registers

| $ \begin{array}{c ccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                |  |  |  |  |  |
|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|--|--|--|
| Vendor ID       +                                                                                                                                                                                                                                                                                                                     |  |  |  |  |  |
| $ \begin{array}{c ccccccccccccccccccccccccccccccccccc$                                                                                                                                                                                                                                                                                |  |  |  |  |  |
| DPE       SSE       RMA       RTA       STA       DEVSEL<br>0x0       MDP       FBC<br>0x0       rsvd       66<br>0x0       CAP<br>0x1       IS<br>0x0       rsvd       IR       rsvdP       D<br>0x0       FBE<br>0x0       SE       SWC<br>0x0       PE       PS       WI       SC<br>0x0       BM       MS       IO<br>0x0       + |  |  |  |  |  |
| ←Class Code →         Base Class - 0x12       Sub-Class - 0x1       Prog I/F - 0x0       Revision ID       +         BIST       MF       Header Layout - 0x0       Latency Timer - 0x0       Cache Line Size       +                                                                                                                  |  |  |  |  |  |
| Base Class - 0x12     Sub-Class - 0x1     Prog I/F - 0x0     Revision ID     +       BIST     MF     Header Layout - 0x0     Latency Timer - 0x0     Cache Line Size     +                                                                                                                                                            |  |  |  |  |  |
| BIST MF Header Layout - 0x0 Latency Timer - 0x0 Cache Line Size +                                                                                                                                                                                                                                                                     |  |  |  |  |  |
| +                                                                                                                                                                                                                                                                                                                                     |  |  |  |  |  |
| Day 0/1 Dystatebable Managy for MMIO Degisters                                                                                                                                                                                                                                                                                        |  |  |  |  |  |
| Bar 0/1 Prefetchable Memory for MMHO Registers                                                                                                                                                                                                                                                                                        |  |  |  |  |  |
|                                                                                                                                                                                                                                                                                                                                       |  |  |  |  |  |
| Bar 2/3 Preletchable Memory for Doorbells +                                                                                                                                                                                                                                                                                           |  |  |  |  |  |
| Bar 4 - Reserved 0x0                                                                                                                                                                                                                                                                                                                  |  |  |  |  |  |
| Bar 5 - Reserved 0x0 +                                                                                                                                                                                                                                                                                                                |  |  |  |  |  |
| Cardbus CIS Pointer - 0x0 +                                                                                                                                                                                                                                                                                                           |  |  |  |  |  |
| Subsystem ID Subsystem Vendor ID +                                                                                                                                                                                                                                                                                                    |  |  |  |  |  |
| Expansion ROM Base Address - 0x0                                                                                                                                                                                                                                                                                                      |  |  |  |  |  |
| Capabilities Pointer +                                                                                                                                                                                                                                                                                                                |  |  |  |  |  |
| Reserved                                                                                                                                                                                                                                                                                                                              |  |  |  |  |  |
| 1                                                                                                                                                                                                                                                                                                                                     |  |  |  |  |  |

Figure 8-1: PCI Config Space

### 8.1.1 Class Code

SDXI functions are expected to be identified through the SDXI class code.

- SNIA Smart Data Accelerator Interface (SDXI) controller:
  - Base Class = 0x12
  - Sub Class = 0x01
  - Programming Interface = 0x0

#### 8.1.2 BAR Configuration

| PCI Bar<br>Index | Width  | Size                                 | Туре                 | Description    |
|------------------|--------|--------------------------------------|----------------------|----------------|
| Bar 0/1          | 64 bit | 512KiB to 4MiB for PF. 512KiB for VF | Prefetchable, Memory | MMIO Registers |
| Bar 2/3          | 64 bit | Variable                             | Prefetchable, Memory | Doorbell BAR   |
| Bar 4            |        |                                      |                      | Reserved       |
| Bar 5            |        |                                      |                      | Reserved       |

If MSI-X is supported, the MSI-X table and PBA Offset table may be placed within any of the BARs including the reserved ones depending on hardware implementation. A block of MMIO space within BAR0/1 is reserved for use by MSI-X tables. Implementations are not required to use this reserved MMIO region.

The MMIO register BAR is marked as prefetchable to allow software to allocate the region within 64-bit MMIO space in order to avoid exhaustion of 32-bit MMIO space when implementing SR-IOV with many virtual functions. Please refer to the implementation note in the PCI-Express Base Specification which provides additional guidance on the use of the prefetchable attribute.

### 8.1.3 Required Capabilities and Extended Capabilities

SDXI functions are required to support the following capabilities:

- PCI Power Management
- MSI and/or MSI-X

Additional PCIe capabilities such as Advanced Error Reporting, Transaction Processing Hints, and Single Root I/O Virtualization are optional. See "4.1.7, Function Reset and Outstanding DMA Requests" for reset requirements.

### 8.2 Mapping sfunc Values to PCIe Requester ID Values

PCIe SDXI Functions map sfunc values onto PCIe Requester ID values. The mapping is implementation specific. Software may determine the mapping by reading MMIO\_CAP0.sfunc and associating the value with the Requester ID of the PCIe Function owning the register.

# 8.3 Mapping SDXI DSH to PCIe TLP Processing Hints (PCIe TPH)

A PCIe SDXI Function may support PCIe TLP Processing Hints (PCIe TPH) and map DSH information onto the corresponding PCIe TPH fields when accessing data buffers over the PCIe link. See "3.6, Data Steering Hints (DSH)" for more details on DSH.

A PCIe SDXI Function indicates support for PCIe TPH functionality through the inclusion of the PCIe TPH Requester Extended Capability. PCIe TPH capabilities such as the supported steering tag modes are indicated through the TPH Requester Capability Register.

System software is responsible for selecting the desired steering tag mode via the ST Mode Select register and enabling PCIe TPH by setting the PCIe TPH Requester Enable register to 1.

When a data buffer access has DSH information included, and PCIe TPH is enabled, the corresponding request on the PCIe link shall be generated with the PCIe TH (Transaction Hint) field set to 1. The PCIe TPH PH (Processing Hint) field is directly copied from the DSH "ph" field obtained from the data buffer's AKey table entry (AKEY\_ENT.ph) or RKey table entry (RKEY\_ENT.ph) as appropriate. The PCIe TPH ST (Steering Tag) information is controlled, in part, by the PCIe-defined ST Mode Select register:

- If No ST Mode is enabled, ST is set to 0.
- If Interrupt Vector Mode is enabled, the ST value is obtained by using the intr\_num (PCIe interrupt vector number) to index into the appropriate PCIe-defined structure holding the ST values, such as the MSI-X table. The intr\_num is obtained from the data buffer's AKey table entry or, if present, RKey table entry (AKEY\_ENT/RKEY\_ENT.intr\_num). The table entry must have the "iv" field set to 1 for intr\_num to be valid. If intr\_num is not valid, then ST is set to 0. The "stag" field in the AKey or RKey table entry is not used.
- If Device Specific Mode is enabled, ST is obtained from the data buffer's AKey table entry (AKEY\_ENT.stag) or RKey table entry (RKEY\_ENT.stag) as appropriate.

The use and control of PCIe TPH information with non-data-buffer accesses is implementation-specific Refer to the PCI-Express Base Specification for further details on PCIe TLP Processing Hints.

### 8.4 **PCIe Atomic Capabilities Discovery and Enablement**

Privileged software can determine whether a PCIe SDXI Function supports initiating atomic operations through the mechanisms described in 4.4 Atomic Operation Support. To enable a PCIe SDXI Function to initiate atomic operations, privileged software shall set the AtomicOp Requester Enable bit in the Device Control 2 Register. This register is located in the PCIe SDXI Function's configuration space PCI Express Capability structure.

Privileged software can determine whether a PCIe Function supports completing atomic operations by checking the 32-bit AtomicOp and 64-bit AtomicOp supported fields in the Device Capabilities 2 Register of that Function's configuration space PCI Express Capability structure. The Function may be an upstream Root Port for the PCIe SDXI Function, or another PCIe Endpoint Function acting as a peer-to-peer target.

Privileged software can check whether Bridge Functions, such as Switch Port and Root Port, located between a PCIe SDXI Function and a target PCIe Function support routing atomic operations by checking the AtomicOp Routing Supported field in the Device Capabilities 2 Register of those Bridge Functions" configuration space PCI Express Capability structures.

Each function's MMIO registers are grouped by category within separate 64Kbyte regions, as shown in *"Table 9–1: MMIO Control Registers"*, to facilitate CPU MMU-based protection and emulation across a range of possible processor architectures. PFs may implement an MMIO region of between 512KB and 4MB depending on the amount of MMIO space required to hold mailboxes for the number of supported VFs. VFs each consume 512KB of MMIO space.

The requirements to access an SDXI function's MMIO register are described here. Software should not depend on the results of any unsupported access (as described in this section) to these MMIO registers. A misaligned access that straddles MMIO registers is not supported and results in undefined operation of the SDXI function. A write to an MMIO register must be ordered with respect to other accesses to the same register; software must use processor-specific means to ensure that the required memory ordering is enforced.

For PCIe implementations, the SDXI function's MMIO registers are located in PCIe prefetchable memory space. While it is expected that most platforms will not merge separate writes to an SDXI function, software may be required on some platforms to work around this when writing SDXI function MMIO registers. Please consult platform-specific references and the *PCI Express Base Specification Revision 5.0 -- Implementation Note: "Additional Guidance on the Prefetchable Bit in Memory Space BARs"*.

The SDXI function's MMIO register space is segmented logically into two regions for the purposes of discussing the access model: the Doorbell MMIO registers, and the remaining MMIO registers.

For the Doorbell MMIO registers, only a naturally-aligned 64-bit write is supported. Any other write access is ignored by the SDXI function. A read access of a Doorbell MMIO register is not supported; the SDXI function must return a result but the result is architecturally undefined.

For the remaining MMIO registers, the SDXI function supports all naturally-aligned accesses up to 64-bits as atomic with one exception: if an implementation can not support 64-bit atomic accesses it will report MMIO\_CAP1.mmio64 as "0". Misaligned reads and writes within an MMIO register are not supported; the SDXI function is allowed to return or write stale data as determined by the access type. Attempting to write reserved fields in an MMIO register with an illegal value results in an undefined value for the register.

A "torn" read or write of an MMIO register can result when at least two agents - any of the SDXI function or one or more software threads -- are concurrently accessing the fields of the register in a non-atomic manner. The torn access can result in stale data being intermixed with more recent data in an unpredictable way. Software is solely responsible for avoiding torn reads and writes on accesses that straddle fields within an MMIO register. Software shall solely ensure synchronized access among multiple software threads; the method for which is beyond the scope of this specification.

In an environment with synchronized software access, the following list describes methods a single software thread can use to avoid undefined operation and torn accesses with respect to the SDXI function itself; SDXI implementations shall ensure that these methods are supported.

- If the register is not dynamically updated by the SDXI function, then any naturally-aligned read will avoid returning torn data.
- When the SDXI function's MMIO\_STS0.fn\_gsv is "GSV\_STOP", any naturally-aligned read or write will avoid torn data.
- This is the only supported method for MMIO\_CXT\_L2 (MMIO Offset: 0x1\_0000)
- If possible, use a supported naturally-aligned atomic read or write that maps solely to complete fields in the MMIO register.
- If the MMIO register has an associated valid/enable bit (in some cases this is in another MMIO register), then disable or invalidate the register, read the enable/valid bit back to confirm it has changed value, make modifications, and then enable or validate the register.

The Error log read and write indexes are 64-bit MMIO registers and SDXI functions that report MMIO\_CAP1.mmio64 as "0" may exhibit torn accesses. Since the Error Log can contain no more than 2^26 entries, the contents of the ring can be calculated and read by software using only the lower 32-bits of the index registers, and thereby avoid the effects of torn reads of MMIO\_ERR\_WRT. An SDXI function that reports MMIO\_CAP1.mmio64 as "0" must account for the possibility of receiving torn MMIO writes to MMIO\_ERR\_RD.

| 64KB Region               | PF Usage                                                 | VF Usage                                                  |
|---------------------------|----------------------------------------------------------|-----------------------------------------------------------|
| 0x00_0000                 | General Control and Status Registers                     |                                                           |
| 0x01_0000                 | Context Table Registers                                  |                                                           |
| 0x02_0000                 | Error Logging Control and Status Registers               |                                                           |
| 0x03_0000                 | Mailbox Control Registers                                |                                                           |
| 0x04_0000                 | Reserved for MSI-X                                       |                                                           |
| 0x05_0000                 | Reserved                                                 |                                                           |
| 0x06_0000 to<br>0x25_FFFF | Alternating 64K regions of Send and Receive<br>Mailboxes | Send Mailbox at 0x6_0000.<br>Receive Mailbox at 0x7_0000. |
| 0x26_0000 to<br>0x3F_FFFF | Reserved                                                 | N/A                                                       |

# 9.1 General Control and Status Registers

| Field          | Bits    | Subfield       | Type &<br>Reset | Description                                                                                                                                                                                                                                                                                     |                                                |
|----------------|---------|----------------|-----------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------|
| u32 field0;    | 001:000 | fn_gsr         | RW, 0x0         | Controls the SDXI function state. See "4.1 SDX<br>Function State" for more detail.                                                                                                                                                                                                              | XI                                             |
|                |         |                |                 | fn_gsr                                                                                                                                                                                                                                                                                          |                                                |
|                |         |                |                 | 0b00 GSRV_RESET                                                                                                                                                                                                                                                                                 |                                                |
|                |         |                |                 | 0b01 GSRV_STOP_SF                                                                                                                                                                                                                                                                               |                                                |
|                |         |                |                 | 0b10 GSRV_STOP_HD                                                                                                                                                                                                                                                                               |                                                |
|                |         |                |                 | 0b11 GSRV_ACTIVE                                                                                                                                                                                                                                                                                |                                                |
|                | 002     | fn_pasid_vl    | RW, 0x0         | Indicates whether the fn_pasid field is used wh<br>accessing context level 2 or 1 table entries, Cc<br>Control Entries, AKey table entries, RKey table<br>and error log entries.<br>If fn_pasid_vl is "0", requests to the referenced<br>structures are accessed using DMA requests v<br>PASID. | ien<br>ontext<br>e entries,<br>data<br>vithout |
|                | 003     | rsvd           | R, 0x0          | Shall be set to zero.                                                                                                                                                                                                                                                                           |                                                |
|                | 004     | fn_err_intr_en | RW, 0x0         | <ul> <li>1 = An interrupt is signaled when the function transitions to the GSV_ERROR state as reported through MMIO_STS0.fn_gsv. MSI or MSI-X is enabled, message 0 is signaled.</li> <li>0 = No interrupt is generated</li> </ul>                                                              | on<br>S<br>When                                |
|                | 007:005 | rsvd           | R, 0x0          | Shall be set to zero.                                                                                                                                                                                                                                                                           |                                                |
|                | 027:008 | fn_pasid       | RW, 0x0         | Function PASID value.                                                                                                                                                                                                                                                                           |                                                |
|                | 031:028 | rsvd           | R, 0x0          | Shall be set to zero.                                                                                                                                                                                                                                                                           |                                                |
| u32 fn_grp_id; | 063:032 |                | RW, 0x0         | This field is provided for software to record the<br>function group ID assignment. The field has<br>on the operation of the SDXI function. In a V<br>field is read-only and reflects the value of the<br>associated PF. See "3.3.1, SDXI Function G<br>more details.                            | ne<br>no effect<br>F, the<br>e<br>roup" for    |

Table 9–2: MMIO\_CTL0<sup>[^1]</sup> (MMIO Offset :0x0\_0000)

| Field         | Bits    | Subfield | Type &<br>Reset | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                |
|---------------|---------|----------|-----------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| u8 field0;    | 000     | busy     | RW, 0x0         | When "0", there is no outstanding write propagation of this function's "probe" field to the probe fields of other functions in the function group.<br>When "1", there is an outstanding write propagation of this function's "probe" field to the probe fields of other functions in the function group.<br>This field does not propagate. In a VF, the field is read-only and reflects the value of the associated PF. See "3.3.1, SDXI Function Group" for more details. |
|               | 001     | probe    | RW, 0x0         | The current value of the probe field. When written, this field propagates to the same field in other PFs within the function group. In a VF, the field is read-only and reflects the value of the associated PF. See "3.3.1, SDXI Function Group" for more details.                                                                                                                                                                                                        |
|               | 007:002 | rsvd     | R, 0x0          | This field always reads back as 0.                                                                                                                                                                                                                                                                                                                                                                                                                                         |
| u8 rsvd_0[7]; | 063:008 |          | R, 0x0          | This field always reads back as 0.                                                                                                                                                                                                                                                                                                                                                                                                                                         |

Table 9–3: MMIO\_GRP\_ENUM<sup>[^1]</sup> (MMIO Offset: 0x0\_0008)

| Table 9–4: | MMIO                                   | CTL2 <sup>[^1]</sup> | (MMIO   | Offset: | 0x0 | 0010) |
|------------|----------------------------------------|----------------------|---------|---------|-----|-------|
|            | ······································ |                      | (······ |         |     | /     |

| Field            | Bits    | Subfield    | Type &<br>Reset | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                            |
|------------------|---------|-------------|-----------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| u16 field0;      | 003:000 | max_buffer  | RW              | This field controls the maximum data buffer size enabled<br>for use by this function. The field is encoded the same as<br>MMIO_CAP1.max_buffer and must not exceed it in<br>value.<br>Reset: MMIO_CAP1.max_buffer                                                                                                                                                                                                                                                      |
|                  | 011:004 | rsvd        | R, 0x0          | Shall be set to zero.                                                                                                                                                                                                                                                                                                                                                                                                                                                  |
|                  | 015:012 | max_akey_sz | RW              | This field controls the maximum size of any AKey table<br>useable by any context within this function. The field is<br>encoded the same as MMIO_CAP1.max_akey_sz and<br>must not exceed it in value.<br>Reset: MMIO_CAP1.max_akey_sz                                                                                                                                                                                                                                   |
| u16 max_cxt;     | 031:016 |             | RW              | This field controls the maximum number of contexts<br>enabled for use by this function. The field is encoded the<br>same MMIO_CAP1.max_cxt and must not exceed it in<br>value.<br>Reset: MMIO_CAP1.max_cxt                                                                                                                                                                                                                                                             |
| u32 opb_000_avl; | 063:032 |             | RW,<br>0x0      | Each bit in this field, when set to "1", indicates that a certain descriptor operation group is available for all contexts within the function; when "0" it is not. The bit encoding matches the MMIO_CAP1.opb_000_cap register. For a context to use a supported and available operation group, the group must also be enabled in CXT_L1.opb_000_enb. See "5.1, Descriptor Operations" and "Chapter 6, SDXI Descriptor and Operation Specification" for more details. |

| Field         | Bits    | Subfield | Type &<br>Reset | Description                                                                                                                                                                                                                                                                                                                                       |
|---------------|---------|----------|-----------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| u8 field0;    | 002:000 | fn_gsv   | R, 0x0          | This field describes the overall state of the SDXI function. This<br>register does not indicate the state of any specific context<br>000b – GSV_STOP<br>001b – GSV_INIT<br>010b – GSV_ACTIVE<br>011b – GSV_STOPG_SF<br>100b – GSV_STOPG_HD<br>101b – GSV_ERROR<br>All other encodings reserved.<br>See "4.1 SDXI Function State" for more detail. |
|               | 007:003 | rsvd     | R, 0x0          | Shall be set to zero.                                                                                                                                                                                                                                                                                                                             |
| u8 rsvd_0[7]; | 063:008 |          | R, 0x0          | Shall be set to zero.                                                                                                                                                                                                                                                                                                                             |

Table 9–5: MMIO\_STS0\* (MMIO Offset: 0x0\_0100)

| Field              | Bits    | Subfield  | Type &<br>Reset | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             |
|--------------------|---------|-----------|-----------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| u16 sfunc;         | 015:000 |           | R               | An opaque SDXI Function identifier that maps uniquely to<br>an SDXI function. For example, in a PCIe implementation of<br>SDXI, "sfunc" could map to a PCIe Requester ID. It is<br>referenced by AKey table entries. sfunc is unique for each<br>SDXI function within an SDXI Function Group. This field<br>may be set to 0 for implementations with<br>MMIO_CAP1.rkey_cap set to "0".                                                                                                                  |
| u8 field0;         | 016     | vf        | R               | <ul> <li>1 = Indicates a virtual function. Software should use<br/>the virtual function programming model including<br/>MMIO register layout and mailbox programming.</li> <li>0 = Indicates a physical function. Software should use<br/>the physical function programming model including<br/>MMIO register layout and mailbox programming.</li> </ul>                                                                                                                                                |
|                    | 018:017 | cs_cap    | R, IMPL         | Completion-Status Capabilities. See "4.4.1, Completion-<br>Status Capabilities" for details.                                                                                                                                                                                                                                                                                                                                                                                                            |
|                    | 019     | rsvd      | R, 0x0          | rsvd                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
|                    | 022:020 | db_stride | R, IMPL         | Indicates the address stride between doorbell sections. The stride is encoded as 2**(db_stride+12) bytes.                                                                                                                                                                                                                                                                                                                                                                                               |
|                    | 23      | rsvd      | R, 0x0          | rsvd                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
| u8 max_ds_ring_sz; | 028:024 | max_sz    | R, IMPL         | Indicates the maximum size of any descriptor ring for any<br>context within this function. The maximum descriptor ring<br>size that is supported is 2**(max_ds_ring_sz + 16) bytes.<br>Valid max_ds_ring_sz values range from 0 (64 KB or 1K<br>descriptors) to 22 (256 GB or 4G descriptors); all other<br>encodings are reserved.<br>Software shall ensure for all valid contexts that:<br>- Let max_ds = 2**(MMIO_CAP0.max_ds_ring_sz + 10)<br>- Then CXT_CTL.ds_ring_sz <= min( (2**32)-1, max_ds ) |
|                    | 031:029 | rsvd      | R, IMPL         | rsvd                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
| u8 max_rkey_sz;    | 035:032 | max_sz    | R, IMPL         | Indicates the maximum size of the RKey table for this function. The maximum RKey table size that is supported is 2**(12 + max_rkey_sz) bytes. Values of max_rkey_sz greater than 0x8 are reserved.                                                                                                                                                                                                                                                                                                      |
|                    | 039:036 | rsvd      | R, 0x0          | rsvd                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |
| u8 rsvd_0[3];      | 063:040 |           | R, 0x0          | rsvd                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                    |

#### Table 9–6: MMIO\_CAP0<sup>[^1]</sup> (MMIO Offset: 0x0\_0200)

| Field            | Bits    | Subfield      | Type &<br>Reset | Description                                                                                                                                                                                                                                                                                                                                                                           |
|------------------|---------|---------------|-----------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| u16 field0;      | 003:000 | max_buffer    | R, IMPL         | Indicates the maximum data buffer size supported by this function for operations. The maximum data buffer size that is supported is 2**(max_buffer+21) bytes. Valid max_buffer values range from 0 (2 MiB) to 11 (4 GiB); all other encodings are reserved.                                                                                                                           |
|                  | 004     | rkey_cap      | R, IMPL         | Indicates whether RKey functionality is supported.<br>RKey support is required to support data transfers<br>between functions in an SDXI group.                                                                                                                                                                                                                                       |
|                  | 005     | rm            | R, IMPL         | Memory Registration Required. Reserved for use with the SDXI Connection Operation Group.                                                                                                                                                                                                                                                                                              |
|                  | 006     | mmio64        | R, IMPL         | <ul> <li>When 1, indicates the SDXI Function implements 64-<br/>bit atomic access to MMIO registers.</li> <li>When 0, 32-bit is the largest atomic access to MMIO<br/>registers. See the beginning of "9 MMIO Control<br/>Registers" for more details. Behaviors associated<br/>with this bit do not apply to the doorbell registers.</li> </ul>                                      |
|                  | 007     | rsvd          | R               | rsvd                                                                                                                                                                                                                                                                                                                                                                                  |
|                  | 011:008 | max_errlog_sz | R, IMPL         | Indicates the maximum size of the error log for this function. The maximum error log size that is supported is 2**(max_errlog_sz + 23) bytes. Valid max_errlog_sz values range from 0 (8 MB) to 9 (4 GB); all other encodings are reserved.                                                                                                                                           |
|                  | 015:012 | max_akey_sz   | R, IMPL         | Indicates the maximum size of any AKey table<br>referenced by any context within this function. The<br>maximum size that is supported is<br>2**(max_akey_sz+12) bytes.<br>Values of max_akey_sz greater than 0x8 are<br>reserved.                                                                                                                                                     |
| u16 max_cxt;     | 031:016 |               | R, IMPL         | The contexts supported by this function range from 0 to max_cxt. The maximum number of contexts supported by this function is max_cxt+1.                                                                                                                                                                                                                                              |
| u32 opb_000_cap; | 063:032 |               | R, IMPL         | Each bit in this field is set to 1 to indicate whether a certain descriptor operation-group is supported by the SDXI function. For a context to use a supported operation group, it must be made available in MMIO_CTL2.opb_000_avl and enabled in CXT_L1.opb_000_enb. See "5.1 Descriptor Operations" and "Chapter 6, SDXI Descriptor and Operation Specification" for more details. |

Table 9–7: MMIO\_CAP1<sup>[^1]</sup> (MMIO Offset: 0x0\_0208)

|               |         |          | _               | /                                                                                                                       |
|---------------|---------|----------|-----------------|-------------------------------------------------------------------------------------------------------------------------|
| Field         | Bits    | Subfield | Type &<br>Reset | Description                                                                                                             |
| u8 minor;     | 007:000 |          | r, impl         | Indicates Minor Version number of SDXI function implementation. 0x0 for the version of SDXI described in this document. |
| u8 rsvd_0;    | 015:008 |          | R, 0x0          | Reserved.                                                                                                               |
| u8 major;     | 023:016 |          | r, impl         | Indicates Major Version number of SDXI function implementation. 0x1 for the version of SDXI described in this document. |
| u8 rsvd_1;    | 031:024 |          | R, 0x0          | Reserved.                                                                                                               |
| u8 rsvd_2[4]; | 063:032 |          | R, 0x0          | Reserved.                                                                                                               |

Table 9–8: MMIO\_VERSION<sup>[^1]</sup> (MMIO Offset: 0x0\_0210)

# 9.2 Context and RKey Table Registers

| Table 9–9: | MMIO | CXT | L2 <sup>[^1]</sup> | (MMIO    | Offset: 0x1 | 0000^) |
|------------|------|-----|--------------------|----------|-------------|--------|
|            |      |     |                    | <b>`</b> |             | _ /    |

| Field         | Bits    | Subfield | Type &<br>Reset | Description                           |
|---------------|---------|----------|-----------------|---------------------------------------|
| u64 lv02_ptr; | 011:000 | rsvd     | R, 0x0          | rsvd                                  |
|               | 063:012 | lv02_ptr | RW, 0x0         | Pointer to the context level 2 table. |

| Table 9–10: | MMIO | RKEY <sup>[^1]</sup> | (MMIO | Offset: | 0x1 | 0100) |
|-------------|------|----------------------|-------|---------|-----|-------|
|             | _    |                      |       |         | -   | /     |

| Field    | Bits    | Subfield | Type &<br>Reset | Description                                                                                                                                                                                                                                 |
|----------|---------|----------|-----------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| u64 ptr; | 000     | en       | RW, 0x0         | When set to 1, RKey functionality is enabled. Also, sz and ptr contain valid information.                                                                                                                                                   |
|          | 004:001 | sz       | RW, 0x0         | Controls the size of the RKey table. This field is encoded the same<br>as MMIO_CAP0.max_rkey_sz and must not exceed it in value.<br>SNIASDXISpecification-v1.0 requires that software shall not program<br>sz with values greater than 0x8. |
|          | 011:005 | rsvd     | R, 0x0          | Rsvd                                                                                                                                                                                                                                        |
|          | 063:012 | ptr      | RW, 0x0         | Pointer to the start of the RKey table.                                                                                                                                                                                                     |

# 9.3 Error Logging Control and Status Registers

| Field        | Bits    | Subfield | Type &<br>Reset | scription                                                                                                                              |                                                                              |
|--------------|---------|----------|-----------------|----------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------|
| u64 intr_en; | 000     | en       | RW, 0x0         | <ul> <li>An interrupt is signaled w<br/>MMIO_ERR_STS.sts bit<br/>MSI-X is enabled, messa</li> <li>No interrupt is generated</li> </ul> | when hardware transitions the from "0" to "1". When MSI or ge 0 is signaled. |
|              | 063:001 | rsvd     | R, 0x0          | Ł                                                                                                                                      |                                                                              |

Table 9–11: MMIO\_ERR\_CTL<sup>[^1]</sup> (MMIO Offset:0x2\_0000)

### Table 9–12: MMIO\_ERR\_STS<sup>[^1]</sup> (MMIO Offset:0x2\_0008)

| Field     | Bits    | Subfield | Type &<br>Reset | Descr      | ription                                                                                                                                                                                                                                                                                                                                                                                                                |
|-----------|---------|----------|-----------------|------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| u64 info; | 000     | sts      | RW1C, 0x0       | 1 =        | An attempt to record an error in the log was<br>made. See err bit for the success or failure of<br>this operation. Once the SDXI function sets the<br>sts bit, no new error interrupts, if enabled, will<br>be generated until the bit is cleared (by<br>software writing "1" to the bit). Additional errors<br>may continue to be recorded in the log while the<br>sts bit is "1".<br>An error has not been recorded. |
|           | 001     | ovf      | RW1C, 0x0       | 1 =<br>0 = | The error log has overflowed, and some error<br>information was lost. MMIO_ERR_STS.err will<br>be set on an overflow event. The ovf bit is a<br>status bit and does not control operation of the<br>error log.<br>The error log has not overflowed.                                                                                                                                                                    |
|           | 002     | rsvd     | R, 0x0          | rsvd       |                                                                                                                                                                                                                                                                                                                                                                                                                        |
|           | 003     | err      | RW1C, 0x0       | 1 =        | An error occurred attempting to write the error<br>log and some error information was lost.<br>Further error logging and hardware modification<br>of the MMIO_ERR_STS register is disabled<br>until this bit is cleared (by software writing "1" to<br>the bit).<br>It has not.                                                                                                                                        |
|           | 063:004 | rsvd     | R, 0x0          | rsvd       |                                                                                                                                                                                                                                                                                                                                                                                                                        |

| Field    | Bits    | Subfield | Type &<br>Reset | Description                                                                                                                                                                                             |
|----------|---------|----------|-----------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| u64 ptr; | 000     | en       | RW, 0x0         | When true, the error log mechanism is enabled and shall use the values of the MMIO_ERR_CFG.ptr and MMIO_ERR_CFG.sz fields.                                                                              |
|          | 005:001 | sz       | RW, 0x0         | This field controls the error log size. The size of the error log is 2**(sz + 12) bytes. The error log size must not exceed the max capability of the function as specified by MMIO_CAP1.max_errlog_sz. |
|          | 011:006 | rsvd     | R, 0x0          | rsvd                                                                                                                                                                                                    |
|          | 063:012 | ptr      | RW, 0x0         | Pointer to the start of the error log.                                                                                                                                                                  |

Table 9–13: MMIO\_ERR\_CFG<sup>[^1]</sup> (MMIO Offset:0x2\_0010)

#### Table 9–14: MMIO\_ERR\_WRT<sup>[^1]</sup> (MMIO Offset: 0x2\_0020)

| Field     | Bits    | Subfield | Type &<br>Reset | Description                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
|-----------|---------|----------|-----------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| u64 index | 063:000 |          | RW,0x0          | Description: Hardware error log write pointer. The function<br>increments the write pointer when an error log entry is successfully<br>written. The write pointer is not incremented when writing of an error<br>log entry results in the MMIO_ERR_STS.err bit being set. The starting<br>byte offset of the last error log entry written by the function is given by:<br>- Let log_sz = 2**(MMIO_ERR_CFG.sz +12);<br>- Let log_bs = MMIO_ERR_CFG & ~0xFFF;<br>- Let index = MMIO_ERR_WRT - 1<br>- Then byte_offset = log_bs + ( (index * 64) % log_sz ); |
|           |         |          |                 | The error log is empty when MMIO_ERR_WRT == MMIO_ERR_RD.<br>The write pointer value is expected to never wrap-around to 0. The<br>memory buffer used by the error log may wrap around.<br>Software may write MMIO_ERR_WRT only when error logging is<br>disabled either through MMIO_ERR_CFG.en or MMIO_ERR_STS.err.<br>Writes to this register at any other time result in undefined behavior.<br>Software must always read this field with a 64-bit access regardless<br>of the value the function returns in MMIO_CAP1.mmio64.                         |

#### Table 9–15: MMIO\_ERR\_RD<sup>[^1]</sup> (MMIO Offset: 0x2\_0028)

| Field     | Bits    | Subfield | Type &<br>Reset | Description                                                                                                                                                                       |
|-----------|---------|----------|-----------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| u64 index | 063:000 |          | RW,0x0          | Description: Error log read pointer. This points to the starting byte offset of the first error log entry that has not been consumed (read) by software in this way:              |
|           |         |          |                 | <ul> <li>Let log_sz = 2**(MMIO_ERR_CFG.sz +12);</li> <li>Let log_bs = MMIO_ERR_CFG &amp; ~0xFFF;</li> <li>Then byte_offset = log_bs + ( (MMIO_ERR_RD * 64) % log_sz );</li> </ul> |
|           |         |          |                 | Software must always write this field with a 64-bit access regardless of the value the function returns in MMIO_CAP1.mmio64.                                                      |
# 9.4 MBOX Mailbox Registers

The PF/VF mailbox interface allows for low-bandwidth two-way communication under SR-IOV virtualization between privileged software on a PF and the software managing a VF. This may be used directly during runtime to pass control messages between privileged software and the VMs. It may also be used to bootstrap higher bandwidth forms of communication such as shared memory buffers, or even memory ring-buffers and interrupts that are written or generated by SDXI contexts

There is no mailbox communication directly between VFs. There is no mailbox communication directly between functions that belong to different SDXI SR-IOV devices even if they belong to the same SDXI Group. All mailbox registers are reserved in implementations that do not support SR-IOV. An implementation is only required to implement enough mailbox registers to support the number of VFs it implements.

Each mailbox is 16 bytes in size. There are separate transmit and receive mailboxes between each VF and the PF.

It is recommended that under virtualization, the Hypervisor take ownership of all SDXI PFs in order to facilitate communication between VFs under the same PF, and between SDXI VFs belonging to different PFs within the same SDXI function group.

VF\_1 is the first virtual function under the PF. The PF will use a MMIO\_MBS\_CTL.tgt\_vf value of 0 to set the MMIO\_MBR\_ST.rcv bit on VF\_1. When VF\_1 clears its rcv bit, the PF's MMIO\_MBS\_ST.ack bit 0 will become set. If the VF sends a message to the PF, the PF's MMIO\_MBR\_ST.rcv bit 0 will become set. Subsequent VFs, if present, will follow the same pattern. VF\_N will use the tgt\_vf value of N-1, ack bit N-1, and rcv bit N-1.

| Field        | Bits    | Subfield | Type &<br>Reset | Description                                                                                                                                                                    |
|--------------|---------|----------|-----------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| u64 intr_en; | 000     | snd      | RW, 0x0         | Send message acknowledge interrupt enable. When 1, an interrupt is generated when any bit in MMIO_MBS_ST.ack is set to 1. When MSI or MSI-X is enabled, message 0 is signaled. |
|              | 001     | rcv      | RW, 0x0         | Receive message interrupt enable. When 1, an interrupt is generated when any bit in MMIO_MBR_ST.rcv is set to 1. When MSI or MSI-X is enabled, message 0 is signaled.          |
|              | 063:002 | rsvd     | R, 0x0          | rsvd                                                                                                                                                                           |

#### Table 9–16: MMIO\_MB\_CTL<sup>[^1]</sup> (MMIO Offset:0x3\_0000)

#### Table 9–17: MMIO\_MBS\_CTL<sup>[^1]</sup> (MMIO Offset: 0x3\_0010)

| Field         | Bits    | Subfield | Type &<br>Reset | Description                                                                                                                                       |  |  |  |
|---------------|---------|----------|-----------------|---------------------------------------------------------------------------------------------------------------------------------------------------|--|--|--|
| u16 msg;      | 000     | msg      | W               | Writing this bit to 1 causes the target MMIO_MBR_ST.rcv bit to be<br>set in the target function's mailbox.<br>VF's always send to the PF mailbox. |  |  |  |
|               | 015:001 | rsvd     | 0x0             | rsvd                                                                                                                                              |  |  |  |
| u16 tgt_vf;   | 031:016 |          | W               | Indicates the target VF number to be used by the msg field of this register.<br>Shall be set to 0 for VF copies of this register.                 |  |  |  |
| u8 rsvd_0[4]; | 063:032 |          | 0x0             | rsvd                                                                                                                                              |  |  |  |

| Field    | Bits    | Subfield | Type & | Description                                                                                                                                                                                                                                                                                                                                                                      |
|----------|---------|----------|--------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|          |         |          | Reset  |                                                                                                                                                                                                                                                                                                                                                                                  |
| u64 ack; | 063:000 |          | RW1C   | Message acknowledgment status. Indicates that the last<br>message sent to the corresponding function was<br>acknowledged by the receiver. VFs implements only bit 0 at<br>offset 0x3_1000 representing a message received by the<br>PF. The PF implements as many bits as there are VFs. This<br>bit should be cleared prior to sending a new message to the<br>target function. |

#### Table 9–18: MMIO\_MBS\_ST<sup>[^1]</sup> (MMIO Offset: 0x3\_1000 to 0x3\_2FFF)

#### Table 9–19: MMIO\_MBR\_ST<sup>[^1]</sup> (MMIO Offset: 0x3\_3000 to 0x3\_4FFF)

| Field    | Bits    | Subfield | Type &<br>Reset | Description                                                                                                                                                                                                                                                                                                              |
|----------|---------|----------|-----------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| u64 rcv; | 063:000 |          | RW1C,<br>0x0    | Receive message status. Indicates that rcv_data from the mailbox corresponding to the status bit is valid. VFs implements only bit 0 at offset 0x3_3000 representing a message received from the PF. The PF implements as many bits as there are VFs. Clearing this status bit returns an acknowledgement to the sender. |

# 9.5 **PF Mailbox Data Registers**

#### Table 9–20: Send Mailbox Data Format

| Field       | Bits    | Subfield | Type &<br>Reset | Description                                          |
|-------------|---------|----------|-----------------|------------------------------------------------------|
| u8 msg[16]; | 127:000 |          | RW              | 16 bytes of mailbox message data to be sent to a VF. |

#### Table 9–21: Receive Mailbox Data Format

| Field       | Bits    | Subfield | Type &<br>Reset | Description                                          |
|-------------|---------|----------|-----------------|------------------------------------------------------|
| u8 msg[16]; | 127:000 |          | R               | 16 bytes of mailbox message data received from a VF. |

| MMIO Addresses         | MMIO Send Mailbox Range |
|------------------------|-------------------------|
| 0x06_0000 to 0x06_FFFF | MMIO_MBS[0000:0FFF]     |
| 0x08_0000 to 0x08_FFFF | MMIO_MBS[1000:1FFF]     |
| 0x0A_0000 to 0x0A_FFFF | MMIO_MBS[2000:2FFF]     |
| 0x0C_0000 to 0x0C_FFFF | MMIO_MBS[3000:3FFF]     |
| 0x0E_0000 to 0x0E_FFFF | MMIO_MBS[4000:4FFF]     |
| 0x10_0000 to 0x10_FFFF | MMIO_MBS[5000:5FFF]     |
| 0x12_0000 to 0x12_FFFF | MMIO_MBS[6000:6FFF]     |
| 0x14_0000 to 0x14_FFFF | MMIO_MBS[7000:7FFF]     |
| 0x16_0000 to 0x16_FFFF | MMIO_MBS[8000:8FFF]     |
| 0x18_0000 to 0x18_FFFF | MMIO_MBS[9000:9FFF]     |
| 0x1A_0000 to 0x1A_FFFF | MMIO_MBS[A000:AFFF]     |
| 0x1C_0000 to 0x1C_FFFF | MMIO_MBS[B000:BFFF]     |
| 0x1E_0000 to 0x1E_FFFF | MMIO_MBS[C000:CFFF]     |
| 0x20_0000 to 0x20_FFFF | MMIO_MBS[D000:DFFF]     |
| 0x22_0000 to 0x22_FFFF | MMIO_MBS[E000:EFFF]     |
| 0x24_0000 to 0x24_FFFF | MMIO_MBS[F000:FFFF]     |

Table 9–22: MMIO Send Mailbox Registers

Table 9–23: MMIO Receive Mailbox Registers

| MMIO Addresses         | MMIO Receive Mailbox Range |
|------------------------|----------------------------|
| 0x07_0000 to 0x07_FFFF | MMIO_MBR[0000:0FFF]        |
| 0x09_0000 to 0x09_FFFF | MMIO_MBR[1000:1FFF]        |
| 0x0B_0000 to 0x0B_FFFF | MMIO_MBR[2000:2FFF]        |
| 0x0D_0000 to 0x0D_FFFF | MMIO_MBR[3000:3FFF]        |
| 0x0F_0000 to 0x0F_FFFF | MMIO_MBR[4000:4FFF]        |
| 0x11_0000 to 0x11_FFFF | MMIO_MBR[5000:5FFF]        |
| 0x13_0000 to 0x13_FFFF | MMIO_MBR[6000:6FFF]        |
| 0x15_0000 to 0x15_FFFF | MMIO_MBR[7000:7FFF]        |
| 0x17_0000 to 0x17_FFFF | MMIO_MBR[8000:8FFF]        |
| 0x19_0000 to 0x19_FFFF | MMIO_MBR[9000:9FFF]        |
| 0x1B_0000 to 0x1B_FFFF | MMIO_MBR[A000:AFFF]        |
| 0x1D_0000 to 0x1D_FFFF | MMIO_MBR[B000:BFFF]        |
| 0x1F_0000 to 0x1F_FFFF | MMIO_MBR[C000:CFFF]        |
| 0x21_0000 to 0x21_FFFF | MMIO_MBR[D000:DFFF]        |
| 0x23_0000 to 0x23_FFFF | MMIO_MBR[E000:EFFF]        |
| 0x25_0000 to 0x25_FFFF | MMIO_MBR[F000:FFFF]        |

# 9.6 VF Mailbox Data Registers

| Field       | Bits    | Subfield | Type &<br>Reset | Description                                            |
|-------------|---------|----------|-----------------|--------------------------------------------------------|
| u8 msg[16]; | 127:000 |          | RW              | 16 bytes of mailbox message data to be sent to the PF. |

Table 9–24: MMIO\_MBS<sup>[^1]</sup> (MMIO Offset: 0x6\_0000)

### Table 9–25: MMIO\_MBR<sup>[^1]</sup> (MMIO Offset: 0x7\_0000)

| Field       | Bits    | Subfield | Type &<br>Reset | Description                                            |
|-------------|---------|----------|-----------------|--------------------------------------------------------|
| u8 msg[16]; | 127:000 |          | R               | 16 bytes of mailbox message data received from the PF. |

# 9.7 Doorbell Sections and Registers

The SDXI function provides a per-context, MMIO-based Doorbell Section that is used by software to indicate that new work is available for that particular context. Software uses an interconnect-specific mechanism to map the start of this MMIO region (Doorbell\_MMIO\_START) with the SDXI function; for example, a PCI BAR. The MMIO region for Doorbell Sections is divided into doorbell-stride sized sections (based on MMIO\_CAP0.db\_stride). Each section, numbered from zero, corresponds to the context of the same number and is naturally aligned to a boundary of 2\*\*(MMIO\_CAP0.db\_stride + 12) bytes. This spacing is provided to allow the doorbell region for a particular context to be mapped to a user process with MMU-based page protections. The size of the Doorbell MMIO region is dependent on the maximum supported doorbell stride and the maximum number of ring contexts supported:

Doorbell MMIO Region Size In Bytes =

(MMIO\_CAP1.max\_cxt + 1) \* 2\*\*(MMIO\_CAP0.db\_stride + 12)

The Doorbell MMIO region shall only be written; a read returns an undefined value.

The first 8-bytes of each section corresponds to the doorbell register for the matching context; the remaining bytes are reserved.

The doorbell register is used in the following way by software. After updating the context's Write\_Index location and adding new descriptors to the context's ring, software performs an aligned 64-bit MMIO write of the Write\_Index value, the doorbell\_value, to its associated doorbell register address. The doorbell\_value is evaluated per the rules specified in "4.3.3, Doorbell Register and Context Signaling".



#### Figure 9-1: Doorbell MMIO Regions