Abstract: Examples described herein relate to a switch device for a rack of two or more physical servers, wherein the switch device is coupled to the two or more physical servers and the switch device performs packet protocol processing termination for received packets and provides payload data from the received packets, without a received packet header, to a destination buffer of a destination physical server in the rack. In some examples, the switch device comprises at least one central processing unit, and the at least one central processing unit is to execute packet processing operations on the received packets. In some examples, a physical server executes at least one virtualized execution environment (VEE) and the at least one central processing unit executes a VEE for packet processing of packets with data to be accessed by the physical server that executes the VEE.
Description:
RELATED APPLICATIONS
[0001] This patent application is related to India Patent Application No. 202247066311, filed on 18 November 2022, entitled “SWITCH-MANAGED RESOURCE ALLOCATION AND SOFTWARE EXECUTION”.
[0002] This application claims the benefit of U.S. Application No. 16/905,761, filed June 18, 2020, the entire contents of which are hereby incorporated by reference herein.
DESCRIPTION
[0003] In the context of cloud computing, cloud service providers (CSPs) offer various services to other companies or individuals for use such as infrastructure as a service (IaaS), software as a service (SaaS) or platform as a service (PaaS). A hardware infrastructure including compute, memory, storage, accelerators, networking, and so forth executes and supports software stacks provided by the CSPs and their customers.
[0004] CSPs can experience complex networking environments where packets are parsed, de-encapsulated, decrypted, and sent to a proper virtual machine (VM). In some cases, packet flows are balanced and metered to achieve service level agreement (SLA) requirements. In some cases, network processing occurs in the servers within a data center. However, with increased volumes of packets and increased amounts and complexity of packet processing activities, the burden on the servers is increasing. Central processing units (CPUs) or other server processor resources are used for packet processing, but CPUs and other processor resources could instead be used for other services that are billable or generate higher revenue than packet processing. The impact of this problem increases significantly when using high bit-rate network devices, such as 100 Gbps and higher-speed networks.
Brief Description of the Drawings
[0005] FIGs. 1A-1D depict example switch systems.
[0006] FIG. 2A depicts an example overview of a system of managing resources in a rack.
[0007] FIG. 2B depicts an example overview of various management hierarchies.
[0008] FIG. 3 depicts an example system in which a switch can respond to a memory access request.
[0009] FIG. 4A shows examples of a Memcached server executing on a server and in a switch.
[0010] FIG. 4B shows the Ethernet packet flow for a single request.
[0011] FIGs. 5A-5C depict example systems in which packets can terminate at a switch.
[0012] FIG. 6 depicts an example of a switch that executes an orchestration control plane to manage what device executes a virtualized execution environment.
[0013] FIG. 7A depicts an example of migration of a virtualized execution environment from a server to another server.
[0014] FIG. 7B depicts an example of migration of a virtualized execution environment.
[0015] FIGs. 8A-8C depict example processes.
[0016] FIG. 9 depicts a system.
[0017] FIG. 10 depicts an environment.
[0018] FIG. 11 depicts an example network element.
Detailed Description
[0019] Within a data center, north-south traffic can include packets that flow in or out of the data center, whereas east-west traffic can include packets that flow between nodes (e.g., racks of servers) within the data center. North-south traffic can be considered a product for serving customers, whereas east-west traffic can be considered overhead. The amount of east-west traffic has been growing at a significantly higher rate than north-south traffic, and processing east-west traffic flows in a timely manner to comply with applicable SLAs while reducing data center total cost of ownership (TCO) is a growing challenge within the data center.
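As an illustration only, the north-south versus east-west distinction in paragraph [0019] can be sketched as a classification on packet endpoints. The sketch below is not part of the specification. It assumes a hypothetical address range (`10.0.0.0/8`) stands in for "inside the data center", and the names `DATACENTER_PREFIXES`, `is_internal`, and `classify_flow` are invented for this example.

```python
import ipaddress

# Hypothetical assumption: every address in these prefixes is inside the data center.
DATACENTER_PREFIXES = [ipaddress.ip_network("10.0.0.0/8")]


def is_internal(addr: str) -> bool:
    """Return True if the address falls inside an assumed data center prefix."""
    ip = ipaddress.ip_address(addr)
    return any(ip in net for net in DATACENTER_PREFIXES)


def classify_flow(src: str, dst: str) -> str:
    """Classify a flow by whether its endpoints are inside the data center."""
    src_in, dst_in = is_internal(src), is_internal(dst)
    if src_in and dst_in:
        # Both endpoints are nodes within the data center.
        return "east-west"
    if src_in or dst_in:
        # Exactly one endpoint is inside: traffic enters or leaves the data center.
        return "north-south"
    # Neither endpoint is inside; the text does not define this case.
    return "unknown"


if __name__ == "__main__":
    print(classify_flow("10.1.2.3", "10.4.5.6"))
    print(classify_flow("10.1.2.3", "203.0.113.7"))
```

A real deployment would more likely derive the classification from topology or routing information than from a static prefix list; the prefix check is used here only to keep the sketch self-contained.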
Claims:
1. A packaged integrated circuit, the packaged integrated circuit being configurable to be used in switching operations in association with at least one network, multiple graphics processing units (GPUs), multiple compute express link (CXL).mem memory devices, and multiple central processing units (CPUs), the packaged integrated circuit comprising:
interface circuitry to be communicatively coupled to the at least one network, the multiple GPUs, the multiple CXL.mem memory devices, and the multiple CPUs; and
switch circuitry to implement the switching operations in association with respective data communication processing, the switching operations to be carried out via the interface circuitry in association with the at least one network, the multiple GPUs, the multiple CXL.mem devices, and the multiple CPUs;
wherein:
the multiple CXL.mem devices are to be in a pooled configuration;
the switch circuitry is to carry out, at least in part, the respective data communication processing that is in association with the at least one network in accordance with remote direct memory access (RDMA) over Converged Ethernet (RoCE) protocol;
the switch circuitry is to carry out, at least in part, the respective data communication processing that is in association with the multiple GPUs and the multiple CPUs in accordance with peripheral component interconnect express (PCIe) protocol;
the switch circuitry is to carry out, at least in part, the respective data communication processing that is in association with the multiple CXL.mem memory devices in accordance with CXL protocol;
the switch circuitry is to implement the switching operations and/or the respective data communication processing in association with compute and/or accelerator resource aggregation and/or compute and/or accelerator resource composition;
the switching operations and/or the respective communication processing are software programmable, at least in part; and
the switch circuitry is to carry out, at least in part, the respective data communication processing that is in association with the multiple CXL.mem memory devices in association with memory page data transfer.
| # | Name | Date |
|---|---|---|
| 1 | 202448062774-POWER OF AUTHORITY [20-08-2024(online)].pdf | 2024-08-20 |
| 2 | 202448062774-FORM 1 [20-08-2024(online)].pdf | 2024-08-20 |
| 3 | 202448062774-DRAWINGS [20-08-2024(online)].pdf | 2024-08-20 |
| 4 | 202448062774-DECLARATION OF INVENTORSHIP (FORM 5) [20-08-2024(online)].pdf | 2024-08-20 |
| 5 | 202448062774-COMPLETE SPECIFICATION [20-08-2024(online)].pdf | 2024-08-20 |
| 6 | 202448062774-Proof of Right [22-11-2024(online)].pdf | 2024-11-22 |
| 7 | 202448062774-FORM 3 [22-11-2024(online)].pdf | 2024-11-22 |
| 8 | 202448062774-FORM 18 [28-01-2025(online)].pdf | 2025-01-28 |