cats T. Peng Internet-Draft Y. Ma Intended status: Informational Beijing Jiaotong University Expires: 20 August 2024 17 February 2024 The Computing-Aware Routing Architecture in Computing-Aware Traffic Steering draft-peng-cats-car-architecture-00 Abstract This document proposes a compute-aware routing (CAR) architecture for Computing-Aware Traffic Steering (CATS), designed to support routing systems with compute resource awareness and provide a standardized approach for network devices and services. Status of This Memo This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79. Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet- Drafts is at https://datatracker.ietf.org/drafts/current/. Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress." This Internet-Draft will expire on 20 August 2024. Copyright Notice Copyright (c) 2024 IETF Trust and the persons identified as the document authors. All rights reserved. This document is subject to BCP 78 and the IETF Trust's Legal Provisions Relating to IETF Documents (https://trustee.ietf.org/ license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with respect to this document. Code Components extracted from this document must include Revised BSD License text as described in Section 4.e of the Trust Legal Provisions and are provided without warranty as described in the Revised BSD License. Peng & Ma Expires 20 August 2024 [Page 1] Internet-Draft CAR Architecture February 2024 Table of Contents 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . 2 2. The Computing-Aware Routing Headers (CARH) . . . . . . . . . 2 3. Computing-Aware Info (CAI) . . . . . . . . . . . . . . . . . 3 4. Advanced Info . . . . . . . . . . . . . . . . . . . . . . . . 5 4.1. 0xF0 - Composite Performance Score (CPS) . . . . . . . . 5 4.2. 0xF1 - Network Quality Index (NQI) . . . . . . . . . . . 5 5. Security Considerations . . . . . . . . . . . . . . . . . . . 6 6. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 6 Acknowledgments . . . . . . . . . . . . . . . . . . . . . . . . . 6 Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . . 6 1. Introduction With the continuous increase of computing resources and the popularity of distributed computing, how to fully utilize these dispersed computing resources to provide better services has become an important challenge. To address this challenge, Computing-Aware Traffic Steering (CATS) has been proposed. CATS requires the network to be able to perceive information about computing resources and select appropriate service instances based on the joint indicators of computing and the network, thereby achieving dynamic control of network traffic. In the implementation of CATS, a key task is to design a message format that can transmit computing resource information. Therefore, the Computing-Aware Routing (CAR) message format is proposed. The CAR is a flexible and powerful message transmission method designed to support computing resource-aware routing systems. 2. The Computing-Aware Routing Headers (CARH) Peng & Ma Expires 20 August 2024 [Page 2] Internet-Draft CAR Architecture February 2024 0 1 2 3 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ |Version| Time to Live | Length | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | | | Timestamp (64 bits) | | | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | CAI Type[0] | CAI Value[0] | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | ... | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ | CAI Type[n] | CAI Value[n] | +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ The CARH contains the following fields: * Version Number (4-bit): The version number indicates the CARH format version. * Time to Live (8-bit): Time to Live indicates the effective time of the resource information represented by CARH. * Length (16-bit): Length of the CARH. * Timestamp (64-bit): Timestamp for perceiving resource information. * CAI Type (8-bit): Computing-Aware Info Type (CAI Type) represents a type of computing resource and the order of magnitude corresponding to the specific value of that type of resource. This section corresponds one-to-one with the CAI Value. * CAI Value (24-bit): Computing-Aware Info Value (CAI Value) represents the specific value of the resource identified by the CAI Type. 3. Computing-Aware Info (CAI) In order to comprehensively evaluate the computing power in the network, CARH needs to introduce a series of dynamic performance indicators to capture real-time changes in computing resources. These indicators need to cover key performance parameters of network devices during operation, such as CPU Utilization, CPU Frequency, CPU Cores, Memory Utilization, Memory Frequency, Storage Utilization, GPU Memory, GPU Utilization, etc. Peng & Ma Expires 20 August 2024 [Page 3] Internet-Draft CAR Architecture February 2024 In CARH, use Computing-Aware Info (CAI) to carry these key performance parameters. Computing-Aware Info (CAI) consists of two parts: Computing-Aware Info Type (CAI Type) and Computing-Aware Info Value (CAI Value). Use the CAI Type field to represent the type of resource being carried, and use the CAI Value field to represent the numerical value and order of magnitude corresponding to the resource being carried. Here are some definitions of the CAI Type field. * 0x01 - CPU Utilization: Indicates the percentage of time the CPU processor is executing tasks. The calculation formula is the used CPU time divided by the total CPU time. * 0x02 - CPU Frequency: Refers to the number of instructions executed by the CPU per second, usually in hertz (Hz). * 0x03 - CPU Cores: Represents the number of physical cores in the CPU processor. Multi-core processors can execute multiple tasks simultaneously. * 0x04 - Memory Utilization: Indicates the percentage of system memory in use. The calculation formula uses memory divided by total memory. * 0x05 - Memory Frequency: Refers to the clock speed of RAM, indicating the amount of data that the memory module can transfer per second. * 0x06 - Storage Utilization: Indicates the percentage of storage capacity being used in the storage system. The calculation formula uses storage divided by total storage capacity. * 0x07 - GPU Memory: Refers to the memory on the graphics processing unit (GPU) used to store data required for graphics and computing tasks. * 0x08 - GPU Utilization: Indicates the percentage of time the GPU processor is performing tasks. The calculation formula is the used GPU time divided by the total GPU time. Peng & Ma Expires 20 August 2024 [Page 4] Internet-Draft CAR Architecture February 2024 4. Advanced Info Dynamic performance indicators can only serve as basic information transmitted in CARH. In fact, based on performance indicator information, some advanced performance indicators can be established according to specific network scenarios. Advanced info is also a supplementary definition of CAI Type. 4.1. 0xF0 - Composite Performance Score (CPS) From the perspective of computing power, a computing power weight allocation mechanism can be introduced to assign appropriate weights to different computing power factors. This mechanism gives appropriate weights to different computing power factors to more accurately reflect their relative importance when calculating comprehensive performance indicators. For example, in some scenarios, CPU utilization may be more critical, while memory usage may be more important in other scenarios. This can be customized by network administrators according to specific needs, or adjusted through CATS intelligent algorithms for Self- Adaptation. Therefore, a comprehensive scoring model can be proposed to take into account various computing power factors and generate a comprehensive score. This score can be a standardized value that reflects the overall computing power status of the system. The formulation of a comprehensive scoring model can combine Data Analysis and network topology features. 4.2. 0xF1 - Network Quality Index (NQI) From the perspective of network status, a network quality measurement mechanism can be introduced to evaluate network quality. Network Quality Index (NQI) takes into account several important performance indicators in the network, including latency, packet loss rate, bandwidth utilization, etc. The goal of NQI is to provide a single metric to measure the overall quality of the network, so that network administrators and system operators can more easily understand the performance of the network and take corresponding measures to optimize the network. Peng & Ma Expires 20 August 2024 [Page 5] Internet-Draft CAR Architecture February 2024 The definition of NQI is based on the weighted average of a series of network performance indicators. Each performance indicator is assigned a weight, which reflects the relative importance of each indicator to network quality. Then, by multiplying the values of each indicator with its corresponding weight and adding all weighted values, a comprehensive score, namely NQI, is obtained. 5. Security Considerations TBD. 6. IANA Considerations This document has no IANA actions. Acknowledgments TODO acknowledge. Authors' Addresses Tianhao Peng Beijing Jiaotong University Email: th.peng@bjtu.edu.cn Yuyin Ma Beijing Jiaotong University Email: mayuyin@bjtu.edu.cn Peng & Ma Expires 20 August 2024 [Page 6]