Abstract: The Serverless computing is new paradigm in cloud-native applications. The cloud-native applications are Resilient, Highly Available and Scalable, and achieved by CPU and/or Memory usage metrics. Scale-to-zero is to bring down application which is idle i.e., when it is not processing any request or where there is no computing. The standard benchmark CPU/Memory metrics would never be zero and hence not helpful in determining whether application is idle or not. Hence custom metrics would be defined and assessed to bring down the application by determining its idle state. On new request, application new instance boots up and process the request. This booting time can be seriously affecting the latency and throughput for requests which are real time in nature. To improve the responsiveness for real time applications, scale-to-near-zero can be incorporated. Scale-to-near-zero is where, small footprint application instance exists always, which is capable enough to process real-time request. This improves the responsiveness of real time applications, eliminates the warmup time, and maximizes chances of being deterministic. Full capacity application instance can be instantiated based on the current load of small footprint application. The discussed approach targeted for FaaS (Functions-As-A-Service) of serverless computing.
DESC:In serverless computing, the custom metrics are the benchmark to scale-to-zero. On the new request, application instances created & booted, before the actual request is processed. The bootup time interval between new request arrival and application instance ready to process it, the warmup time.
The warmup time could add latency at beginning of request processing, which impacts the overall completion time of the given request. This is a serious impact on the real time request for which the completion time is deterministic in nature. ,CLAIMS:[1] We claim to improve the responsiveness for real time applications by incorporating scale-to-near-zero method. Scale-to-near-zero is where, small footprint application instance exists always, which is capable enough to process real-time request.
[2] Addition to claim [1], Full capacity application instance can be instantiated based on the current load of small footprint application and thus improves the response time.
| # | Name | Date |
|---|---|---|
| 1 | 202141019511-8(i)-Substitution-Change Of Applicant - Form 6 [10-12-2022(online)].pdf | 2022-12-10 |
| 1 | 202141019511-STATEMENT OF UNDERTAKING (FORM 3) [28-04-2021(online)].pdf | 2021-04-28 |
| 2 | 202141019511-ASSIGNMENT DOCUMENTS [10-12-2022(online)].pdf | 2022-12-10 |
| 2 | 202141019511-PROVISIONAL SPECIFICATION [28-04-2021(online)].pdf | 2021-04-28 |
| 3 | 202141019511-FORM 1 [28-04-2021(online)].pdf | 2021-04-28 |
| 3 | 202141019511-FORM28 [10-12-2022(online)].pdf | 2022-12-10 |
| 4 | 202141019511-COMPLETE SPECIFICATION [24-11-2021(online)].pdf | 2021-11-24 |
| 4 | 202141019511-DECLARATION OF INVENTORSHIP (FORM 5) [28-04-2021(online)].pdf | 2021-04-28 |
| 5 | 202141019511-FORM-9 [24-11-2021(online)].pdf | 2021-11-24 |
| 6 | 202141019511-COMPLETE SPECIFICATION [24-11-2021(online)].pdf | 2021-11-24 |
| 6 | 202141019511-DECLARATION OF INVENTORSHIP (FORM 5) [28-04-2021(online)].pdf | 2021-04-28 |
| 7 | 202141019511-FORM 1 [28-04-2021(online)].pdf | 2021-04-28 |
| 7 | 202141019511-FORM28 [10-12-2022(online)].pdf | 2022-12-10 |
| 8 | 202141019511-ASSIGNMENT DOCUMENTS [10-12-2022(online)].pdf | 2022-12-10 |
| 8 | 202141019511-PROVISIONAL SPECIFICATION [28-04-2021(online)].pdf | 2021-04-28 |
| 9 | 202141019511-8(i)-Substitution-Change Of Applicant - Form 6 [10-12-2022(online)].pdf | 2022-12-10 |
| 9 | 202141019511-STATEMENT OF UNDERTAKING (FORM 3) [28-04-2021(online)].pdf | 2021-04-28 |