Sign In to Follow Application
View All Documents & Correspondence

System And Method For Inferencing One Or More Trained Models

Abstract: ABSTRACT SYSTEM AND METHOD FOR INFERENCING ONE OR MORE TRAINED MODELS The present disclosure relates to a method for training a model by one or more processors (202). The method includes retrieving data from at least one of, an existing data source or a new data source. Further, the method includes categorizing the data as at least one of, historic data and current data. Further, the method includes automatically pre-processing the historic data and the current data. Further, the method includes autotuning one or more hyperparameters for the pre-processed historic data and the current data. Further, the method includes training the model with the historic data and the current data. The historic data and the current data are pre-processed and autotuned with the one or more hyperparameters. Further, the method includes generating utilizing the one or more re-trained models, one or more inferences based on a user inputting a portion of a search query via a user interface (206). Ref. FIG. 5

Get Free WhatsApp Updates!
Notices, Deadlines & Correspondence

Patent Information

Application #
Filing Date
06 October 2023
Publication Number
15/2025
Publication Type
INA
Invention Field
COMPUTER SCIENCE
Status
Email
Parent Application

Applicants

JIO PLATFORMS LIMITED
OFFICE-101, SAFFRON, NR. CENTRE POINT, PANCHWATI 5 RASTA, AMBAWADI, AHMEDABAD, GUJARAT, INDIA

Inventors

1. Aayush Bhatnagar
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
2. Ankit Murarka
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
3. Jugal Kishore
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
4. Chandra Ganveer
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
5. Sanjana Chaudhary
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
6. Gourav Gurbani
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
7. Yogesh Kumar
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
8. Avinash Kushwaha
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
9. Dharmendra Kumar Vishwakarma
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
10. Sajal Soni
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
11. Niharika Patnam
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
12. Shubham Ingle
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
13. Harsh Poddar
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
14. Sanket Kumthekar
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
15. Mohit Bhanwria
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
16. Shashank Bhushan
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
17. Vinay Gayki
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
18. Aniket Khade
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
19. Durgesh Kumar
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
20. Zenith Kumar
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
21. Gaurav Kumar
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
22. Manasvi Rajani
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
23. Kishan Sahu
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
24. Sunil Meena
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
25. Supriya Kaushik De
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
26. Kumar Debashish
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
27. Mehul Tilala
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
28. Satish Narayan
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
29. Rahul Kumar
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
30. Harshita Garg
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
31. Kunal Telgote
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
32. Ralph Lobo
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
33. Girish Dange
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India

Specification

DESC: FORM 2

THE PATENTS ACT, 1970
(39 of 1970)
&
THE PATENTS RULES, 2003

COMPLETE SPECIFICATION
(See section 10 and rule 13)
1. TITLE OF THE INVENTION

SYSTEM AND METHOD FOR INFERENCING ONE OR MORE TRAINED MODELS
2. APPLICANT(S)
NAME NATIONALITY ADDRESS
JIO PLATFORMS LIMITED INDIAN OFFICE-101, SAFFRON, NR. CENTRE POINT, PANCHWATI 5 RASTA, AMBAWADI, AHMEDABAD 380006, GUJARAT, INDIA
3.PREAMBLE TO THE DESCRIPTION

THE FOLLOWING SPECIFICATION PARTICULARLY DESCRIBES THE NATURE OF THIS INVENTION AND THE MANNER IN WHICH IT IS TO BE PERFORMED.

FIELD OF THE INVENTION
[0001] The present invention relates to the field of network data analytics for predictive network management and, more specifically, to a system and a method thereof to infer selected models which are already trained or are being trained in a user interactive manner.
BACKGROUND OF THE INVENTION
[0002] With advancement of technology, there is a demand for a telecommunication service to induce up-to-date features into the scope of provision. To enhance user experience and implement advanced monitoring mechanisms, prediction methodologies are being incorporated in a network management service. An advanced prediction system integrated with an Artificial intelligence (AI)/Machine learning (ML) system excels in executing a wide array of algorithms and predictive tasks. The advanced prediction system is underpinned by the capabilities of Large Language Models (LLMs). Its primary mission centers around the comprehensive analysis of both network data and operational data, capitalizing on the advanced techniques of machine learning (ML) to glean profound insights. The LLM retrains the existing trained model for new data source or existing data source.
[0003] An advanced AI/ML integrated system is capable of data analysis and making predictions. There are trained models developed by users to perform specific predictions or analyses. Some AI/ML predictive systems implement large language models (LLMs) as a service, enabling the retraining of existing models such as GPT-J, LLaMA2, Bloom, GPT-Neo, and Falcon with either existing or new data sources. This approach helps users save time and optimize system resources.
[0004] However, these systems lack an interactive inference mechanism. Contemporary predictive systems do not allow authorized users to train and evaluate models developed by others. This limitation results in unnecessary storage and resource usage, requiring the deployment and training of a new model each time a prediction or analysis is needed, leading to wasted time, resources, and human effort.
[0005] There is a need for a reliable, interactive inference mechanism that enables users to deploy any model within a network system. Furthermore, a system and method are required to infer selected models in a predictive system and retrain them as necessary.
SUMMARY OF THE INVENTION
[0006] One or more embodiments of the present disclosure provide a system and a method for inferencing one or more trained models.
[0007] In one aspect of the present invention, the method for inferencing one or more trained models is disclosed. The method includes retrieving, by one or more processors, data from at least one of: an existing data source or a new data source. Further, the method includes categorizing, by the one or more processors, the data as at least one of: historic data and current data. Further, the method includes pre-processing, by the one or more processors, the historic data and the current data. Further, the method includes autotuning, by the one or more processors, one or more hyperparameters for the pre-processed historic data and the current data. Further, the method includes retraining, by the one or more processors, the one or more trained model with the historic data and the current data. The historic data and the current data are pre-processed and autotuned with the one or more hyperparameters. Further, the method includes generating, by the one or more processors, utilizing the one or more re-trained models, one or more inferences based on a user inputting a portion of a search query via a user interface.
[0008] In an embodiment, retrieving, data from the existing data source includes selecting the existing data source via the user interface by a user.
[0009] In an embodiment, retrieving, the data from the new data source, includes creating, by the one or more processors, the new data source when the user selects one or more data sources from a list of data sources via the user interface, pulling, by the one or more processors, the data from one or more data sources and storing the data at the new data source, and retrieving, by the one or more processors, the data from the new data source.
[0010] In an embodiment, the new data source includes at least one of, a file input, a source path, input stream, Hypertext Transfer Protocol2 (HTTP2), Hadoop Distributed File System (HDFS) and a Network Attached Storage (NAS).
[0011] In an embodiment, the data is categorized as at least one of, the historic data and the current data based on a time of generation of the data.
[0012] In an embodiment, pre-processing of the historic data and the current data, includes includes at least one of, cleaning and normalizing text content, removing and/or formatting tags, removing irrelevant elements and removing noise.
[0013] In an embodiment, further, the method includes creating, by the one or more processors, a training name and a model name for retraining the one or more trained model based on receiving the training name and the user selecting the model’s name from a list of model names via the user interface. Further, the method includes setting, by the one or more processors, a version based on the created training name. Further, the method includes allocating, by the one or more processors, one or more network elements for training the model based on the user selecting the one or more network elements for training the model via the user interface.
[0014] In an embodiment, the method further includes notifying, by the one or more processors, the user, a status of re-training the one or more trained models. The status of notifying the user of training the one or more models, includes at least one of, status of completion of retraining the existing trained one or more models utilizing one or more identifiers including at least one of, training name, model name, version, type and/or name of the data source used and one or more actions including at least one of, retrain or delete the trained model.
[0015] In an embodiment, generating, utilizing the one or more re-trained models, one or more inferences based on the user inputting the portion of the search query via the user interface, includes the steps of: checking, by the one or more processors, utilizing the one or more re-trained models, whether characters of the at least the portion of the search query is present in one or more historic search queries present in the historic data, constructing, by the one or more processors, utilizing the one or more re-trained models, one or more complete search queries by adding remaining characters from the one or more historic search queries to the at least the portion of the search query, when determined that the characters of the at least the portion of the search query is present in the one or more historic search queries based on checking, autosuggesting, by the one or more processors, utilizing the one or more re-trained models, the one or more complete search queries to the user, and generating, by the one or more processors, utilizing the one or more re-trained models, the one or more inferences based on checking similar inferences present in the trained data using the complete search query.
[0016] In an embodiment, the one or more inferences are generated in a format including at least one of, a text based format.
[0017] In an embodiment, the method includes the step of: providing, by the one or more processors, a plurality of options to select the one or more trained models for inferencing, wherein the plurality of options includes at least one of, options of the one or more trained models trained by the user or the options of the one or more trained models trained by third parties.
[0018] In one aspect of the present invention, the system for inferencing one or more trained modelsis disclosed. The system includes a retrieving unit, a categorizing unit, a pre-processing unit, a tuning unit, a training unit, a generating unit and a notifying unit. The retrieving unit is configured to retrieve, data from at least one of, an existing data source or a new data source. The categorizing unit is configured to categorize the data as at least one of, historic data and current data. The pre-processing unit is configured to automatically pre-process, the historic data and the current data. The tuning unit is configured to autotune, one or more hyperparameters for the pre-processed historic data and the current data. The training unit is configured to retrain the one or more trained model with the historic data and the current data. The historic data and the current data are pre-processed and autotuned with the one or more hyperparameters. The generating unit is configured to, generate, utilizing the one or more re-trained models, one or more inferences based on a user inputting a portion of a search query via the user interface The notifying unit is configured to notify a status of retraining the one or more trained model to a user.
[0019] In one aspect of the present invention, a non-transitory computer-readable medium having stored thereon computer-readable instructions is provided. The computer-readable instructions causes the processor to retrieve, data from at least one of: an existing data source or a new data source. Further, the processor categorizes the data as at least one of, historic data and current data. Further, the processor pre-processes the historic data and the current data. Further, the processor autotunes one or more hyperparameters for the pre-processed historic data and the current data. Further, the processor retrains the one or more trained model with the historic data and the current data. The historic data and the current data are pre-processed and autotuned with the one or more hyperparameters. Further, the processor generates, utilizing the one or more re-trained models, one or more inferences based on a user inputting a portion of a search query via a user interface.
[0020] Other features and aspects of this invention will be apparent from the following description and the accompanying drawings. The features and advantages described in this summary and in the following detailed description are not all-inclusive, and particularly, many additional features and advantages will be apparent to one of ordinary skill in the relevant art, in view of the drawings, specification, and claims hereof. Moreover, it should be noted that the language used in the specification has been principally selected for readability and instructional purposes and may not have been selected to delineate or circumscribe the inventive subject matter, resort to the claims being necessary to determine such inventive subject matter.

BRIEF DESCRIPTION OF THE DRAWINGS
[0021] The accompanying drawings, which are incorporated herein, and constitute a part of this disclosure, illustrate exemplary embodiments of the disclosed methods and systems in which like reference numerals refer to the same parts throughout the different drawings. Components in the drawings are not necessarily to scale, emphasis instead being placed upon clearly illustrating the principles of the present disclosure. Some drawings may indicate the components using block diagrams and may not represent the internal circuitry of each component. It will be appreciated by those skilled in the art that disclosure of such drawings includes disclosure of electrical components, electronic components or circuitry commonly used to implement such components.
[0022] FIG. 1 is an exemplary block diagram of an environment for inferencing one or more trained models, according to various embodiments of the present disclosure.
[0023] FIG. 2 is a block diagram of a system of FIG. 1, according to various embodiments of the present disclosure.
[0024] FIG. 3 is an example schematic representation of the system of FIG. 1 in which various entities operations are explained, according to various embodiments of the present system.
[0025] FIG. 4 illustrates a system architecture for inferencing one or more trained models, according to various embodiments of the present system.
[0026] FIG. 5 is an exemplary flow diagram illustrating the method for inferencing one or more trained models, according to various embodiments of the present disclosure.
[0027] FIG. 6 is an example flow diagram illustrating the method for inferencing one or more trained models, according to various embodiments of the present disclosure.
[0028] Further, skilled artisans will appreciate that elements in the drawings are illustrated for simplicity and may not have necessarily been drawn to scale. For example, the flow charts illustrate the method in terms of the most prominent steps involved to help to improve understanding of aspects of the present invention. Furthermore, in terms of the construction of the device, one or more components of the device may have been represented in the drawings by conventional symbols, and the drawings may show only those specific details that are pertinent to understanding the embodiments of the present invention so as not to obscure the drawings with details that will be readily apparent to those of ordinary skill in the art having benefit of the description herein.
[0029] The foregoing shall be more apparent from the following detailed description of the invention.

DETAILED DESCRIPTION OF THE INVENTION
[0030] Some embodiments of the present disclosure, illustrating all its features, will now be discussed in detail. It must also be noted that as used herein and in the appended claims, the singular forms "a", "an" and "the" include plural references unless the context clearly dictates otherwise.
[0031] Various modifications to the embodiment will be readily apparent to those skilled in the art and the generic principles herein may be applied to other embodiments. However, one of ordinary skill in the art will readily recognize that the present disclosure including the definitions listed here below are not intended to be limited to the embodiments illustrated but is to be accorded the widest scope consistent with the principles and features described herein.
[0032] A person of ordinary skill in the art will readily ascertain that the illustrated steps detailed in the figures and here below are set out to explain the exemplary embodiments shown, and it should be anticipated that ongoing technological development will change the manner in which particular functions are performed. These examples are presented herein for purposes of illustration, and not limitation. Further, the boundaries of the functional building blocks have been arbitrarily defined herein for the convenience of the description. Alternative boundaries can be defined so long as the specified functions and relationships thereof are appropriately performed. Alternatives (including equivalents, extensions, variations, deviations, etc., of those described herein) will be apparent to persons skilled in the relevant art(s) based on the teachings contained herein. Such alternatives fall within the scope and spirit of the disclosed embodiments.
[0033] Before discussing example, embodiments in more detail, it is to be noted that the drawings are to be regarded as being schematic representations and elements that are not necessarily shown to scale. Rather, the various elements are represented such that their function and general purpose becomes apparent to a person skilled in the art. Any connection or coupling between functional blocks, devices, components, or other physical or functional units shown in the drawings or described herein may also be implemented by an indirect connection or coupling. A coupling between components may also be established over a wireless connection. Functional blocks may be implemented in hardware, firmware, software or a combination thereof.
[0034] Further, the flowcharts provided herein, describe the operations as sequential processes. Many of the operations may be performed in parallel, concurrently or simultaneously. In addition, the order of operations maybe re-arranged. The processes may be terminated when their operations are completed, but may also have additional steps not included in the figured. It should be noted, that in some alternative implementations, the functions/acts/ steps noted may occur out of the order noted in the figured. For example, two figures shown in succession may, in fact, be executed substantially concurrently, or may sometimes be executed in the reverse order, depending upon the functionality/acts involved.
[0035] Further, the terms first, second etc… may be used herein to describe various elements, components, regions, layers and/or sections, it should be understood that these elements, components, regions, layers and/or sections should not be limited by these terms. These terms are used only to distinguish one element, component, region, layer or section from another region, layer, or a section. Thus, a first element, component, region layer, or section discussed below could be termed a second element, component, region, layer, or section without departing form the scope of the example embodiments.
[0036] Spatial and functional relationships between elements (for example, between modules) are described using various terms, including “connected,” “engaged,” “interfaced,” and “coupled.” Unless explicitly described as being “direct,” when a relationship between first and second elements is described in the description below, that relationship encompasses a direct relationship where no other intervening elements are present between the first and second elements, and also an indirect relationship where one or more intervening elements are present (either spatially or functionally) between the first and second elements. In contrast, when an element is referred to as being "directly” connected, engaged, interfaced, or coupled to another element, there are no intervening elements present. Other words used to describe the relationship between elements should be interpreted in a like fashion (e.g., "between," versus "directly between," "adjacent," versus "directly adjacent," etc.).
[0037] The terminology used herein is for the purpose of describing particular example embodiments only and is not intended to be limiting. Unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which example embodiments belong. It will be further understood that terms, e.g., those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the relevant art and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein.
[0038] As used herein, the singular forms “a,” “an,” and “the,” are intended to include the plural forms as well, unless the context clearly indicates otherwise. As used herein, the terms “and/or” and “at least one of” include any and all combinations of one or more of the associated listed items. It will be further understood that the terms “comprises,” “comprising,” “includes,” and/or “including,” when used herein, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
[0039] Unless specifically stated otherwise, or as is apparent from the description, terms such as “processing” or “computing” or “calculating” or “determining” of “displaying” or the like, refer to the action and processes of a computer system, or similar electronic computing device/hardware, that manipulates and transforms data represented as physical, electronic quantities within the computer system's registers and memories into other data similarly represented as physical quantities within the computer system memories or registers or other such information storage, transmission or display devices.
[0040] Various embodiments of the present invention provide a system and a method to infer selected models which are already trained or are being trained in a user interactive manner. The system is also configured for retraining of existing trained model with the processed existing data source or the new data source so as to improve accuracy of the trained model.
[0041] FIG. 1 illustrates an exemplary block diagram of an environment (100) for inferencing one or more trained models (e.g., ML model, AI model, supervised learning, deep learning, LLM or the like), according to various embodiments of the present disclosure. The environment (100) comprises a plurality of user equipment’s (UEs) (102-1, 102-2, ……,102-n). The at least one UE (102-n) from the plurality of the UEs (102-1, 102-2, ……102-n) is configured to connect to a system (108) via a communication network (106). Hereafter, label for the plurality of UEs or one or more UEs is 102.
[0042] In accordance with yet another aspect of the exemplary embodiment, the plurality of UEs (102) may be a wireless device or a communication device that may be a part of the system (108). The wireless device or the UE (102) may include, but are not limited to, a handheld wireless communication device (e.g., a mobile phone, a smart phone, a phablet device, and so on), a wearable computer device (e.g., a head-mounted display computer device, a head-mounted camera device, a wristwatch, a computer device, and so on), a laptop computer, a tablet computer, or another type of portable computer, a media playing device, a portable gaming system, and/or any other type of computer device with wireless communication or Voice Over Internet Protocol (VoIP) capabilities. In an embodiment, the UEs (102) may include, but are not limited to, any electrical, electronic, electro-mechanical or an equipment or a combination of one or more of the above devices such as virtual reality (VR) devices, augmented reality (AR) devices, laptop, a general-purpose computer, desktop, personal digital assistant, tablet computer, mainframe computer, or any other computing device, where the computing device may include one or more in-built or externally coupled accessories including, but not limited to, a visual aid device such as camera, audio aid, a microphone, a keyboard, input devices for receiving input from a user such as touch pad, touch enabled screen, electronic pen and the like. It may be appreciated that the UEs (102) may not be restricted to the mentioned devices and various other devices may be used. A person skilled in the art will appreciate that the plurality of UEs (102) may include a fixed landline, and a landline with assigned extension within the communication network (106).
[0043] The communication network (106), may use one or more communication interfaces/protocols such as, for example, Voice Over Internet Protocol (VoIP), 802.11 (Wi-Fi), 802.15 (including Bluetooth™), 802.16 (Wi-Max), 802.22, Cellular standards such as Code Division Multiple Access (CDMA), CDMA2000, Wideband CDMA (WCDMA), Radio Frequency Identification (e.g., RFID), Infrared, laser, Near Field Magnetics, etc.
[0044] The communication network (106) includes, by way of example but not limitation, one or more of a wireless network, a wired network, an internet, an intranet, a public network, a private network, a packet-switched network, a circuit-switched network, an ad hoc network, an infrastructure network, a Public-Switched Telephone Network (PSTN), a cable network, a cellular network, a satellite network, a fiber optic network, or some combination thereof. The communication network (106) may include, but is not limited to, a Third Generation (3G) network, a Fourth Generation (4G) network, a Fifth Generation (5G) network, a Sixth Generation (6G) network, a New Radio (NR) network, a Narrow Band Internet of Things (NB-IoT) network, an Open Radio Access Network (O-RAN), and the like.
[0045] The communication network (106) may also include, by way of example but not limitation, at least a portion of one or more networks having one or more nodes that transmit, receive, forward, generate, buffer, store, route, switch, process, or a combination thereof, etc. one or more messages, packets, signals, waves, voltage or current levels, some combination thereof, or so forth. The communication network (106) may also include, by way of example but not limitation, one or more of a wireless network, a wired network, an internet, an intranet, a public network, a private network, a packet-switched network, a circuit-switched network, an ad hoc network, an infrastructure network, a Public-Switched Telephone Network (PSTN), a cable network, a cellular network, a satellite network, a fiber optic network, a VOIP or some combination thereof.
[0046] One or more network elements can be, for example, but not limited to a base station that is located in the fixed or stationary part of the communication network (106). The base station may correspond to a remote radio head, a transmission point, an access point or access node, a macro cell, a small cell, a micro cell, a femto cell, a metro cell. The base station enables transmission of radio signals to the UE (102) or a mobile transceiver. Such a radio signal may comply with radio signals as, for example, standardized by a 3rd Generation Partnership Project (3GPP) or, generally, in line with one or more of the above listed systems. Thus, a base station may correspond to a NodeB, an eNodeB, a Base Transceiver Station (BTS), an access point, a remote radio head, a transmission point, which may be further divided into a remote unit and a central unit. The 3GPP specifications cover cellular telecommunications technologies, including radio access, core network, and service capabilities, which provide a complete system description for mobile telecommunications.
[0047] The system (108) is communicatively coupled to a server (104) via the communication network (106). The server (104) can be, for example, but not limited to a standalone server, a server blade, a server rack, an application server, a bank of servers, a business telephony application server (BTAS), a server farm, a cloud server, an edge server, home server, a virtualized server, one or more processors executing code to function as a server, or the like. In an implementation, the server (104) may operate at various entities or a single entity (include, but is not limited to, a vendor side, a service provider side, a network operator side, a company side, an organization side, a university side, a lab facility side, a business enterprise side, a defense facility side, or any other facility) that provides service.
[0048] The environment (100) further includes the system (108) communicably coupled to the server (e.g., remote server or the like) (104) and each UE of the plurality of UEs (102) via the communication network (106). The remote server (104) is configured to execute the requests in the communication network (106).
[0049] The system (108) is adapted to be embedded within the remote server (104) or is embedded as an individual entity. The system (108) is designed to provide a centralized and unified view of data and facilitate efficient business operations. The system (108) is authorized to access to update/create/delete one or more parameters of their relationship between the requests for training the model, which gets reflected in real-time independent of the complexity of network.
[0050] In another embodiment, the system (108) may include an enterprise provisioning server (for example), which may connect with the remote server (104). The enterprise provisioning server provides flexibility for enterprises, ecommerce, finance to update/create/delete information related to the requests for the training the model in real time as per their business needs.
[0051] The system (108) may include, by way of example but not limitation, one or more of a standalone server, a server blade, a server rack, a bank of servers, a business telephony application server (BTAS), a server farm, hardware supporting a part of a cloud service or system, a home server, hardware running a virtualized server, one or more processors executing code to function as a server, one or more machines performing server-side functionality as described herein, at least a portion of any of the above, some combination thereof. In an implementation, system (108) may operate at various entities or single entity (for example include, but is not limited to, a vendor side, service provider side, a network operator side, a company side, an organization side, a university side, a lab facility side, a business enterprise side, ecommerce side, finance side, a defense facility side, or any other facility) that provides service.
[0052] However, for the purpose of description, the system (108) is described as an integral part of the remote server (104), without deviating from the scope of the present disclosure. Operational and construction features of the system (108) will be explained in detail with respect to the following figures.
[0053] FIG. 2 illustrates a block diagram of the system (108) provided for inferencing one or more trained models (e.g., AI model, ML model (such as LLMs), or the like), according to one or more embodiments of the present invention. As per the illustrated embodiment, the system (108) includes the one or more processors (202), the memory (204), an input/output interface unit (206), a display (208), an input device (210), and the database (214). Further the system (108) may comprise one or more processors (202). The one or more processors (202), hereinafter referred to as the processor (202) may be implemented as one or more microprocessors, microcomputers, microcontrollers, digital signal processors, central processing units, state machines, logic circuitries, single board computers, and/or any devices that manipulate signals based on operational instructions. As per the illustrated embodiment, the system (108) includes one processor. However, it is to be noted that the system (108) may include multiple processors as per the requirement and without deviating from the scope of the present disclosure.
[0054] An information related to the trained model may be provided or stored in the memory (204) of the system (108). Among other capabilities, the processor (202) is configured to fetch and execute computer-readable instructions stored in the memory (204). The memory (204) may be configured to store one or more computer-readable instructions or routines in a non-transitory computer-readable storage medium, which may be fetched and executed to create or share data packets over a network service. The memory (204) may include any non-transitory storage device including, for example, volatile memory such as RAM, or non-volatile memory such as disk memory, EPROMs, FLASH memory, unalterable memory, and the like.
[0055] The memory (204) may comprise any non-transitory storage device including, for example, volatile memory such as Random-Access Memory (RAM), or non-volatile memory such as Electrically Erasable Programmable Read-only Memory (EPROM), flash memory, and the like. In an embodiment, the system (108) may include an interface(s). The interface(s) may comprise a variety of interfaces, for example, interfaces for data input and output devices, referred to as input/output (I/O) devices, storage devices, and the like. The interface(s) may facilitate communication for the system. The interface(s) may also provide a communication pathway for one or more components of the system. Examples of such components include, but are not limited to, processing unit/engine(s) and the database (214). The processing unit/engine(s) may be implemented as a combination of hardware and programming (for example, programmable instructions) to implement one or more functionalities of the processing engine(s).
[0056] The information related to the trained model may further be configured to render on the user interface (206). The user interface (206) may include functionality similar to at least a portion of functionality implemented by one or more computer system interfaces such as those described herein and/or generally known to one having ordinary skill in the art. The user interface (206) may be rendered on the display (208), implemented using Liquid Crystal Display (LCD) display technology, Organic Light-Emitting Diode (OLED) display technology, and/or other types of conventional display technology. The display (208) may be integrated within the system (108) or connected externally. Further the input device(s) (210) may include, but not limited to, keyboard, buttons, scroll wheels, cursors, touchscreen sensors, audio command interfaces, magnetic strip reader, optical scanner, etc.
[0057] The database (214) may be communicably connected to the processor (202) and the memory (204). The database (214) may be configured to store and retrieve the request pertaining to features, or services or workflow of the system (108), access rights, attributes, approved list, and authentication data provided by an administrator. In another embodiment, the database (214) may be outside the system (108) and communicated through a wired medium and a wireless medium.
[0058] Further, the processor (202), in an embodiment, may be implemented as a combination of hardware and programming (for example, programmable instructions) to implement one or more functionalities of the processor (202). In the examples described herein, such combinations of hardware and programming may be implemented in several different ways. For example, the programming for the processor (202) may be processor-executable instructions stored on a non-transitory machine-readable storage medium and the hardware for the processor (202) may comprise a processing resource (for example, one or more processors), to execute such instructions. In the present examples, the memory (204) may store instructions that, when executed by the processing resource, implement the processor (202). In such examples, the system (108) may comprise the memory (204) storing the instructions and the processing resource to execute the instructions, or the memory (204) may be separate but accessible to the system (108) and the processing resource. In other examples, the processor (202) may be implemented by an electronic circuitry.
[0059] In order for the system (108) to inference the one or more trained models, the processor (202) includes a retrieving unit (216), a categorizing unit (218), a pre-processing unit (220), a tuning unit (222), a training unit (224), a notifying unit (226), a creating unit (228), a version setting unit (230), an allocating unit (232), a generating unit (234) and an option unit (236). The retrieving unit (216), the categorizing unit (218), the pre-processing unit (220), the tuning unit (222), the training unit (224), the notifying unit (226), the creating unit (228), the version setting unit (230), the allocating unit (232), the generating unit (234) and the option unit (236) may be implemented as a combination of hardware and programming (for example, programmable instructions) to implement one or more functionalities of the processor (202). In the examples described herein, such combinations of hardware and programming may be implemented in several different ways. For example, the programming for the processor (202) may be processor-executable instructions stored on a non-transitory machine-readable storage medium and the hardware for the processor (202) may comprise a processing resource (for example, one or more processors), to execute such instructions. In the present examples, the memory (204) may store instructions that, when executed by the processing resource, implement the processor. In such examples, the system (108) may comprise the memory (204) storing the instructions and the processing resource to execute the instructions, or the memory (204) may be separate but accessible to the system (108) and the processing resource. In other examples, the processor (202) may be implemented by the electronic circuitry.
[0060] In order for the system (108) to inference the one or more trained models, the retrieving unit (216), the categorizing unit (218), the pre-processing unit (220), the tuning unit (222), the training unit (224), the notifying unit (226), the creating unit (228), the version setting unit (230), the allocating unit (232), the generating unit (234) the option unit (236) are communicably coupled to each other.
[0061] The retrieving unit (216) retrieves data from an existing data source or a new data source. The existing data source can be, for example, but not limited to Generative Pre-Trained Transformers-Jumbo (GPT-J), Large Language Model Meta AI2 (LAMA2), Bloom, Generative Pre-Trained Transformers -neo (GPT-neo) and Falcon etc. In an embodiment, the retrieving unit (216) retrieves the data from the existing data source by selecting the existing data source via the user interface (206) by the user. In an example, if the user selects the existing data source then, the user of the system (108) will select the data source from the list of existing data sources.
[0062] In another embodiment, the retrieving unit (216) creates the new data source when the user selects one or more data sources from a list of data sources via the user interface (206). Further, the retrieving unit (216) pulls the data from one or more data sources and stores the data at the new data source. Further, the retrieving unit (216) retrieves the data from the new data source. In simple terms, if the user of the system (108) selects the new data source then, the user of the system (108) creates the new data source by retrieving the data from data as file input or the data from the source path, an input stream, a Hypertext Transfer Protocol (HTTP), Hadoop Distributed File Systems (HDFS) and a network attached storage (NAS).
[0063] Further, the categorizing unit (218) categorizes the data as at least one of historic data and current data. In an embodiment, the data is categorized as at least one of: the historic data and the current data based on a time of generation of the data.
[0064] In an example, consider a retail shop company that uses the machine learning model to predict sales trends based on the historical data and the current data. The historic data means sales data from the last five years (e.g., daily sales figures, customer details or the like). This data is categorized based on the time it was generated, so any data prior to the last six months is labeled as historic. The current data means the sales data from the last six months, which reflects recent trends and changes in consumer behavior. The current data includes daily sales figures, promotional events, seasonal influences or the like. When the new sales data is generated, the categorizing unit (218) analyzes the timestamp of this data. If the data is from today or the past six months, the categorizing unit (218) categorizes the data as current data. If the data is older than six months, the categorizing unit (218) categorizes the data as the historic data.
[0065] The pre-processing unit (220) automatically pre-processes the historic data and the current data. In an embodiment, the pre-processing unit (220) cleans the historic data and the current data by removing at least one of, duplicates and correcting data types. Further, the pre-processing unit (220) transforms the cleaned historic and the current data by at least one of converting categorical variables into numerical format and encoding the data. Further, the pre-processing unit (220) removes irrelevant or redundant features from the historic and the current data. Further, the pre-processing unit (220) normalizes the historic data and the current data by converting one or more tokens to a base format. The one or more tokens are present in at least one of the existing data source or the new data source.
[0066] In an example, consider, a company wants to improve its sentiment analysis model based on the customer reviews. The dataset includes both historic data (from previous years) and current data (recent submissions). The reviews from the previous years may contain inconsistent formatting (e.g., "Great product!" vs. "great Product"). The pre-processing step would normalize these to a consistent format, such as converting all text to lowercase. Also, the recent reviews may include slang or abbreviations (e.g., "LOL" or "BFF"). The normalizing could involve replacing these with their full meanings or using a predefined dictionary to standardize expressions.
[0067] Further, older reviews might contain HTML or XML tags (e.g., Great product!). The pre-processing would involve stripping these tags to retain only the text content. Similar formatting issues can occur with recent reviews, especially if the users copy and paste from different platforms. These tags would also be removed to standardize the dataset.
[0068] Further, the reviews might include irrelevant information such as product codes or author details that do not contribute to sentiment analysis. These would be removed to streamline the data. Also, the recent reviews may include promotional links or unrelated comments. The pre-processing would involve identifying and removing these elements (e.g., promotional links or unrelated comments).
[0069] In an example, the reviews might contain repeated phrases or emojis that add noise. The pre-processing step would involve removing the repeated phrases or emojis to ensure the model focuses on the sentiment conveyed.
[0070] In an example, the pre-processing unit (220) has normalized both the historic and current data by converting the purchase amounts to a base format (for example, USD) and standardizing product categories. This ensures consistency and allows the machine learning model to effectively learn from the combined dataset, improving its accuracy in predicting customer behavior.
[0071] The tuning unit (222) autotunes one or more hyperparameters for the pre-processed historic data and the current data. In an example, the tuning unit (222) effectively automates a hyperparameter tuning process, so as to improve the ML model accuracy by utilizing both historical and current datasets. This approach helps ensure that the ML model is robust and well-suited to predict customer churn accurately.
[0072] For example, during the hyperparameter tuning, the tuning unit (222) uses a GridSearchCV to autotune the hyperparameters. In an example,
from sklearn.model_selection import GridSearchCV
param_grid = {
'n_estimators': [100, 200],
'max_depth': [None, 10, 20],
'min_samples_split': [2, 5, 10]
}
grid_search = GridSearchCV(model, param_grid, cv=5)
grid_search.fit(historical_data.drop('target', axis=1), historical_data['target'])
[0073] Further, the training unit (224) retrains the one or more trained models with the historic data and the current data. The historic data and the current data are pre-processed and autotuned with the one or more hyperparameters. After the pre-processing and autotuning, the cleaned dataset (comprising both historic and current data) can be fed into the machine learning model. This improves the model's accuracy and effectiveness in understanding sentiment by reducing variability and noise in the data.
[0074] Further, the generating unit (234) generates, utilizing the one or more re-trained models, one or more inferences based on the user inputting a portion of a search query via the user interface (206). In an embodiment, the generating unit (234) receives at least the portion of the search query from the user. Further, the generating unit (234) checks utilizing the one or more re-trained models, whether characters of the at least the portion of the search query is present in one or more historic search queries present in the trained data. Further, the generating unit (234) constructs, utilizing the one or more re-trained models, one or more complete search queries by adding remaining characters from the one or more historic search queries to the at least the portion of the search query, when determined that the characters of the at least the portion of the search query is present in the one or more historic search queries based on checking. Further, the generating unit (234) autosuggests, utilizing the one or more re-trained models, the one or more complete search queries to the user. Further, the generating unit (234) generates, utilizing the one or more re-trained models, the one or more inferences based on checking similar inferences present in the trained data using the complete search query. The one or more inferences are generated in a format including at least one text based format.
[0075] In an example, a user types "best Italian" into a search bar. The generating unit (234) receives this partial search query. Further, the generating unit (234) then checks against its re-trained models to see if "best Italian" matches any historic search queries in its dataset. Finding a match, the generating unit (234) discovers that users frequently searched for "best Italian restaurants near me" and "best Italian recipes." Using this information, the generating unit (234) constructs complete search queries: "best Italian restaurants near me" and "best Italian recipes." Then, the generating unit (234) autosuggests these complete queries to the user.
[0076] The notifying unit (226) notifies a status of retraining the one or more trained model to the user. In an embodiment, the notifying unit (226) notifies status of completion of retraining the one or more trained models utilizing one or more identifiers. The one or more identifier can be, for example, but not limited to training name, model name, version, type and/or name of the data source used. The notifying unit (226) performs the one or more actions. The one or more action can be for example but not limited to retrain the trained model or delete the trained model.
[0077] In an example, a company regularly updates its machine learning model for predicting customer behaviour based on the new data or the existing data. The training process is automated, and the team needs to be notified once the training is completed. The notifying unit (226) is responsible for sending the notifications to the users about the status of the model training. The notifying unit (226) is used to specify which model has been trained and relevant details (e.g., Model Name: CustomerChurnPredictor, Version: 2.1, Training Name: Analysis_2024, Data Source: Customer_Dataset_2024, Status: Completed Successfully).
[0078] Further, the creating unit (228) create a training name and a model name (e.g., CustomerChurnPredictor or the like) for training the model based on receiving the training name and the user selecting the model’s name from the list of model names via the user interface (206). Further, the version setting unit (230) sets a version based on the created training name. In an example, the version setting unit (230) sets the version 2.1 based on the created training name.
[0079] Further, the allocating unit (232) allocates one or more network elements (e.g., server, network functions (e.g., Access and Mobility Management Function (AMF) entity or the like)) for training the trained model based on the user selecting the one or more network elements for training the trained model via the user interface (206).
[0080] Further, the options unit (236) provides a plurality of options to select the one or more trained models for inferencing. The plurality of options includes at least one of, options of the one or more trained models trained by the user or the options of the one or more trained models trained by third parties. In the machine learning application, the options unit (236) presents the user interface (206) where the user can choose from a variety of trained models for inferencing. The user sees a list with two main categories such as User-Trained Models: "Custom Sales Prediction Model", and "User Behavior Analysis Model" and Third-Party Models: "OpenAI Text Analysis Model" and "Google Image Recognition Model". Further, the user decides to select the "Custom Sales Prediction Model" for analyzing sales data, while also keeping the option to explore the "OpenAI Text Analysis Model" for future text-based tasks. The options unit (236) allows for easy switching between these models to optimize the inferencing process. The example for training the model is explained in FIG. 4 to FIG. 6.
[0081] FIG. 3 is an example schematic representation of the system (300) of FIG. 1 in which various entities operations are explained, according to various embodiments of the present system. It is to be noted that the embodiment with respect to FIG. 3 will be explained with respect to the first UE (102-1) and the system (108) for the purpose of description and illustration and should nowhere be construed as limited to the scope of the present disclosure.
[0082] As mentioned earlier, the first UE (102-1) includes one or more primary processors (305) communicably coupled to the one or more processors (202) of the system (108). The one or more primary processors (305) are coupled with a memory (310) storing instructions which are executed by the one or more primary processors (305). Execution of the stored instructions by the one or more primary processors (305) enables the UE (102-1). The execution of the stored instructions by the one or more primary processors (305) further enables the UE (102-1) to execute the requests in the communication network (106).
[0083] As mentioned earlier, the one or more processors (202) is configured to transmit a response content related to the trained model to the UE (102-1). More specifically, the one or more processors (202) of the system (108) is configured to transmit the response content to at least one of the UE (102-1). A kernel (315) is a core component serving as the primary interface between hardware components of the UE (102-1) and the system (108). The kernel (315) is configured to provide the plurality of response contents hosted on the system (108) to access resources available in the communication network (106). The resources include one of a Central Processing Unit (CPU), memory components such as Random Access Memory (RAM) and Read Only Memory (ROM).
[0084] As per the illustrated embodiment, the system (108) includes the one or more processors (202), the memory (204), the input/output interface unit (206), the display (208), and the input device (210). The operations and functions of the one or more processors (202), the memory (204), the input/output interface unit (206), the display (208), and the input device (210) are already explained in FIG. 2. For the sake of brevity, we are not explaining the same operations (or repeated information) in the patent disclosure. Further, the processor (202) includes retrieving unit (216), the categorizing unit (218), the pre-processing unit (220), the tuning unit (222), the training unit (224), the notifying unit (226), the creating unit (228), the version setting unit (230), the allocating unit (232), the generating unit (234) and the option unit (236). The operations and functions of the retrieving unit (216), the categorizing unit (218), the pre-processing unit (220), the tuning unit (222), the training unit (224), the notifying unit (226), the creating unit (228), the version setting unit (230), the allocating unit (232), the generating unit (234) and the option unit (236) are already explained in FIG. 2. For the sake of brevity, we are not explaining the same operations (or repeated information) in the patent disclosure.
[0085] FIG. 4 illustrates a system architecture (400) for inferencing one or more trained models, according to various embodiments of the present system. The system architecture (400) includes the one or more processors (202), the memory (204), the input/output interface unit (206), the display (208), and the input device (210). The operations and functions of the one or more processors (202), the memory (204), the input/output interface unit (206), the display (208), and the input device (210) are already explained in FIG. 2. For the sake of brevity, we are not explaining the same operations (or repeated information) in the patent disclosure. Further, the processor (202) includes retrieving unit (216), the categorizing unit (218), the pre-processing unit (220), the tuning unit (222), the training unit (224), the notifying unit (226), the creating unit (228), the version setting unit (230), the allocating unit (232), the generating unit (234) and the option unit (236). The operations and functions of the retrieving unit (216), the categorizing unit (218), the pre-processing unit (220), the tuning unit (222), the training unit (224), the notifying unit (226), the creating unit (228), the version setting unit (230), the allocating unit (232), the generating unit (234) and the option unit (236) are already explained in FIG. 2. For the sake of brevity, we are not explaining the same operations (or repeated information) in the patent disclosure.
[0086] Further, the system architecture (400) includes the system (108) configured to interact with an integrated system (402) via a load balancer (404) and a data-lake (406). The system (108) is integrated with a Large Language Model (LLM) to provide provisions for optimal retaining.
[0087] The integrated system (402) collects raw data from different data sources. The load balancer (404) distributes the data source request traffic across the system architecture (400). The input device (210) is for taking the inputs from the user. By means of the input device (210), the user will give the training name for the model name. The model name is selected by the user from the list of models like GPT-J, LAMA2, Bloom, GPT-neo and Falcon etc. The LLM as a service sets the training version by default for a specified training name (for example). The user will select the execution group where the training is going to execute from the list provided by the LLM as a service. Further, the system (108) allows an entry to a new data source. In simple terms, if the user selects the new data source then, the system (108) creates a new data source by retrieving the data from data as file input, data from the source path, data as the input stream, data from the HTTP2, data from the HDFS (Hadoop Distributed File Systems) and data from NAS (network attached storage).
[0088] If the user selects existing data source then, the user will select the data source from the list of existing data sources. Further, the system (108) pre-processes the data received to normalize and clean it. On the basis of the historic data and the current data, the LLM as a service does the preprocessing such as cleaning and normalizing the text content while removing formatting tags and irrelevant elements from the data source which contain various formatting elements such as headings, tables, footnotes, page numbers, other structural components and images. The cleaning and normalizing the data is performed by removing the noise from extra rows which contain invalid column values, such as NaN, None, 0, null, or empty strings. By using the tuning unit (222), the system (108) with LLM as a service automatically evaluates the user's data, identifies the most suitable hyper-parameters to the ML algorithm that is most likely to yield the best results. The LLM as a service auto tune the hyper-parameters based on data size, model weight size and suitable for provided input. The training unit (224) executes a training schedule for the ML models. The ML training is performed on the input data. The display (208) displays a training status list which contains tabular view of the training name, the model name, the version, the data source type, the status and action like retrain and delete.
[0089] The generating unit (234) selects the training name and version from the created models list on which inference is to be performed. The generating unit (234) and the option unit (236) are configured to enable the user to ask questions to the trained model and LLM as a service gives the answer to that questions. Also, LLM as a service can generate the text to the context provided by the user.
[0090] Further, the system (108) is connected to the data-lake (406) which is a distributed database used to store the processed data and algorithm outputs. The LLM as a service stores the trained model by performing retrain of existing model on new data source or existing data source which can be used by other users also for further retraining and inference.
[0091] The system (108) introduces LLM as a service to create a new data source and train the data source on the provided input and to enable a user to retrain the model with the existing data source or the new data source on different existing trained model. This helps the user to get better results for provided data source with the new data source or the existing data source. The data analysis can be a tedious process and users require knowledge about this. Further, the system (108) having LLM as a service, gives the option of inference to the user after selecting created trained models. The user asks questions to the trained models and it will generate the answer on the basis of trained models. Also, the text generation is possible from all available trained models in the network (106).
[0092] The system (108) is configured to interact with an external and internal data sources (not shown). The system (108) may further include one or more database and is capable of interacting with one or more application server in the network (106).
[0093] Further, the system (108) may be configured to interact with various components of the network and external network by means of various Application Programming Interface (API), databases and servers or any other compatible element. The databases (214)/data lakes (406) are configured to store past data, dynamic data, and trained models for future necessity. The databases (214)/data lakes (406) are configured to store past data, dynamic data, and trained models for future necessity. The present system may further be configured to incorporate even more data into pre-processing steps if required to refine the data analysis. The pre-processing step involves extracting and normalizing the data by applying suitable operation filter, normalization, cleaning and standardization of data.
[0094] The system (108) may further be configured to incorporate even more data into pre-processing steps if required to refine the data analysis. The pre-processing step involves extracting and normalizing the data by applying suitable operation filter, normalization, cleaning and standardization of data.
[0095] In an example, the user, by means of the present system (108), first creates a training name and selects the model name. The user will give the training name. The training name (or model name) is selected by the user from the list of models like GPT-J, LAMA2, Bloom, GPT-neo and Falcon. The system (108) sets the version by the default. The LLM as a service sets the training version by default for the specified training name. For example, if the user given training name as trainingname1 then, a first time version will be set to version1.1 by default. When a second time training name is given as TrainingName1 then, the version will be set as version1.2 by default.
[0096] After creating the training name and setting of the version, the user will select the execution group where the training is going to execute from the list provided by the LLM as a service. The execution group is the group of different servers. Whatever execution groups are accessible to that particular user is displayed in the list and user has to select the execution from the list. The ML training will be performed on the selected execution group. After that, the user has to select the data source. If the user selects a new data source then, the system (108) creates a new data source by retrieving the data from the data as file input, the data from source path, the data as input stream, the data from HTTP2, the data from HDFS and the data from NAS.
[0097] If the user selects existing data source then, the user will select the data source from the list of existing data sources. The user has to select the source path. The data path is determined by providing valid credentials for the new data source and for the existing data source. Further, the user has to select the data source from drop down list of the LLM. On the basis of historic and current data, the LLM as a service does the preprocessing such as cleaning and normalizing the text content while removing formatting tags and irrelevant elements from the data source which contain various formatting elements such as headings, tables, footnotes, page numbers, other structural components and images.
[0098] The LLM also performs cleaning and normalizing on the data by removing the noise from extra rows which contain invalid column values, such as NaN, None, 0, null, or empty strings. The system (108) then executes ML training. Further, the system (108) then performs the auto-tuning of the hyper-parameters based on the data size, the model weight size and suitable for provided input. Then, the required ML training is performed on the given data by the system (108). The LLM as a service stores the trained model by performing retrain of existing model on new data source or existing data source which can be used by other users also for further retraining and inference. The LLM as a service displays the training status list which contains tabular view of the training name, the model name, the version, the data source type, the status and the action like retrain and delete. For inferring, the user has to select the training name and version from the created models list on which inference is to be performed then LLM as service inferences the selected created models. The user may ask questions to the trained model and LLM as a service gives answer to those questions. Also, LLM as a service can generate the text to the context provided by user.
[0099] The most unique aspect of this invention is the capability to use data from different sources with different format and type. The various data including data as file input, the data from the source path, the data as input stream, the data from the HTTP2, the data from the HDFS and the data from the NAS is received by the present system (108) with LLM as a service to retrain the existing model with the new data source or the existing data source. Another inventive feature is the ability to retrain an already trained model with existing data or new data as per requirement. The system (108) with LLM as a service provides the option for the user to retrain the existing model GPT-J, LAMA2, Bloom, GPT-neo and Falcon on new data source as well as existing data source. The system (108) gives the trained model after retraining the existing model on the data as file input, the data from source path, the data as input stream, the data from HTTP2, the data from HDFS and the data from NAS. Yet another inventive feature lies in the ability of the present system with LLM as a service is that the system (108) is configured to provide the option to the user to inference the trained model. The questions can be asked by the user of the trained model in the network (106) by the present system with LLM as a service. The system (108) is configured to auto-suggest some questions which can be asked on the trained data which helps the user to infer the trained model easily. The text generation is also possible by the present system (108) when required for further analysis.
[00100] The system (108) is configured to receive various data including the data as file input, the data from source path, the data as input stream, the data from HTTP2, the data from HDFS and the data from NAS to retrain the existing model with the new data source or the existing data source. The integration with LLM as a service, provides the option for the user to retrain the existing model like GPT-J, LAMA2, Bloom, GPT-neo and Falcon etc. on the new data source as well as the existing data source. The system (108) having LLM as a service gives the trained model after retraining the existing model on the data as file input, the data from source path, the data as input stream, the data from HTTP2, the data from HDFS and the data from NAS. As the system (108) is configured to automatically preprocess and tune data, system (108) becomes easier for the user to train/retrain the ML model. The system (108) having LLM as a service, also provides a training status list which contains tabular view of the training name, the model name, the version, the data source type, the status and the action. In the action column, the user can delete or retrain the trained model by seeing the training name, the model name and the version. This helps the user to retrain the model as many times on same model with new data source or existing data source with different versions.
[00101] The system (108) may further be configured to interact with the application servers, an IPM (integrated performance management) system, an FMS (Fulfillment Management System), an NMS (network management system) modules in the network (106) via an application programming interface (API) as medium of communication and may perform the process by means of various formats like JSON(JavaScript Object Notation), Python or any other compatible formats.
[00102] FIG. 5 is an exemplary flow diagram (500) illustrating the method for inferencing one or more trained models, according to various embodiments of the present disclosure.
[00103] At 502, the method includes retrieving the data from at least one of: the existing data source or the new data source. In an embodiment, the method allows the retrieving unit (216) to retrieve data from at least one of: the existing data source or the new data source.
[00104] At 504, the method includes categorizing the data as at least one of the historic data and the current data. In an embodiment, the method allows the categorizing unit (218) to categorize, the data as at least one of: the historic data and the current data.
[00105] At 506, the method includes pre-processing the historic data and the current data. In an embodiment, the method allows the pre-processing unit (220) to pre-process the historic data and the current data.
[00106] At 508, the method includes autotuning the one or more hyperparameters for the pre-processed historic data and the current data. In an embodiment, the method allows the tuning unit (222) to autotune the one or more hyperparameters for the pre-processed historic data and the current data.
[00107] At 510, the method includes retraining the one or more trained model with the historic data and the current data. The historic data and the current data are pre-processed and autotuned with the one or more hyperparameters. In an embodiment, the method allows the training unit (224) to retrain the one or more trained model with the historic data and the current data.
[00108] At 512, the method includes generating, utilizing the one or more re-trained models, one or more inferences based on the user inputting the portion of the search query via the user interface (206). In an embodiment, the method allows the generating unit (234) to, generate, utilizing the one or more re-trained models, the one or more inferences based on the user inputting the portion of the search query via the user interface (206).
[00109] FIG. 6 is an example flow diagram (600) illustrating the method for inferencing one or more trained models, according to various embodiments of the present disclosure. The method is executed by the present system (108) is integrated with LLM as a service to infer selected models which are already trained or are being trained, by any authenticated user as well as to retrain the existing trained model with efficient pre-processing of data obtained from existing or new data source.
[00110] At 602, the user of the system (108) creates the training name and selects the model name from the list of models like GPT-J, LAMA2, Bloom, GPT-neo and Falcon, by using the user interface (206). At 604, the system (108) automatically sets the version by default for the particular training name. When the second time training name is given as TrainingName1 then, the version will be set as version1.2 by default.
[00111] At 606, the user of the system (108) selects the execution group. The execution group is the group of different servers. Whatever the execution groups are accessible to that particular user is displayed in the list and the user has to select the execution from the list. The ML training will be performed on selected execution group.
[00112] At 608, the user of the system (108) selects the data source as existing or new. If the user selects the new data source then, the system (108) creates the new data source by retrieving the data from data as file input, the data from source path, the data as input stream, the data from HTTP2, the data from HDFS and the data from NAS. If the user selects the existing data source then user will select the data source from the list of existing data sources.
[00113] At 610, the system (108) with LLM as service preprocesses the data source. On the basis of historic and current data does the preprocessing. The cleaning and normalizing the text content is performed while removing formatting tags and irrelevant elements from the data source which contain various formatting elements such as headings, tables, footnotes, page numbers, other structural components and images. The cleaning and normalizing the data is performed by removing the noise from extra rows which contain invalid column values, such as NaN, None, 0, null, or empty strings.
[00114] At 612, the system (108) automatically tunes the hyper-parameters based on the data size, the model weight size and suitable for provided input. At 614, the system (108) executes the ML training. At 616, the system (108) with LLM as a service stores the trained model by performing retrain of existing model on the new data source or the existing data source. At 618, the system (108) displays the training status list which contains tabular view of training name, model name, version, data source type, status and action like retrain and delete. At 620, the user selects the training name and version from the created models list on which inference is to be performed. At 622, the system (108) with LLM as service inferences selected created models. The user can ask questions and LLM as a service will provide answer on the basis of selected trained model and generates text generation if prompted by the user.
[00115] In preferred embodiments, the method may also include various steps to collect information from network elements like servers and other network functions, trigger consecutive operational procedures etc., improve learning methodology for retraining the machine learning models and may not be considered strictly limited to the above method steps.
[00116] Below is the technical advancement of the present invention:
[00117] The system (108) enables an authenticated user to ask questions related to specific models and receive answers, to browse other models which are trained by other users if required. The system (108) includes interactive inference option that ensures better understanding of all models in the network (106), provision of retraining them as per requirement which saves storage, time, effort and resources and store the changes as well as user conversations for future analysis. The method executed by the present system (108) involves the steps to infer existing models and retrain them if required, thus optimizing the model training procedure.
[00118] The system (108) has a capability to train the ML model using various data sources. After ML training, the system (108) is configured to provide the option to save the trained model and can be used by other users also for further inference. The system (108) with LLM as service gives the user the option of inference of the trained model. The user can ask questions to the selected trained model and can get results for the same. The system (108) having the LLM as a service is also configured for generating the text for given prompt/context. This helps the user to analyse the efficiency of the data based on the results. The system (108) has the ability to preprocess different type of data sources. Based on the proposed method, the requirement of training the model from scratch has been eliminated. The system (108) offers the improved efficiency and faster operation. This efficiency improvement not only saves time but also enhances the overall performance of the algorithms. By auto-tuning the hyper-parameters, the system (108) having LLM as a service achieve better accuracy, robustness, and adaptability to different data sources.
[00119] A person of ordinary skill in the art will readily ascertain that the illustrated embodiments and steps in description and drawings (FIGS. 1-6) are set out to explain the exemplary embodiments shown, and it should be anticipated that ongoing technological development will change the manner in which particular functions are performed. These examples are presented herein for purposes of illustration, and not limitation. Further, the boundaries of the functional building blocks have been arbitrarily defined herein for the convenience of the description. Alternative boundaries can be defined so long as the specified functions and relationships thereof are appropriately performed. Alternatives (including equivalents, extensions, variations, deviations, etc., of those described herein) will be apparent to persons skilled in the relevant art(s) based on the teachings contained herein. Such alternatives fall within the scope and spirit of the disclosed embodiments.
[00120] Method steps: A person of ordinary skill in the art will readily ascertain that the illustrated steps are set out to explain the exemplary embodiments shown, and it should be anticipated that ongoing technological development will change the manner in which particular functions are performed. These examples are presented herein for purposes of illustration, and not limitation. Further, the boundaries of the functional building blocks have been arbitrarily defined herein for the convenience of the description. Alternative boundaries can be defined so long as the specified functions and relationships thereof are appropriately performed. Alternatives (including equivalents, extensions, variations, deviations, etc., of those described herein) will be apparent to persons skilled in the relevant art(s) based on the teachings contained herein. Such alternatives fall within the scope and spirit of the disclosed embodiments.
[00121] The present invention offers multiple advantages over the prior art and the above listed are a few examples to emphasize on some of the advantageous features. The listed advantages are to be read in a non-limiting manner.

REFERENCE NUMERALS
[00122] Environment - 100
[00123] UEs– 102, 102-1-102-n
[00124] Server - 104
[00125] Communication network – 106
[00126] System – 108
[00127] Processor – 202
[00128] Memory – 204
[00129] User Interface – 206
[00130] Display – 208
[00131] Input device – 210
[00132] Database – 214
[00133] Retrieving unit– 216
[00134] Categorizing unit – 218
[00135] Pre-processing unit – 220
[00136] Tuning unit - 222
[00137] Training unit – 224
[00138] Notifying unit – 226
[00139] Creating unit – 228
[00140] Version setting unit – 230
[00141] Allocating unit – 232
[00142] Generating unit – 234
[00143] Option unit - 236
[00144] System - 300
[00145] Primary processors -305
[00146] Memory– 310
[00147] Kernel– 315
[00148] System architecture – 400
[00149] Integrated system – 402
[00150] Load balancer – 404
[00151] Data-lake - 406
,CLAIMS:CLAIMS:
We Claim:
1. A method for inferencing one or more trained models, the method comprising the steps of:
retrieving, by one or more processors (202), data from at least one of, an existing data source or a new data source;
categorizing, by the one or more processors (202), the data as at least one of, historic data and current data;
pre-processing, by the one or more processors (202), the historic data and the current data;
autotuning, by the one or more processors (202), one or more hyperparameters for the pre-processed historic data and the current data;
retraining, by the one or more processors (202), the one or more trained models with the historic data and the current data, wherein the historic data and the current data are pre-processed and autotuned with the one or more hyperparameters; and
generating, by the one or more processors (202), utilizing the one or more re-trained models, one or more inferences based on a user inputting a portion of a search query via a user interface (206).

2. The method as claimed in claim 1, wherein the step of, retrieving, data from an existing data source, includes the step of:
retrieving, by the one or more processors (202), the data from the existing data source based on the user selecting the existing data source via the user interface (206).

3. The method as claimed in claim 1, wherein the step of, retrieving, data from a new data source, includes the steps of:
creating, by the one or more processors (202), the new data source when the user selects one or more data sources from a list of data sources via a user interface (206);
pulling, by the one or more processors (202), the data from one or more data sources and storing the data at the new data source; and
retrieving, by the one or more processors (202), the data from the new data source.

4. The method as claimed in claim 1, wherein the new data source includes at least one of, a file input, a source path, input stream, Hypertext Transfer Protocol2 (HTTP2), Hadoop Distributed File System (HDFS) and a Network Attached Storage (NAS).

5. The method as claimed in claim 1, wherein the data is categorized as at least one of, the historic data and the current data based on a time of generation of the data.

6. The method as claimed in claim 1, wherein the step of, pre-processing of the historic data and the current data, includes at least one of, cleaning and normalizing text content, removing and/or formatting tags, removing irrelevant elements and removing noise.

7. The method as claimed in claim 1, wherein the method further comprising the steps of:
creating, by the one or more processors (202), a training name and a model name for retraining the one or more trained models based on receiving the training name and the user selecting the model’s name from a list of model names via the user interface (206);
setting, by the one or more processors (202), a version based on the created training name; and
allocating, by the one or more processors (202), one or more network elements for training the model based on the user selecting the one or more network elements for training the model via the user interface (206).

8. The method as claimed in claim 1, wherein the method further comprises the step of:
notifying, by the one or more processors (202), the user, a status of re-training the one or more trained models, wherein the status of notifying the user of training the one or more models, includes at least one of, status of completion of retraining the existing trained one or more models utilizing one or more identifiers including at least one of, training name, model name, version, type and/or name of the data source used and one or more actions including at least one of, retrain or delete the trained mod.

9. The method as claimed in claim 1, wherein the step of, generating, utilizing the one or more re-trained models, one or more inferences based on the user inputting the portion of the search query via the user interface (206), includes the steps of:
checking, by the one or more processors, utilizing the one or more re-trained models, whether characters of the at least the portion of the search query is present in one or more historic search queries present in the historic data;
constructing, by the one or more processors, utilizing the one or more re-trained models, one or more complete search queries by adding remaining characters from the one or more historic search queries to the at least the portion of the search query, when determined that the characters of the at least the portion of the search query is present in the one or more historic search queries based on checking;
autosuggesting, by the one or more processors, utilizing the one or more re-trained models, the one or more complete search queries to the user;
generating, by the one or more processors, utilizing the one or more re-trained models, the one or more inferences based on checking similar inferences present in the trained data using the complete search query.

10. The method as claimed in claim 1, wherein the method further comprising the step of:
providing, by the one or more processors, a plurality of options to select the one or more trained models for inferencing, wherein the plurality of options includes at least one of, options of the one or more trained models trained by the user or the options of the one or more trained models trained by third parties.

11. A system (108) for inferencing one or more trained models, the system (108) comprising:
a retrieving unit (216), configured to, retrieve, data from at least one of, an existing data source or a new data source;
a categorizing unit (218), configured to, categorize, the data as at least one of, historic data and current data;
a pre-processing unit (220), configured to, automatically pre-process, the historic data and the current data;
a tuning unit (222), configured to, autotune, one or more hyperparameters for the pre-processed historic data and the current data;
a training unit (224), configured to, retrain, the one or more trained model with the historic data and the current data, wherein the historic data and the current data are pre-processed and autotuned with the one or more hyperparameters; and
a generating unit (234), configured to, generate, utilizing the one or more re-trained models, one or more inferences based on a user inputting a portion of a search query via an user interface (206).

12. The system (108) as claimed in claim 11, wherein the retrieving unit (216), retrieves, the data from the existing data source, by:
retrieving, the data from the existing data source based on the user selecting the existing data source via the user interface (206).

13. The system (108) as claimed in claim 11, wherein the retrieving unit (216), retrieves the data from the new data source, by:
creating, the new data source when the user selects one or more data sources from a list of data sources via the user interface (206);
pulling, the data from one or more data sources and storing the data at the new data source; and
retrieving, the data from the new data source.

14. The system as claimed in claim 11, wherein the new data source includes at least one of, a file input, a source path, input stream, Hypertext Transfer Protocol2 (HTTP2), Hadoop Distributed File System (HDFS) and a Network Attached Storage (NAS).

15. The system (108) as claimed in claim 11, wherein the data is categorized as at least one of, the historic data and the current data based on a time of generation of the data.

16. The system (108) as claimed in claim 11, wherein the pre-processing unit (220), pre-processes the historic data and the current data includes at least one of, cleaning and normalizing text content, removing and/or formatting tags, removing irrelevant elements and removing noise. by:

17. The system (108) as claimed in claim 11, wherein the system (108) further comprising:
a creating unit (228), configured to, create, a training name and a model name for retraining the one or more trained model based on receiving the training name and the user selecting the model’s name from a list of model names via the user interface (206);
a version setting unit (230), configured to, set, a version based on the created training name; and
an allocating unit (232), configured to, allocate, one or more network elements for training the model based on the user selecting the one or more network elements for training the model via the user interface (206).

18. The system (108) as claimed in claim 11, wherein the system (108) further comprising:
a notifying unit (226), configured to, notify, the user, a status of re-training the one or more trained models, wherein the status of notifying the user of training the one or more trained models, includes at least one of, status of completion of retraining the one or more trained models utilizing one or more identifiers including at least one of, training name, model name, version, type and/or name of the data source used and one or more actions including at least one of, retrain or delete the trained model.

19. The system (108) as claimed in claim 11, wherein the generating unit (234), generates, utilizing the one or more re-trained models, one or more inferences based on the user inputting at least the portion of the search query via the user interface, by:
receiving, at least the portion of the search query from the user;
checking, utilizing the one or more re-trained models, whether characters of the at least the portion of the search query is present in one or more historic search queries present in the trained data;
constructing, utilizing the one or more re-trained models, one or more complete search queries by adding remaining characters from the one or more historic search queries to the at least the portion of the search query, when determined that the characters of the at least the portion of the search query is present in the one or more historic search queries based on checking;
autosuggesting, by the one or more processors, utilizing the one or more re-trained models, the one or more complete search queries to the user; and
generating, by the one or more processors, utilizing the one or more re-trained models, the one or more inferences based on checking similar inferences present in the trained data using the complete search query.

20. The system (108) as claimed in claim 11, wherein the system (108) further comprising:
an options unit (236), configured to, provide, a plurality of options to select the one or more trained models for inferencing, wherein the plurality of options includes at least one of, options of the one or more trained models trained by the user or the options of the one or more trained models trained by third parties.

Documents

Application Documents

# Name Date
1 202321067264-STATEMENT OF UNDERTAKING (FORM 3) [06-10-2023(online)].pdf 2023-10-06
2 202321067264-PROVISIONAL SPECIFICATION [06-10-2023(online)].pdf 2023-10-06
3 202321067264-FORM 1 [06-10-2023(online)].pdf 2023-10-06
4 202321067264-FIGURE OF ABSTRACT [06-10-2023(online)].pdf 2023-10-06
5 202321067264-DRAWINGS [06-10-2023(online)].pdf 2023-10-06
6 202321067264-DECLARATION OF INVENTORSHIP (FORM 5) [06-10-2023(online)].pdf 2023-10-06
7 202321067264-FORM-26 [27-11-2023(online)].pdf 2023-11-27
8 202321067264-Proof of Right [12-02-2024(online)].pdf 2024-02-12
9 202321067264-DRAWING [01-10-2024(online)].pdf 2024-10-01
10 202321067264-COMPLETE SPECIFICATION [01-10-2024(online)].pdf 2024-10-01
11 Abstract.jpg 2024-11-21
12 202321067264-Power of Attorney [24-01-2025(online)].pdf 2025-01-24
13 202321067264-Form 1 (Submitted on date of filing) [24-01-2025(online)].pdf 2025-01-24
14 202321067264-Covering Letter [24-01-2025(online)].pdf 2025-01-24
15 202321067264-CERTIFIED COPIES TRANSMISSION TO IB [24-01-2025(online)].pdf 2025-01-24
16 202321067264-FORM 3 [31-01-2025(online)].pdf 2025-01-31