Sign In to Follow Application
View All Documents & Correspondence

System And Method For Retraining An Existing Trained Model

Abstract: ABSTRACT SYSTEM AND METHOD FOR RETRAINING AN EXISTING TRAINED MODEL The present disclosure relates to a method for retraining an existing trained model by one or more processors (202). The method includes retrieving data from at least one of, an existing data source or a new data source. Further, the method includes categorizing the data as at least one of, historic data and current data. Further, the method includes pre-processing the historic data and the current data. Further, the method includes autotuning one or more hyperparameters for the pre-processed historic data and the current data based on analysis of the historic data and the current data. Further, the method includes re-training the trained model with the historic data and the current data. The historic data and the current data are pre-processed and autotuned with the one or more hyperparameters. Further, the method includes notifying, at least one of, a user or one of, a service, a microservice, an application, or a component, a status of re-training the existing trained model. Ref. FIG. 5

Get Free WhatsApp Updates!
Notices, Deadlines & Correspondence

Patent Information

Application #
Filing Date
06 October 2023
Publication Number
15/2025
Publication Type
INA
Invention Field
COMPUTER SCIENCE
Status
Email
Parent Application

Applicants

JIO PLATFORMS LIMITED
OFFICE-101, SAFFRON, NR. CENTRE POINT, PANCHWATI 5 RASTA, AMBAWADI, AHMEDABAD, GUJARAT, INDIA

Inventors

1. Aayush Bhatnagar
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
2. Ankit Murarka
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
3. Jugal Kishore
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
4. Chandra Ganveer
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
5. Sanjana Chaudhary
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
6. Gourav Gurbani
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
7. Yogesh Kumar
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
8. Avinash Kushwaha
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
9. Dharmendra Kumar Vishwakarma
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
10. Sajal Soni
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
11. Niharika Patnam
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
12. Shubham Ingle
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
13. Harsh Poddar
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
14. Sanket Kumthekar
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
15. Mohit Bhanwria
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
16. Shashank Bhushan
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
17. Vinay Gayki
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
18. Aniket Khade
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
19. Durgesh Kumar
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
20. Zenith Kumar
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
21. Gaurav Kumar
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
22. Manasvi Rajani
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
23. Kishan Sahu
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
24. Sunil Meena
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
25. Supriya Kaushik De
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
26. Kumar Debashish
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
27. Mehul Tilala
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
28. Satish Narayan
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
29. Rahul Kumar
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India
30. Girish Dange
Reliance Corporate Park, Thane - Belapur Road, Ghansoli, Navi Mumbai, Maharashtra 400701, India

Specification

DESC: FORM 2

THE PATENTS ACT, 1970
(39 of 1970)
&
THE PATENTS RULES, 2003

COMPLETE SPECIFICATION
(See section 10 and rule 13)
1. TITLE OF THE INVENTION

SYSTEM AND METHOD FOR RETRAINING AN EXISTING TRAINED MODEL
2. APPLICANT(S)
NAME NATIONALITY ADDRESS
JIO PLATFORMS LIMITED INDIAN OFFICE-101, SAFFRON, NR. CENTRE POINT, PANCHWATI 5 RASTA, AMBAWADI, AHMEDABAD 380006, GUJARAT, INDIA
3.PREAMBLE TO THE DESCRIPTION

THE FOLLOWING SPECIFICATION PARTICULARLY DESCRIBES THE NATURE OF THIS INVENTION AND THE MANNER IN WHICH IT IS TO BE PERFORMED.

FIELD OF THE INVENTION
[0001] The present invention relates to the field of network data analytics for predictive network management and, more specifically, to a system and a method thereof for retraining of an existing trained model with an existing data source or a new data source so as to improve accuracy of the existing trained model.
BACKGROUND OF THE INVENTION
[0002] With the increase in number of users, the network service providers have been implementing to up-gradations to enhance a service quality so as to keep pace with such high demand. With advancement of technology, there is a demand for a telecommunication service to induce up-to-date features into the scope of provision. To enhance user experience and implement advanced monitoring mechanisms, prediction methodologies are being incorporated in a network management service. An advanced prediction system integrated with an Artificial intelligence (AI)/Machine learning (ML) system excels in executing a wide array of algorithms and predictive tasks. The advanced prediction system is underpinned by capabilities of Large Language Models (LLMs). Its primary mission centers around a comprehensive analysis of both network data and operational data, capitalizing on advanced techniques of the machine learning to glean profound insights. Also, the LLM trains the data source.
[0003] However, the data sources are not always in an appropriate format for model training, and training an already trained model is even more complex. For a new data source or for an existing data source, the ML training for the existing training model is time consuming, and resources (e.g., memory usage, or the like) for training used for the same model is not a suitable and optimal solution. Also, the user needs to have a detailed knowledge of the ML training. There is a requirement for a mechanism for retraining the existing model with the data sources which may be existing or new.
[0004] Hence, there is a requirement for a system and method thereof to retrain the existing model, as required, with data sources which may be existing or new, optimally without consuming too much time or resources.
SUMMARY OF THE INVENTION
[0005] One or more embodiments of the present disclosure provide a system and a method for retraining an existing trained model.
[0006] In one aspect of the present invention, the method for retraining the existing trained model is disclosed. The method includes retrieving, by one or more processors, data from at least one of, an existing data source or a new data source. Further, the method includes categorizing, by the one or more processors, the data as at least one of, historic data and current data. Further, the method includes pre-processing, by the one or more processors, the historic data and the current data. Further, the method includes autotuning, by the one or more processors, one or more hyperparameters for the pre-processed historic data and the current data based on analysis of the historic data and the current data. Further, the method includes re-training, by the one or more processors, the trained model with the historic data and the current data. The historic data and the current data are pre-processed and autotuned with the one or more hyperparameters. Further, the method includes notifying, by the one or more processors, a user, a status of re-training the existing trained model.
[0007] In an embodiment, retrieving, data from the existing data source, includes the step of retrieving, by the one or more processors, the data from the existing data source based on at least one of the user selecting the existing data source via a user interface , or one of, a service, microservice, component and an application transmits a command to the one or more processors to select the existing data source.
[0008] In an embodiment, retrieving, data from the new data source, includes the steps of: creating, by the one or more processors, the new data source when at least one of: the user selects one or more data sources from a list of data sources via the user interface or one of, the service, the microservice, the component and the application transmits a command to the one or more processors to select the one or more data sources, pulling, by the one or more processors, the data from one or more data sources and storing the data at the new data source, and retrieving, by the one or more processors, the data from the new data source.
[0009] In an embodiment, the data is categorized as at least one of, the historic data and the current data based on a time of generation of the data.
[0010] In an embodiment, the pre-processing of the historic data and the current data, includes at least one of, cleaning and normalizing text content, removing and/or formatting tags, removing irrelevant elements and removing noise.
[0011] In an embodiment, the method further includes creating, by the one or more processors, a training name and a model name for retraining the trained model based on receiving the training name and the user selecting the model name from a list of model names via the user interface. Further, the method includes setting, by the one or more processors, a version based on the created training name. Further, the method includes allocating, by the one or more processors, one or more network elements for retraining the existing trained model based on the user selecting the one or more network elements for retraining the existing trained model via the user interface.
[0012] In an embodiment, the status of notifying the user of re-training the existing trained model, includes at least one of, status of completion of retraining the existing trained model utilizing one or more identifiers including at least one of, training name, model name, version, type and/or name of the data source used and one or more actions including at least one of, retrain or delete the trained model.
[0013] In an embodiment, the step of, re-training, the trained model with the historic data and the current data, further includes the step of: storing, by the one or more processors, the re-trained model in a storage unit.
[0014] In one aspect of the present invention, the system for retraining the existing trained model is disclosed. The system includes a retrieving unit, a categorizing unit, a pre-processing unit, a tuning unit, a training unit and a notifying unit. The retrieving unit is configured to retrieve data from at least one of, an existing data source or a new data source. The categorizing unit is configured to categorize the data as at least one of, historic data and current data. The pre-processing unit is configured to pre-process the historic data and the current data. The tuning unit is configured to autotune, one or more hyperparameters for the pre-processed historic data and the current data based on analysis of the historic data and the current data. The training unit is configured to re-train the trained model with the historic data and the current data. The historic data and the current data are pre-processed and autotuned with the one or more hyperparameters. The notifying unit is configured to notify, at least one of, a user or one of, a service, a microservice, an application or a component, a status of re-training the existing trained model.
[0015] In one aspect of the present invention, a non-transitory computer-readable medium having stored thereon computer-readable instructions is provided. The computer-readable instructions causes the processor to retrieve data from at least one of an existing data source or a new data source. Further, the processor categorizes the data as at least one of, historic data and current data. Further, the processor pre-processes the historic data and the current data. Further, the processor autotunes one or more hyperparameters for the pre-processed historic data and the current data based on analysis of the historic data and the current data. Further, the processor re-trains the trained model with the historic data and the current data. The historic data and the current data are pre-processed and autotuned with the one or more hyperparameters. Further, the processor notifies, a user, a status of re-training the existing trained model.
[0016] Other features and aspects of this invention will be apparent from the following description and the accompanying drawings. The features and advantages described in this summary and in the following detailed description are not all-inclusive, and particularly, many additional features and advantages will be apparent to one of ordinary skill in the relevant art, in view of the drawings, specification, and claims hereof. Moreover, it should be noted that the language used in the specification has been principally selected for readability and instructional purposes and may not have been selected to delineate or circumscribe the inventive subject matter, resort to the claims being necessary to determine such inventive subject matter.

BRIEF DESCRIPTION OF THE DRAWINGS
[0017] The accompanying drawings, which are incorporated herein, and constitute a part of this disclosure, illustrate exemplary embodiments of the disclosed methods and systems in which like reference numerals refer to the same parts throughout the different drawings. Components in the drawings are not necessarily to scale, emphasis instead being placed upon clearly illustrating the principles of the present disclosure. Some drawings may indicate the components using block diagrams and may not represent the internal circuitry of each component. It will be appreciated by those skilled in the art that disclosure of such drawings includes disclosure of electrical components, electronic components or circuitry commonly used to implement such components.
[0018] FIG. 1 is an exemplary block diagram of an environment for retraining an existing trained model, according to various embodiments of the present disclosure.
[0019] FIG. 2 is a block diagram of a system of FIG. 1, according to various embodiments of the present disclosure.
[0020] FIG. 3 is an example schematic representation of the system of FIG. 1 in which various entities operations are explained, according to various embodiments of the present system.
[0021] FIG. 4 illustrates a system architecture for retraining of the existing trained model with an existing data source or a new data source, according to various embodiments of the present system.
[0022] FIG. 5 is a flow diagram illustrating the method for retraining the existing trained model, according to various embodiments of the present disclosure.
[0023] FIG. 6 is an example flow diagram illustrating the method for retraining the existing trained model, according to various embodiments of the present disclosure.
[0024] Further, skilled artisans will appreciate that elements in the drawings are illustrated for simplicity and may not have necessarily been drawn to scale. For example, the flow charts illustrate the method in terms of the most prominent steps involved to help to improve understanding of aspects of the present invention. Furthermore, in terms of the construction of the device, one or more components of the device may have been represented in the drawings by conventional symbols, and the drawings may show only those specific details that are pertinent to understanding the embodiments of the present invention so as not to obscure the drawings with details that will be readily apparent to those of ordinary skill in the art having benefit of the description herein.
[0025] The foregoing shall be more apparent from the following detailed description of the invention.

DETAILED DESCRIPTION OF THE INVENTION
[0026] Some embodiments of the present disclosure, illustrating all its features, will now be discussed in detail. It must also be noted that as used herein and in the appended claims, the singular forms "a", "an" and "the" include plural references unless the context clearly dictates otherwise.
[0027] Various modifications to the embodiment will be readily apparent to those skilled in the art and the generic principles herein may be applied to other embodiments. However, one of ordinary skill in the art will readily recognize that the present disclosure including the definitions listed here below are not intended to be limited to the embodiments illustrated but is to be accorded the widest scope consistent with the principles and features described herein.
[0028] A person of ordinary skill in the art will readily ascertain that the illustrated steps detailed in the figures and here below are set out to explain the exemplary embodiments shown, and it should be anticipated that ongoing technological development will change the manner in which particular functions are performed. These examples are presented herein for purposes of illustration, and not limitation. Further, the boundaries of the functional building blocks have been arbitrarily defined herein for the convenience of the description. Alternative boundaries can be defined so long as the specified functions and relationships thereof are appropriately performed. Alternatives (including equivalents, extensions, variations, deviations, etc., of those described herein) will be apparent to persons skilled in the relevant art(s) based on the teachings contained herein. Such alternatives fall within the scope and spirit of the disclosed embodiments.
[0029] Before discussing example, embodiments in more detail, it is to be noted that the drawings are to be regarded as being schematic representations and elements that are not necessarily shown to scale. Rather, the various elements are represented such that their function and general purpose becomes apparent to a person skilled in the art. Any connection or coupling between functional blocks, devices, components, or other physical or functional units shown in the drawings or described herein may also be implemented by an indirect connection or coupling. A coupling between components may also be established over a wireless connection. Functional blocks may be implemented in hardware, firmware, software or a combination thereof.
[0030] Further, the flowcharts provided herein, describe the operations as sequential processes. Many of the operations may be performed in parallel, concurrently or simultaneously. In addition, the order of operations maybe re-arranged. The processes may be terminated when their operations are completed, but may also have additional steps not included in the figured. It should be noted, that in some alternative implementations, the functions/acts/ steps noted may occur out of the order noted in the figured. For example, two figures shown in succession may, in fact, be executed substantially concurrently, or may sometimes be executed in the reverse order, depending upon the functionality/acts involved.
[0031] Further, the terms first, second etc… may be used herein to describe various elements, components, regions, layers and/or sections, it should be understood that these elements, components, regions, layers and/or sections should not be limited by these terms. These terms are used only to distinguish one element, component, region, layer or section from another region, layer, or a section. Thus, a first element, component, region layer, or section discussed below could be termed a second element, component, region, layer, or section without departing form the scope of the example embodiments.
[0032] Spatial and functional relationships between elements (for example, between modules) are described using various terms, including “connected,” “engaged,” “interfaced,” and “coupled.” Unless explicitly described as being “direct,” when a relationship between first and second elements is described in the description below, that relationship encompasses a direct relationship where no other intervening elements are present between the first and second elements, and also an indirect relationship where one or more intervening elements are present (either spatially or functionally) between the first and second elements. In contrast, when an element is referred to as being "directly” connected, engaged, interfaced, or coupled to another element, there are no intervening elements present. Other words used to describe the relationship between elements should be interpreted in a like fashion (e.g., "between," versus "directly between," "adjacent," versus "directly adjacent," etc.).
[0033] The terminology used herein is for the purpose of describing particular example embodiments only and is not intended to be limiting. Unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which example embodiments belong. It will be further understood that terms, e.g., those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the relevant art and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein.
[0034] As used herein, the singular forms “a,” “an,” and “the,” are intended to include the plural forms as well, unless the context clearly indicates otherwise. As used herein, the terms “and/or” and “at least one of” include any and all combinations of one or more of the associated listed items. It will be further understood that the terms “comprises,” “comprising,” “includes,” and/or “including,” when used herein, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
[0035] Unless specifically stated otherwise, or as is apparent from the description, terms such as “processing” or “computing” or “calculating” or “determining” of “displaying” or the like, refer to the action and processes of a computer system, or similar electronic computing device/hardware, that manipulates and transforms data represented as physical, electronic quantities within the computer system's registers and memories into other data similarly represented as physical quantities within the computer system memories or registers or other such information storage, transmission or display devices.
[0036] FIG. 1 illustrates an exemplary block diagram of an environment (100) for retraining an existing trained model, according to various embodiments of the present disclosure. The environment (100) comprises a plurality of user equipment’s (UEs) (102-1, 102-2, ……,102-n). The at least one UE (102-n) from the plurality of the UEs (102-1, 102-2, ……102-n) is configured to connect to a system (108) via a communication network (106). Hereafter, label for the plurality of UEs or one or more UEs is 102.
[0037] In accordance with yet another aspect of the exemplary embodiment, the plurality of UEs (102) may be a wireless device or a communication device that may be a part of the system (108). The wireless device or the UE (102) may include, but are not limited to, a handheld wireless communication device (e.g., a mobile phone, a smart phone, a phablet device, and so on), a wearable computer device (e.g., a head-mounted display computer device, a head-mounted camera device, a wristwatch, a computer device, and so on), a laptop computer, a tablet computer, or another type of portable computer, a media playing device, a portable gaming system, and/or any other type of computer device with wireless communication or Voice Over Internet Protocol (VoIP) capabilities. In an embodiment, the UEs (102) may include, but are not limited to, any electrical, electronic, electro-mechanical or an equipment or a combination of one or more of the above devices such as virtual reality (VR) devices, augmented reality (AR) devices, laptop, a general-purpose computer, desktop, personal digital assistant, tablet computer, mainframe computer, or any other computing device, where the computing device may include one or more in-built or externally coupled accessories including, but not limited to, a visual aid device such as camera, audio aid, a microphone, a keyboard, input devices for receiving input from a user such as touch pad, touch enabled screen, electronic pen and the like. It may be appreciated that the UEs (102) may not be restricted to the mentioned devices and various other devices may be used. A person skilled in the art will appreciate that the plurality of UEs (102) may include a fixed landline, and a landline with assigned extension within the communication network (106).
[0038] The communication network (106), may use one or more communication interfaces/protocols such as, for example, Voice Over Internet Protocol (VoIP), 802.11 (Wi-Fi), 802.15 (including Bluetooth™), 802.16 (Wi-Max), 802.22, Cellular standards such as Code Division Multiple Access (CDMA), CDMA2000, Wideband CDMA (WCDMA), Radio Frequency Identification (e.g., RFID), Infrared, laser, Near Field Magnetics, etc.
[0039] The communication network (106) includes, by way of example but not limitation, one or more of a wireless network, a wired network, an internet, an intranet, a public network, a private network, a packet-switched network, a circuit-switched network, an ad hoc network, an infrastructure network, a Public-Switched Telephone Network (PSTN), a cable network, a cellular network, a satellite network, a fiber optic network, or some combination thereof. The communication network (106) may include, but is not limited to, a Third Generation (3G) network, a Fourth Generation (4G) network, a Fifth Generation (5G) network, a Sixth Generation (6G) network, a New Radio (NR) network, a Narrow Band Internet of Things (NB-IoT) network, an Open Radio Access Network (O-RAN), and the like.
[0040] The communication network (106) may also include, by way of example but not limitation, at least a portion of one or more networks having one or more nodes that transmit, receive, forward, generate, buffer, store, route, switch, process, or a combination thereof, etc. one or more messages, packets, signals, waves, voltage or current levels, some combination thereof, or so forth. The communication network (106) may also include, by way of example but not limitation, one or more of a wireless network, a wired network, an internet, an intranet, a public network, a private network, a packet-switched network, a circuit-switched network, an ad hoc network, an infrastructure network, a Public-Switched Telephone Network (PSTN), a cable network, a cellular network, a satellite network, a fiber optic network, a VOIP or some combination thereof.
[0041] One or more network elements can be, for example, but not limited to a base station that is located in the fixed or stationary part of the communication network (106). The base station may correspond to a remote radio head, a transmission point, an access point or access node, a macro cell, a small cell, a micro cell, a femto cell, a metro cell. The base station enables transmission of radio signals to the UE (102) or a mobile transceiver. Such a radio signal may comply with radio signals as, for example, standardized by a 3rd Generation Partnership Project (3GPP) or, generally, in line with one or more of the above listed systems. Thus, a base station may correspond to a NodeB, an eNodeB, a Base Transceiver Station (BTS), an access point, a remote radio head, a transmission point, which may be further divided into a remote unit and a central unit. The 3GPP specifications cover cellular telecommunications technologies, including radio access, core network, and service capabilities, which provide a complete system description for mobile telecommunications.
[0042] The system (108) is communicatively coupled to a server (104) via the communication network (106). The server (104) can be, for example, but not limited to a standalone server, a server blade, a server rack, an application server, a bank of servers, a business telephony application server (BTAS), a server farm, a cloud server, an edge server, home server, a virtualized server, one or more processors executing code to function as a server, or the like. In an implementation, the server (104) may operate at various entities or a single entity (include, but is not limited to, a vendor side, a service provider side, a network operator side, a company side, an organization side, a university side, a lab facility side, a business enterprise side, a defense facility side, or any other facility) that provides service.
[0043] The environment (100) further includes the system (108) communicably coupled to the server (e.g., remote server or the like) (104) and each UE of the plurality of UEs (102) via the communication network (106). The remote server (104) is configured to execute the requests in the communication network (106).
[0044] The system (108) is adapted to be embedded within the remote server (104) or is embedded as an individual entity. The system (108) is designed to provide a centralized and unified view of data and facilitate efficient business operations. The system (108) is authorized to access to update/create/delete one or more parameters of their relationship between the requests for retraining the existing trained model, which gets reflected in real-time independent of the complexity of network.
[0045] In another embodiment, the system (108) may include an enterprise provisioning server (for example), which may connect with the remote server (104). The enterprise provisioning server provides flexibility for enterprises, ecommerce, finance to update/create/delete information related to the requests for the retraining the existing trained model in real time as per their business needs.
[0046] The system (108) may include, by way of example but not limitation, one or more of a standalone server, a server blade, a server rack, a bank of servers, a business telephony application server (BTAS), a server farm, hardware supporting a part of a cloud service or system, a home server, hardware running a virtualized server, one or more processors executing code to function as a server, one or more machines performing server-side functionality as described herein, at least a portion of any of the above, some combination thereof. In an implementation, system (108) may operate at various entities or single entity (for example include, but is not limited to, a vendor side, service provider side, a network operator side, a company side, an organization side, a university side, a lab facility side, a business enterprise side, ecommerce side, finance side, a defense facility side, or any other facility) that provides service.
[0047] However, for the purpose of description, the system (108) is described as an integral part of the remote server (104), without deviating from the scope of the present disclosure. Operational and construction features of the system (108) will be explained in detail with respect to the following figures.
[0048] FIG. 2 illustrates a block diagram of the system (108) provided for retraining the existing trained model (e.g., Artificial intelligence (AI) model, machine learning (ML) model (such as Large language models (LLMs)), or the like), according to one or more embodiments of the present invention. As per the illustrated embodiment, the system (108) includes the one or more processors (202), the memory (204), an input/output interface unit (206), a display (208), an input device (210), and the database (214). Further the system (108) may comprise one or more processors (202). The one or more processors (202), hereinafter referred to as the processor (202) may be implemented as one or more microprocessors, microcomputers, microcontrollers, digital signal processors, central processing units, state machines, logic circuitries, single board computers, and/or any devices that manipulate signals based on operational instructions. As per the illustrated embodiment, the system (108) includes one processor. However, it is to be noted that the system (108) may include multiple processors as per the requirement and without deviating from the scope of the present disclosure.
[0049] An information related to the existing trained model may be provided or stored in the memory (204) of the system (108). Among other capabilities, the processor (202) is configured to fetch and execute computer-readable instructions stored in the memory (204). The memory (204) may be configured to store one or more computer-readable instructions or routines in a non-transitory computer-readable storage medium, which may be fetched and executed to create or share data packets over a network service. The memory (204) may include any non-transitory storage device including, for example, volatile memory such as RAM, or non-volatile memory such as disk memory, EPROMs, FLASH memory, unalterable memory, and the like.
[0050] The memory (204) may comprise any non-transitory storage device including, for example, volatile memory such as Random-Access Memory (RAM), or non-volatile memory such as Electrically Erasable Programmable Read-only Memory (EPROM), flash memory, and the like. In an embodiment, the system (108) may include an interface(s). The interface(s) may comprise a variety of interfaces, for example, interfaces for data input and output devices, referred to as input/output (I/O) devices, storage devices, and the like. The interface(s) may facilitate communication for the system. The interface(s) may also provide a communication pathway for one or more components of the system. Examples of such components include, but are not limited to, processing unit/engine(s) and the database (214). The processing unit/engine(s) may be implemented as a combination of hardware and programming (for example, programmable instructions) to implement one or more functionalities of the processing engine(s).
[0051] The information related to the existing trained model may further be configured to render on the user interface (206). The user interface (206) may include functionality similar to at least a portion of functionality implemented by one or more computer system interfaces such as those described herein and/or generally known to one having ordinary skill in the art. The user interface (206) may be rendered on the display (208), implemented using Liquid Crystal Display (LCD) display technology, Organic Light-Emitting Diode (OLED) display technology, and/or other types of conventional display technology. The display (208) may be integrated within the system (108) or connected externally. Further the input device(s) (210) may include, but not limited to, keyboard, buttons, scroll wheels, cursors, touchscreen sensors, audio command interfaces, magnetic strip reader, optical scanner, etc.
[0052] The database (214) may be communicably connected to the processor (202) and the memory (204). The database (214) may be configured to store and retrieve the request pertaining to features, or services or workflow of the system (108), access rights, attributes, approved list, and authentication data provided by an administrator. In another embodiment, the database (214) may be outside the system (108) and communicated through a wired medium and a wireless medium.
[0053] Further, the processor (202), in an embodiment, may be implemented as a combination of hardware and programming (for example, programmable instructions) to implement one or more functionalities of the processor (202). In the examples described herein, such combinations of hardware and programming may be implemented in several different ways. For example, the programming for the processor (202) may be processor-executable instructions stored on a non-transitory machine-readable storage medium and the hardware for the processor (202) may comprise a processing resource (for example, one or more processors), to execute such instructions. In the present examples, the memory (204) may store instructions that, when executed by the processing resource, implement the processor (202). In such examples, the system (108) may comprise the memory (204) storing the instructions and the processing resource to execute the instructions, or the memory (204) may be separate but accessible to the system (108) and the processing resource. In other examples, the processor (202) may be implemented by an electronic circuitry.
[0054] In order for the system (108) to retrain the existing trained model, the processor (202) includes a retrieving unit (216), a categorizing unit (218), a pre-processing unit (220), a tuning unit (222), a training unit (224), a notifying unit (226), a creating unit (228), a version setting unit (230), an allocating unit (232) and a storage unit (234). The retrieving unit (216), the categorizing unit (218), the pre-processing unit (220), the tuning unit (222), the training unit (224), the notifying unit (226), the creating unit (228), the version setting unit (230), the allocating unit (232) and the storage unit (234) may be implemented as a combination of hardware and programming (for example, programmable instructions) to implement one or more functionalities of the processor (202). In the examples described herein, such combinations of hardware and programming may be implemented in several different ways. For example, the programming for the processor (202) may be processor-executable instructions stored on a non-transitory machine-readable storage medium and the hardware for the processor (202) may comprise a processing resource (for example, one or more processors), to execute such instructions. In the present examples, the memory (204) may store instructions that, when executed by the processing resource, implement the processor. In such examples, the system (108) may comprise the memory (204) storing the instructions and the processing resource to execute the instructions, or the memory (204) may be separate but accessible to the system (108) and the processing resource. In other examples, the processor (202) may be implemented by the electronic circuitry.
[0055] In order for the system (108) to retrain the existing trained model, the retrieving unit (216), the categorizing unit (218), the pre-processing unit (220), the tuning unit (222), the training unit (224), the notifying unit (226), the creating unit (228), the version setting unit (230), the allocating unit (232) and the storage unit (234) are communicably coupled to each other.
[0056] The retrieving unit (216) retrieves data from an existing data source or a new data source. The existing data source or a one or more data source can be, for example, but not limited to file input, source path, input stream, Hypertext Transfer Protocol version 2 (HTTP2), Hadoop Distributed File System (HDFS) and Network Attached Storage. The existing data source of the one or more data sources include trained data of existing trained models. For example, the existing trained models include at least one of, but not limited to, Generative Pre-Trained Transformers-Jumbo (GPT-J), Large Language Model Meta AI2 (LAMA2), Bloom, Generative Pre-Trained Transformers -neo (GPT-neo) and Falcon etc. The GPT-J is a generative pre-trained transformer model designed to produce human-like text that continues from a prompt. The GPT-J is used for various natural language processing tasks, including text generation, translation, and question-answering. The LAMA 2 (Large Language Model Meta AI) is a series of state-of-the-art language models developed by Meta®. The LAMA 2 models are trained on a diverse dataset, allowing them to generate coherent and contextually relevant text. They are available for both research and commercial use, promoting open access and collaboration in the AI community. The Bloom also known as, BigScience Large Open-science Open-access Multilingual Language Model, is an open-access multilingual language model developed by the BigScience research collective. The Bloom aims to provide a powerful and accessible tool for natural language processing, promoting multilingual applications and research in a machine learning field. The GPT-Neo is an open-source language model developed by EleutherAI®, designed to replicate the capabilities of OpenAI's GPT-3. The Falcon is a series of open-source language models developed by the Technology Innovation Institute (TII). The Falcon aims to provide high-performance language models that are freely accessible to the research community and industry, supporting innovation in the field of artificial intelligence.
[0057] In an embodiment, the retrieving unit (216) retrieves the data from the existing data source based on selecting the existing data source via the user interface (206) by the user. In an example, if the user selects the existing data source then, the user of the system (108) will select the data source from the list of existing data sources. In another embodiment, the retrieving unit (216) retrieves the data from the existing data source by selecting the existing data source via the Command Line Interface (CLI). In an alternate embodiment, at least one of: a service, a microservice, a component and an application transmits a command to the one or more processors (202) to select the existing data source. In particular, a handling unit (236) of the system 108, is configured to receive the command from one of, the service, the microservice, the component and the application. Thereafter, the handling unit (236) provides information derived from the command pertaining to selection of the existing data source to the retrieving unit (216). Based on which, the retrieving unit (216) retrieves the data from the existing data source. In an example implementation, the data source is selected/transferred or received (may be via an automatic request) by executing some command at the user interface (206) or the CLI. In an example, a customer support manager logs into the application’s dashboard (UI 206) and selects a new dataset containing recent customer inquiries. They specify parameters for retraining, such as increasing the focus on specific intents (e.g., billing questions). In another example, the developer runs a command using the CLI to initiate retraining with a script that pulls in historical chat logs and current interaction data. In another example, the analytics microservice detects a spike in inquiries about a new feature and automatically sends a request to the training unit (224) to retrain the model with this specific context.
[0058] In another embodiment, the retrieving unit (216) creates the new data source when the user selects one or more data sources from a list of data sources via the user interface (206) or the CLI. In an alternate embodiment, one of: the service, the microservice, the component and the application transmits the command to the retrieving unit (216) to select the one or more data sources. Further, the retrieving unit (216) pulls the data from one or more data sources and stores the data at the new data source. Further, the retrieving unit (216) retrieves the data from the new data source. In simple terms, if the user of the system (108) selects the new data source then, the user of the system (108) creates the new data source by retrieving the data from data as file input or the data from the source path, an input stream, a Hypertext Transfer Protocol (HTTP), Hadoop Distributed File Systems (HDFS) and a network attached storage (NAS).
[0059] Further, the categorizing unit (218) categorizes the data as at least one of historic data and current data. In an embodiment, the data is categorized as at least one of, the historic data and the current data based on a time of generation of the data.
[0060] In an example, consider a retail company that uses the machine learning model to predict sales trends based on the historical data and the current data. The historic data means sales data from the last five years (e.g., daily sales figures, customer details or the like). This data is categorized based on the time it was generated, so any data prior to the last six months is labeled as historic. The current data means the sales data from the last six months, which reflects recent trends and changes in consumer behavior. The current data includes daily sales figures, promotional events, seasonal influences or the like. When the new sales data is generated, the categorizing unit (218) analyzes the timestamp of this data. If the data is from today or the past six months, the categorizing unit (218) categorizes the data as current data. If the data is older than six months, the categorizing unit (218) categorizes the data as the historic data.
[0061] The pre-processing unit (220) pre-processes the historic data and the current data. In an embodiment, the pre-processing of the historic data and the current data includes at least one of: cleaning and normalizing text content, removing and/or formatting tags, removing irrelevant elements and removing noise.
[0062] The tuning unit (222) autotunes one or more hyperparameters for the pre-processed historic data and the current data based on analysis of patterns and trends of the historic data and the current data. In an example, the tuning unit (222) effectively automates a hyperparameter tuning process, so as to improve the ML model accuracy by utilizing both historical and current datasets. This approach helps ensure that the ML model is robust and well-suited to predict customer churn accurately.
[0063] The training unit (224) re-trains the trained model with the historic data and the current data. The historic data and the current data are pre-processed and autotuned with the one or more hyperparameters. After the pre-processing and autotuning, the cleaned dataset (comprising both historic and current data) can be fed into the machine learning model. This improves the model's accuracy and effectiveness in understanding sentiment by reducing variability and noise in the data. In an embodiment, the trained models include at least one of, but not limited to, Generative Pre-Trained Transformers-Jumbo (GPT-J), Large Language Model Meta AI2 (LAMA2), Bloom, Generative Pre-Trained Transformers -neo (GPT-neo) and Falcon etc.
[0064] The notifying unit (226) notifies a status of re-training the existing trained model to the user. In an embodiment, the notifying unit (226) notifies status of completion of re-training the existing trained model utilizing one or more identifiers. The one or more identifier can be, for example, but not limited to training name, model name, version, type and/or name of the data source used. The notifying unit (226) performs the one or more actions. The one or more action can be for example but not limited to retrain the trained model or delete the trained model. The re-trained model is stored in the storage unit (234).
[0065] In an example, a company regularly updates its machine learning model for predicting customer behaviour based on the new data or the existing data. The re-training process is automated, and the team needs to be notified once the re-training is completed. The notifying unit (226) is responsible for sending the notifications to the users about the status of the model re-training. The notifying unit (226) is used to specify which model has been re-trained and relevant details (e.g., Model Name: CustomerChurnPredictor, Version: 2.1, Training Name: Analysis_2024, Data Source: Customer_Dataset_2024, Status: Completed Successfully).
[0066] In an alternate embodiment, the notifying unit (226) is configured to notify one of, the service, the microservice, the application and the component pertaining to the status of retraining the existing trained model. For example, the notifying unit (226) is configured to notify the status of re-training the existing trained model by transmitting an acknowledgment to one of, the service, the microservice, the application using a handling unit (236). The handling unit (236) is configured to keep a record of mappings of interaction of the entities (such as the service, microservice, application, component) with the system 108. Mappings of the interaction of the entities with the system 108 pertains to at least one of, the entities transmitting commands and/or requests to the system 108 to perform one or more actions such as for example selection of one of, the existing data source or the one or more data sources, or allocating the one or more network elements to retrain the existing trained model. Based on the mapping, the handling unit (236) informs the notifying unit 226 to which entity the acknowledgment has to be transmitted pertaining to the status of the re-training of the existing trained model. For example, let us consider that the microservice 1 had transmitted the command at the outset to select one of the existing data source or the one or more data sources, the handling unit (236) keeps a track of this event. Basis which, the handling unit (236) informs the notifying unit 226 to transmit the acknowledgment (response) to the microservice 1 pertaining to the status of retraining the existing trained model.
[0067] Further, the creating unit (228) create a training name and a model name (e.g., CustomerChurnPredictor or the like) for retraining the trained model based on receiving the training name and the user selecting the model’s name from the list of model names via the user interface (206). In an example, the creating unit (228) selects different set of combined historic data and current data. Each set can be allocated to different models selected by the creating unit (228). The re-training execution may happen over one or more processor (202) and/or over distributive computing. Further, the version setting unit (230) sets a version based on the created training name. In an example, the version setting unit (230) sets the version 2.1 based on the created training name.
[0068] Further, the allocating unit (232) allocates one or more network elements (e.g., server, network functions (e.g., Access and Mobility Management Function (AMF) entity or the like)) for retraining the existing trained model based on the user selecting the one or more network elements for retraining the existing trained model via the user interface (206). In an alternate embodiment, the allocating unit (232) allocates one or more network elements for retraining the existing trained model based on one of, the service, the microservice, the application, or the component transmitting a request to the allocating unit to (232) to select the one or more network elements for retaining the existing trained model.
[0069] The example for retraining the existing trained model is explained in FIG. 4 to FIG. 6.
[0070] FIG. 3 is an example schematic representation of the system (300) of FIG. 1 in which various entities operations are explained, according to various embodiments of the present system. It is to be noted that the embodiment with respect to FIG. 3 will be explained with respect to the first UE (102-1) and the system (108) for the purpose of description and illustration and should nowhere be construed as limited to the scope of the present disclosure.
[0071] As mentioned earlier, the first UE (102-1) includes one or more primary processors (305) communicably coupled to the one or more processors (202) of the system (108). The one or more primary processors (305) are coupled with a memory (310) storing instructions which are executed by the one or more primary processors (305). Execution of the stored instructions by the one or more primary processors (305) causes the UE (102-1) to transmit, a command to select at least one of, the existing data sources or the one or more data sources to one or more processor (202).
[0072] As mentioned earlier, the one or more processors (202) is configured to transmit a response content related to the existing trained model to the UE (102-1). More specifically, the one or more processors (202) of the system (108) is configured to transmit the response content to at least one of the UE (102-1). A kernel (315) is a core component serving as the primary interface between hardware components of the UE (102-1) and the system (108). The kernel (315) is configured to provide the plurality of response contents hosted on the system (108) to access resources available in the communication network (106). The resources include one of a Central Processing Unit (CPU), memory components such as Random Access Memory (RAM) and Read Only Memory (ROM).
[0073] As per the illustrated embodiment, the system (108) includes the one or more processors (202), the memory (204), the input/output interface unit (206), the display (208), and the input device (210). The operations and functions of the one or more processors (202), the memory (204), the input/output interface unit (206), the display (208), and the input device (210) are already explained in FIG. 2. For the sake of brevity, we are not explaining the same operations (or repeated information) in the patent disclosure. Further, the processor (202) includes retrieving unit (216), the categorizing unit (218), the pre-processing unit (220), the tuning unit (222), the training unit (224), the notifying unit (226), the creating unit (228), the version setting unit (230), the allocating unit (232) and the storage unit (234). The operations and functions of the retrieving unit (216), the categorizing unit (218), the pre-processing unit (220), the tuning unit (222), the training unit (224), the notifying unit (226), the creating unit (228), the version setting unit (230), the allocating unit (232) and the storage unit (234) are already explained in FIG. 2. For the sake of brevity, we are not explaining the same operations (or repeated information) in the patent disclosure.
[0074] FIG. 4 illustrates a system architecture (400) for retraining of the existing trained model with the existing data source or the new data source, according to various embodiments of the present system. The system architecture (400) includes the one or more processors (202), the memory (204), the input/output interface unit (206), the display (208), and the input device (210). The operations and functions of the one or more processors (202), the memory (204), the input/output interface unit (206), the display (208), and the input device (210) are already explained in FIG. 2. For the sake of brevity, we are not explaining the same operations (or repeated information) in the patent disclosure. Further, the processor (202) includes retrieving unit (216), the categorizing unit (218), the pre-processing unit (220), the tuning unit (222), the training unit (224), the notifying unit (226), the creating unit (228), the version setting unit (230), and the allocating unit (232). The operations and functions of the retrieving unit (216), the categorizing unit (218), the pre-processing unit (220), the tuning unit (222), the training unit (224), the notifying unit (226), the creating unit (228), the version setting unit (230), and the allocating unit (232) are already explained in FIG. 2. For the sake of brevity, we are not explaining the same operations (or repeated information) in the patent disclosure.
[0075] Further, the system architecture (400) includes the system (108) configured to interact with an integrated system (402) via a load balancer (404) and a data-lake (406). The system (108) is integrated with a Large Language Model (LLM) to provide provisions for optimal retaining. The integrated system (402) collects the raw data from different data sources. The load balancer (404) distributes a data request traffic between the integrated system (402) and the system (108). The input device (210) of the system (108) is taking the inputs from the user. By using the input device (210), the user will give the training name. Also, by using the input device (210), a model name is selected by the user from the list of models like GPT-J, LAMA2, Bloom, GPT-neo and Falcon etc. in the system (108), the LLM as a service sets the training version by default for the specified training name. The user of the system (108) will select the execution group where the training is going to execute from the list provided by the LLM as a service. Further, the user of the system (108) allows an entry to a new data source. In simple terms, if the user selects the new data source then, the system (108) creates the new data source by retrieving the data from the data as file input or data from the source path, input stream, HTTP2, Hadoop Distributed File Systems (HDFS) and network attached storage (NAS). If the user selects the existing data source then, the user will select the data source from the list of existing data sources.
[0076] Further, the system (108) pre-processes the data received to normalize and clean the data. On the basis of the historic data and the current data, the system (108) does the preprocessing such as cleaning and normalizing the text content while removing formatting tags and irrelevant elements from the data source which contain various formatting elements such as headings, tables, footnotes, page numbers, other structural components and images; cleaning and normalizing the data by removing the noise from extra rows which contain invalid column values, such as NaN, None, 0, null, or empty strings.
[0077] Further, the hyper-parameters will be set by auto tuning by studying the data trends and patterns for the given data. The ML training will be performed on the given data. Further, the display (208) displays the training status list which contains tabular view of training name, model name, version, data source type, status and action like retrain and delete.
[0078] Further, the system (108) is connected to the data-lake (406) which is a distributed database used to store the processed data and algorithm outputs. The LLM as a service stores the trained model by performing retrain of existing model on the new data source or the existing data source which can be used by other users also for further retraining and inference.
[0079] Further, the system (108) is configured to interact with an external and internal data sources (not shown). The system (108) may further include one or more database (218) and is capable of interacting with one or more application server in the communication network (106).
[0080] Further, the system (108) may be configured to interact with various component of the communication network (106) and external network (not shown) by means of various Application Programming Interface (API), the databases (214) and servers or any other compatible element. The databases (214)/data-lakes (406) are configured to store the past data, dynamic data, and trained models for future necessity.
[0081] Further, the system (108) may further be configured to incorporate even more data into pre-processing steps, if required, to refine the data analysis. The pre-processing step involves extracting and normalizing the data by applying suitable operation filter, normalization, cleaning and standardization of data.
[0082] The system (108) assist the user to save time by optimizing the resources of the network (106). The system (108) integrated with the LLM may be configured to create the new data source and train the new data source on the provided input. Also, the user can retrain the model with the existing data source or the new data source on different existing trained model. This helps the user to get better results for provided data source with the new data source or the existing data source.
[0083] The most unique aspect of this invention is the capability to optimally retrain the models while reducing complexity of the process, consumption of time and resources. Further, the system (108) is configured to receive various data including data as file input, data from source path, data as input stream, data from HTTP2, data from HDFS and data from NAS to retrain the existing model with new data source or existing data source. The integration with LLM as a service provides the option for user to retrain the existing model like GPT-J, LAMA2, Bloom, GPT-neo and Falcon etc. on the new data source as well as existing data source. The system, LLM as a service, gives the trained model after retraining the existing model on data as file input, data from source path, data as input stream, data from HTTP2, data from HDFS and data from NAS. As the system is configured to automatically preprocess and tune data, the operation of the system (108) becomes easier for the user to train/retrain the ML model. The system (108) having LLM as a service, also provides training status list which contains tabular view of training name, model name, version, data source type, status and action. In the action column, the user of the system (108) can delete or retrain the trained model by seeing the training name, model name and version. This helps the user to retrain the model as many times on same model with new data source or existing data source with different versions. In an alternate embodiment, one of, the service, the microservice, the application or the component, may take up a follow up action of performing the above actions such as for example, delete or retrain the trained model based on the training name, model name and version.
[0084] The present system (108) may further be configured to interact with the application servers (not shown), an integrated performance management (IPM) system (not shown), a Fulfillment Management System (FMS) (not shown), a network management system (NMS) in the network (106) via the API as medium of communication and may perform the process by means of various formats like JavaScript Object Notation (JSON), Python or any other compatible formats.
[0085] FIG. 5 is a flow diagram (500) illustrating the method for retraining the existing trained model, according to various embodiments of the present disclosure.
[0086] At 502, the method includes retrieving data from at least one of: the existing data source or the new data source. In an embodiment, the method allows the retrieving unit (216) to retrieving the data from at least one of: the existing data source or the new data source.
[0087] At 504, the method includes categorizing the data as at least one of: the historic data and the current data. In an embodiment, the method allows the categorizing unit (218) to categorize the data as at least one of: the historic data and the current data.
[0088] At 506, the method includes pre-processing the historic data and the current data. In an embodiment, the method allows the pre-processing unit (220) to pre-process the historic data and the current data.
[0089] At 508, the method includes autotuning the one or more hyperparameters for the pre-processed historic data and the current data based on analysis of the historic data and the current data. In an embodiment, the method allows the tuning unit (222) to autotune the one or more hyperparameters for the pre-processed historic data and the current data based on analysis of patterns and trends of the historic data and the current data.
[0090] At 510, the method includes re-training the trained model with the historic data and the current data. The historic data and the current data are pre-processed and autotuned with the one or more hyperparameters. In an embodiment, the method allows the training unit (224) to re-train the trained model with the historic data and the current data.
[0091] At 512, the method includes notifying the user, the status of re-training the existing trained model. In an embodiment, the method allows the notifying unit (526) to notify the status of re-training the existing trained model to the user.
[0092] FIG. 6 is an example flow diagram (600) illustrating the method for retraining the existing trained model, according to various embodiments of the present disclosure.
[0093] At 602, the user of the system (108) creates the training name and selects the model name from the list of models like GPT-J, LAMA2, Bloom, GPT-neo and Falcon. At 604, the version is set by default for the particular training name. For example, if the user given training name as ‘Training-Name-1’ then, the first time version will be set to version1.1 by default. When a second time training name is given as TrainingName1 then, the version will be set as version1.2 by default. At 606, the user of the system (108) selects the execution group. The execution group is the group of different servers. Whatever execution groups are accessible to that particular user is displayed in the list and the user has to select the execution from the list. The ML training will be performed on the selected execution group.
[0094] 608, the user selects the data source existing or new. If the user selects the new data source then, the system (108) creates the new data source by retrieving the data from the data as file input, the data from the source path, the data as the input stream, the data from the HTTP2, the data from the HDFS and the data from the NAS. If the user selects the existing data source then, the user will select the data source from the list of existing data sources.
[0095] 610, the system (108) pre-processes the data source. On the basis of the historic and the current data, the system (108) does the preprocessing such as cleaning and normalizing the text content while removing formatting tags and irrelevant elements from the data source which contain various formatting elements such as headings, tables, footnotes, page numbers, other structural components and images; cleaning and normalizing the data by removing the noise from extra rows which contain invalid column values, such as NaN, None, 0, null, or empty strings.
[0096] At 612, the hyper-parameters are auto tuned and the data is given to the ML training. The ML training is performed. At 614, the system (108), integrated with LLM as the service, stores the trained model by performing retrain of existing model on the new data source or the existing data source. At 616, the system (108) displays the training status list which contains tabular view of the training name, the model name, the version, the data source type, the status and action like retrain and delete by means of a display unit.
[0097] In preferred embodiments, the method may also include various steps to collect information from the network elements like servers and other network functions, triggers consecutive operational procedures (or finetuning or the like) etc., improve learning methodology for retraining the Machine Learning Models and may not be considered strictly limited to the above method steps.
[0098] Below is the technical advancement of the present invention:
[0099] The system and method can be used to retain the previously trained model with existing or new data sources optimally, without consuming too much time or resources. The present system (108) is configured to receive various data from various sources, to automatically preprocess and tune data making it easier for the user to retrain the ML model. The system (108) also includes large language models (LLM) as a service to provide training status list as per various selection parameters that helps the user to retrain the model as many times on the same model with new data source or existing data source with different versions. The method is executed by the system (108) for optimum retraining regime with an accurate manner.
[00100] The system, with the LLM, gives optimal solution by utilizing resources in the right way and with greater time efficiency. After training, the system (108) provides the option to save the trained model and can be used by other users also for further inference.
[00101] A person of ordinary skill in the art will readily ascertain that the illustrated embodiments and steps in description and drawings (FIGS. 1-6) are set out to explain the exemplary embodiments shown, and it should be anticipated that ongoing technological development will change the manner in which particular functions are performed. These examples are presented herein for purposes of illustration, and not limitation. Further, the boundaries of the functional building blocks have been arbitrarily defined herein for the convenience of the description. Alternative boundaries can be defined so long as the specified functions and relationships thereof are appropriately performed. Alternatives (including equivalents, extensions, variations, deviations, etc., of those described herein) will be apparent to persons skilled in the relevant art(s) based on the teachings contained herein. Such alternatives fall within the scope and spirit of the disclosed embodiments.
[00102] Method steps: A person of ordinary skill in the art will readily ascertain that the illustrated steps are set out to explain the exemplary embodiments shown, and it should be anticipated that ongoing technological development will change the manner in which particular functions are performed. These examples are presented herein for purposes of illustration, and not limitation. Further, the boundaries of the functional building blocks have been arbitrarily defined herein for the convenience of the description. Alternative boundaries can be defined so long as the specified functions and relationships thereof are appropriately performed. Alternatives (including equivalents, extensions, variations, deviations, etc., of those described herein) will be apparent to persons skilled in the relevant art(s) based on the teachings contained herein. Such alternatives fall within the scope and spirit of the disclosed embodiments.
[00103] The present invention offers multiple advantages over the prior art and the above listed are a few examples to emphasize on some of the advantageous features. The listed advantages are to be read in a non-limiting manner.

REFERENCE NUMERALS
[00104] Environment - 100
[00105] UEs– 102, 102-1-102-n
[00106] Server - 104
[00107] Communication network – 106
[00108] System – 108
[00109] Processor – 202
[00110] Memory – 204
[00111] User Interface – 206
[00112] Display – 208
[00113] Input device – 210
[00114] Database – 214
[00115] Retrieving unit– 216
[00116] Categorizing unit – 218
[00117] Pre-processing unit – 220
[00118] Tuning unit - 222
[00119] Training unit – 224
[00120] Notifying unit – 226
[00121] Creating unit – 228
[00122] Version setting unit – 230
[00123] Allocating unit - 232
[00124] Storage unit - 234
[00125] System - 300
[00126] Primary processors -305
[00127] Memory– 310
[00128] Kernel– 315
[00129] System architecture – 400
[00130] Integrated system – 402
[00131] Load balancer – 404
[00132] Data-lake - 406
,CLAIMS:CLAIMS:
We Claim:
1. A method for retraining an existing trained model, the method comprising the steps of:
retrieving, by one or more processors (202), data from at least one of, an existing data source or a new data source;
categorizing, by the one or more processors (202), the data as at least one of, historic data and current data;
pre-processing, by the one or more processors (202), the historic data and the current data;
autotuning, by the one or more processors (202), one or more hyperparameters for the pre-processed historic data and the current data based on analysis of the historic data and the current data;
re-training, by the one or more processors (202), the trained model with the historic data and the current data, wherein the historic data and the current data are pre-processed and autotuned with the one or more hyperparameters; and
notifying, by the one or more processors (202), at least one of, a user or one of, a service, a microservice, a component and an application, a status of re-training the existing trained model.

2. The method as claimed in claim 1, wherein the step of, retrieving, data from an existing data source, includes the step of:
retrieving, by the one or more processors (202), the data from the existing data source based on at least one of:
the user selecting the existing data source via a user interface (206); or
one of, a service, microservice, component and an application transmits a command to the one or more processors to select the existing data source.

3. The method as claimed in claim 1, wherein the step of, retrieving, data from a new data source, includes the steps of:
creating, by the one or more processors (202), the new data source when at least one of:
the user selects one or more data sources from a list of data sources via the user interface (206), or
one of, the service, the microservice, the component and the application transmits a command to the one or more processors to select the one or more data sources;
pulling, by the one or more processors (202), the data from one or more data sources and storing the data at the new data source; and
retrieving, by the one or more processors (202), the data from the new data source.

4. The method as claimed in claim 1, wherein the data is categorized as at least one of, the historic data and the current data based on a time of generation of the data.

5. The method as claimed in claim 1, wherein the pre-processing of the historic data and the current data, includes at least one of, cleaning and normalizing text content, removing and/or formatting tags, removing irrelevant elements and removing noise.

6. The method as claimed in claim 1, wherein the method further comprising the steps of:
creating, by the one or more processors (202), a training name and a model name for retraining the trained model based on receiving the training name and the user selecting the model name from a list of model names via the user interface (206);
setting, by the one or more processors (202), a version based on the created training name; and
allocating, by the one or more processors (202), one or more network elements for retraining the existing trained model based on one of, the user selecting the one or more network elements for retraining the existing trained model via the user interface (206), or one of, the service, the microservice, the application or the component transmitting a request to the one or more processors to select the one or more network elements for retaining the existing trained model.

7. The method as claimed in claim 1, wherein the status of notifying the user of re-training the existing trained model, includes at least one of, status of completion of retraining the existing trained model utilizing one or more identifiers including at least one of, training name, model name, version, type and/or name of the data source used and one or more actions including at least one of, retrain or delete the trained model, wherein the user is notified of training the model by at least one of, alerts or notifications.

8. The method as claimed in claim 1, wherein the step of, re-training, the trained model with the historic data and the current data, further includes the step of:
storing, by the one or more processors, the re-trained model in a storage unit.

9. The method as claimed in claim 1, wherein one or more processors notifies one of, the service, the microservice, the component and the application by, transmitting, an acknowledgement to at least one of, the service, the microservice, the component and the application pertaining to the status of re-training the existing trained model.

10. A system (108) for retraining an existing trained model, the system (108) comprising:
a retrieving unit (216), configured to, retrieve, data from at least one of, an existing data source or a new data source;
a categorizing unit (218), configured to, categorize, the data as at least one of, historic data and current data;
a pre-processing unit (220), configured to, pre-process, the historic data and the current data;
a tuning unit (222), configured to, autotune, one or more hyperparameters for the pre-processed historic data and the current data based on analysis of the historic data and the current data;
a training unit (224), configured to, re-train, the trained model with the historic data and the current data, wherein the historic data and the current data are pre-processed and autotuned with the one or more hyperparameters; and
a notifying unit (226), configured to, notify, at least one of, a user, or one of, a service, a microservice, a component and an application, a status of re-training the existing trained model.

11. The system (108) as claimed in claim 10, wherein the retrieving unit (216), retrieves, the data from the existing data source, by:
retrieving, the data from the existing data source based on at least one of:
the user selecting the existing data source via a user interface (206), or
one of, a service, microservice, component and an application transmits a command to the retrieving unit (216) to select the existing data source.

12. The system (108) as claimed in claim 10, wherein the retrieving unit (216), retrieves the data from the new data source, by:
creating, the new data source when at least one of:
the user selects one or more data sources from a list of data sources via the user interface (206); or
one of, the service, the microservice, the component and the application transmits a command to the one or more processors to select the one or more data sources;
pulling, the data from one or more data sources and storing the data at the new data source; and
retrieving, the data from the new data source.

13. The system (108) as claimed in claim 10, wherein the data is categorized as at least one of, the historic data and the current data based on a time of generation of the data.

14. The system (108) as claimed in claim 10, wherein the pre-processing of the historic data and the current data, includes at least one of, cleaning and normalizing text content, removing and/or formatting tags, removing irrelevant elements and removing noise.

15. The system (108) as claimed in claim 10, wherein the system (108) further comprising:
a creating unit (228), configured to, create, a training name and a model name for retraining the trained model based on receiving the training name and the user selecting the model name from a list of model names via the user interface (206);
a version setting unit (230), configured to, set, a version based on the created training name; and
an allocating unit (232), configured to, allocate, one or more network elements for retraining the existing trained model based on at least one of, the user selecting the one or more network elements for retraining the existing trained model via the user interface (206) or, one of, the service, the microservice, the application, or the component transmitting a request to the allocating unit (232) to select the one or more network elements for retaining the existing trained model.

16. The system (108) as claimed in claim 10, wherein the status of notifying the user of re-training the existing trained model, includes at least one of, status of completion of re-training the existing trained model utilizing one or more identifiers including at least one of, training name, model name, version, type and/or name of the data source used and one or more actions including at least one of, retrain or delete the trained model.

17. The system as claimed in claim 10, wherein the training unit (224) is further configured to, store, the re-trained model in a storage unit (234).

18. The system as claimed in claim 10, wherein the notifying unit (226) notifies one of, the service, the microservice, the component and the application by, transmitting, an acknowledgement to at least one of, the service, the microservice, the component and the application pertaining to the status of re-training the existing trained model based on information of mapping received from a handling unit (236).

19. A User Equipment (UE) (102), comprising:
one or more primary processors (305), communicatively coupled to one or more processors (205) in a network (106), wherein the one or more primary processors (305) are coupled with a memory (310) stores instructions, when executed by the one or more primary processors (305), cause the UE (102) to:
transmit, a command to select at least one of, the existing data sources or the one or more data sources to one or more processor (202), wherein the one or more processors (202) is configured to perform the steps of claim 1.

Documents

Application Documents

# Name Date
1 202321067268-STATEMENT OF UNDERTAKING (FORM 3) [06-10-2023(online)].pdf 2023-10-06
2 202321067268-PROVISIONAL SPECIFICATION [06-10-2023(online)].pdf 2023-10-06
3 202321067268-FORM 1 [06-10-2023(online)].pdf 2023-10-06
4 202321067268-FIGURE OF ABSTRACT [06-10-2023(online)].pdf 2023-10-06
5 202321067268-DRAWINGS [06-10-2023(online)].pdf 2023-10-06
6 202321067268-DECLARATION OF INVENTORSHIP (FORM 5) [06-10-2023(online)].pdf 2023-10-06
7 202321067268-FORM-26 [27-11-2023(online)].pdf 2023-11-27
8 202321067268-Proof of Right [12-02-2024(online)].pdf 2024-02-12
9 202321067268-DRAWING [07-10-2024(online)].pdf 2024-10-07
10 202321067268-COMPLETE SPECIFICATION [07-10-2024(online)].pdf 2024-10-07
11 Abstract.jpg 2024-12-30
12 202321067268-Power of Attorney [24-01-2025(online)].pdf 2025-01-24
13 202321067268-Form 1 (Submitted on date of filing) [24-01-2025(online)].pdf 2025-01-24
14 202321067268-Covering Letter [24-01-2025(online)].pdf 2025-01-24
15 202321067268-CERTIFIED COPIES TRANSMISSION TO IB [24-01-2025(online)].pdf 2025-01-24
16 202321067268-FORM 3 [31-01-2025(online)].pdf 2025-01-31