Anomaly Detection In Industrial Internet Of Things (Iot)

< Back

Anomaly Detection In Industrial Internet Of Things (Iot)

Abstract: Correlation among the data collected from devices/machines connected in an Internet of Things (IoT) network, and its interpretation still remains a challenge to perform any of the above tasks in their applications. Embodiments of the present disclosure provide systems and methods that implement a kernelized dictionary learning framework for carrying out regression to model signals having a complex nonlinear nature. A joint optimization is carried out where the regression weights are learnt together with the dictionary and coefficients. Model learnt for each machining process are used to predict signature(s) of corresponding operations of an IoT machine for the given test input. The predicted values are then compared with the actuals acquired from the machine to check for deviation. If any deviation is observed, it is regarded as a change which can be due to an anomaly or change in external factors.

Get Free WhatsApp Updates!
Notices, Deadlines & Correspondence

Patent Information

Application #

Filing Date

18 April 2018

Publication Number

43/2019

Publication Type

INA

Invention Field

ELECTRICAL

Status

ip@legasis.in

Parent Application

Patent Number

Legal Status

Grant Date

2024-01-05

Renewal Date

Applicants

Tata Consultancy Services Limited

Nirmal Building, 9th Floor, Nariman Point, Mumbai - 400021, Maharashtra, India

Inventors

1. KUMAR, Kriti

Tata Consultancy Services Limited, #152, Gopalan Global Axis H - Block, Opposite Satya Sai Hospital, ITPL Main road, EPIP Zone, Whitefield, Bangalore - 560066, Karnataka, India

2. CHANDRA, Girish Mariswamy

Tata Consultancy Services Limited, #152, Gopalan Global Axis H - Block, Opposite Satya Sai Hospital, ITPL Main road, EPIP Zone, Whitefield, Bangalore - 560066, Karnataka, India

3. KUMAR, Achanna Anil

Tata Consultancy Services Limited, #152, Gopalan Global Axis H - Block, Opposite Satya Sai Hospital, ITPL Main road, EPIP Zone, Whitefield, Bangalore - 560066, Karnataka, India

4. MAJUMDAR, Angshul

Indraprastha Institute of Information Technology, University of New Delhi, Okhla Industrial Estate, Phase III, Near Govind Puri Metro Station, New Delhi - 110020, New Delhi, India

Specification

Claims:1. A processor implemented method, comprising: receiving raw sensor data from one or more sensors pertaining to an Internet of Things (IoT) machine, wherein the one or more sensors are comprised in the IoT machine; extracting critical data parameters from the raw sensor data by performing correlation analysis on the raw sensor data, wherein the critical data parameters are specific to performance of the IoT machine; generating one or more unique Kernel Dictionary Learning Regression (KDLR) models corresponding to each operation performed by the IoT machine by kernelizing at least a portion of the extracted critical data parameters by using a Kernel Dictionary Learning Regression (KDLR) technique, wherein each of the one or more unique KDLR models comprises a Kernel dictionary (A), one or more coefficients (z), and one or more regression weights (w), wherein the one or more generated KDLR models are learnt by performing joint optimization of the Kernel dictionary (A), the one or more coefficients (z), and one or more regression weights (w) to obtain one or more updated parameters comprising an updated Kernel dictionary (A), one or more updated coefficients (z), and one or more updated regression weights (w), wherein the Kernel dictionary (A), the one or more coefficients (z), and one or more regression weights (w) are jointly optimized until a stopping criterion is reached, based on one or more inputs fed to, and one or more corresponding generated outputs by the IoT machine in a training phase; receiving an input test sensor data pertaining to one or more corresponding operations performed by the IoT machine; predicting, by using the one or more learnt KDLR models, one or more unique signatures specific to one or more critical data parameters from the input test sensor data, wherein the predicted one or more unique signatures are specific to the one or more corresponding operations performed by the IoT machine; and detecting one or more anomalies based on a comparison of (i) the predicted one or more unique signatures specific to one or more corresponding operations performed by the IoT machine and (ii) one or more actual signatures generated for an actual test sensor data. 2. The processor implemented method of claim 1, wherein the raw sensor data comprises: at least one of one or more time stamps, one or more statuses of the IoT machine, program and operation information pertaining to the IoT machine, one or more tool-axis positions of the IoT machine, spindle speed, axis feed, servo loads, spindle loads, and information specific to component identifiers and stages of operations performed by the IoT machine. 3. The processor implemented method of claim 1, further comprising mapping the detected one or more anomalies with one or more events of the IoT machine to obtained mapped data, wherein information pertaining the one or more events is stored in a database. 4. The processor implemented method of claim 4, further comprising predicting one or more conditions pertaining to the IoT machine and one or more tools associated thereof based on the mapped data. 5. The processor implemented method of claim 5, further comprising generating one or more alerts based on the mapped data and the one or more predicted conditions. 6. The processor implemented method of claim 1, wherein the critical data parameters comprise of at least one of one or more control parameters and one or more operation quality parameters. 7. The processor implemented method of claim 6, further comprising capturing one or more complex non-linear relationships between the one or more control parameters and the one or more operation quality parameters by kernelizing the one or more control parameters . 8. The processor implemented method of claim 1, further comprising tuning, by using an exhaustive grid search technique, one or more hyper parameters associated with the joint optimization for generating the one or more KDLR models. 9. The processor implemented method of claim 1, wherein the stopping criterion comprises at least one of a convergence is reached or a maximum iteration is attained. 10. A system (100) comprising: a memory (102) storing instructions and one or more modules (108); one or more communication interfaces (106); and one or more hardware processors (104) coupled to the memory (102) via the one or more communication interfaces (106), wherein the one or more hardware processors (104) are configured by the instructions to: receive raw sensor data from one or more sensors pertaining to an Internet of Things (IoT) machine, wherein the one or more sensors are comprised in the IoT machine; extract critical data parameters from the raw sensor data by performing correlation analysis on the raw sensor data, wherein the critical data parameters are specific to performance of the IoT machine; generate one or more unique Kernel Dictionary Learning Regression (KDLR) models corresponding to each operation performed by the IoT machine by kernelizing at least a portion of the extracted critical data parameters by using a Kernel Dictionary Learning Regression (KDLR) technique, wherein each of the one or more unique KDLR models comprises a Kernel dictionary (A), one or more coefficients (z), and one or more regression weights (w), wherein the one or more generated KDLR models are learnt by performing joint optimization of the Kernel dictionary (A), the one or more coefficients (z), and one or more regression weights (w) to obtain one or more updated parameters comprising an updated Kernel dictionary (A), one or more updated coefficients (z), and one or more updated regression weights (w), wherein the Kernel dictionary (A), the one or more coefficients (z), and one or more regression weights (w) are jointly optimized until a stopping criterion is reached, based on one or more inputs fed to, and one or more corresponding generated outputs by the IoT machine in a training phase; receive an input test sensor data pertaining to one or more corresponding operations performed by the IoT machine; predict, by using the one or more learnt KDLR models, one or more unique signatures specific to one or more critical data parameters from the input test sensor data, wherein the predicted one or more unique signatures are specific to the one or more corresponding operations performed by the IoT machine; and detect one or more anomalies based on a comparison of (i) the predicted one or more unique signatures specific to one or more corresponding operations performed by the IoT machine and (ii) one or more actual signatures generated for an actual test sensor data. 11. The system of claim 10, wherein the raw sensor data comprises: at least one of one or more time stamps, one or more statuses of the IoT machine, program and operation information pertaining to the IoT machine, one or more tool-axis positions of the IoT machine, spindle speed, axis feed, servo loads, spindle loads, and information specific to component identifiers and stages of operations performed by the IoT machine. 12. The system of claim 10, wherein the one or more hardware processors are further configured by the instructions to map the detected one or more anomalies with one or more events of the IoT machine to obtained mapped data, wherein information pertaining the one or more events is stored in a database. 13. The system of claim 12, wherein the one or more hardware processors are further configured by the instructions to predict one or more conditions pertaining to the IoT machine and one or more tools associated thereof based on the mapped data. 14. The system of claim 13, wherein the one or more hardware processors are further configured by the instructions to generate one or more alerts based on the mapped data and the one or more predicted conditions. 15. The system of claim 10, wherein the critical data parameters comprise of at least one of one or more control parameters and one or more operation quality parameters. 16. The system of claim 10, wherein the one or more hardware processors are further configured by the instructions to capture one or more complex non-linear relationships between the one or more control parameters and the one or more operation quality parameters by kernelizing the one or more control parameters. 17. The system of claim 10, wherein the one or more hardware processors are further configured by the instructions to tune, by using an exhaustive grid search technique, one or more hyper parameters associated with the joint optimization for generating the one or more KDLR models. 18. The system of claim 11, wherein the stopping criterion comprises at least one of a convergence is reached or a maximum iteration is attained. , Description:FORM 2 THE PATENTS ACT, 1970 (39 of 1970) & THE PATENT RULES, 2003 COMPLETE SPECIFICATION (See Section 10 and Rule 13) Title of invention: ANOMALY DETECTION IN INDUSTRIAL INTERNET OF THINGS (IoT) Applicant: Tata Consultancy Services Limited A company Incorporated in India under the Companies Act, 1956 Having address: Nirmal Building, 9th floor, Nariman point, Mumbai 400021, Maharashtra, India The following specification particularly describes the invention and the manner in which it is to be performed. TECHNICAL FIELD The disclosure herein generally relates to anomaly detection, and, more particularly, to anomaly detection in industrial Internet of Things (IoT). BACKGROUND With the advancement of technology, data analytics have evolved over time and so is the reliability on analytics. In order to understand the data and make effective use of them, it is necessary to have appropriate data-driven methods to capture the nature of data. With this understanding, one can carry out different inference tasks for example, classification, clustering, regression, and the like. However, massive size of the data collected from devices/machines connected in an Internet of Things (IoT) network, correlation among the data and its interpretation still remains a challenge to perform any of the above tasks in their applications. SUMMARY Embodiments of the present disclosure present technological improvements as solutions to one or more of the above-mentioned technical problems recognized by the inventors in conventional systems. For example, in one aspect, there is provided a processor implemented method for anomaly detection in industrial Internet of Things (IoT), comprising: receiving raw sensor data from one or more sensors pertaining to an Internet of Things (IoT) machine, wherein the one or more sensors are comprised in the IoT machine; extracting critical data parameters from the raw sensor data by performing correlation analysis on the raw sensor data, wherein the critical data parameters are specific to performance of the IoT machine; generating one or more unique Kernel Dictionary Learning Regression (KDLR) models corresponding to each operation performed by the IoT machine by kernelizing at least a portion of the extracted critical data parameters by using a Kernel Dictionary Learning Regression (KDLR) technique, wherein each of the one or more unique KDLR models comprises a Kernel dictionary (A), one or more coefficients (Z), and one or more regression weights (w), wherein the one or more generated KDLR models are learnt by performing joint optimization of the Kernel dictionary (A), the one or more coefficients (Z), and one or more regression weights (w) to obtain one or more updated parameters comprising an updated Kernel dictionary (A), one or more updated coefficients (Z), and one or more updated regression weights (w), wherein the Kernel dictionary (A), the one or more coefficients (Z), and one or more regression weights (w) are jointly optimized until a stopping criterion is reached, based on one or more inputs fed to, and one or more corresponding generated outputs by the IoT machine in a training phase, the stopping criterion comprises at least one of a convergence is reached or a maximum iteration is attained; receiving an input test sensor data pertaining to one or more corresponding operations performed by the IoT machine; predicting, by using the one or more learnt KDLR models, one or more unique signatures specific to one or more critical data parameters from the input test sensor data, wherein the predicted one or more unique signatures are specific to the one or more corresponding operations performed by the IoT machine; and detecting one or more anomalies based on a comparison of (i) the predicted one or more unique signatures specific to one or more corresponding operations performed by the IoT machine and (ii) one or more actual signatures generated for an actual test sensor data. In an embodiment, the raw sensor data may comprises: at least one of one or more time stamps, one or more statuses of the IoT machine, program and operation information pertaining to the IoT machine, one or more tool-axis positions of the IoT machine, spindle speed, axis feed, servo loads, spindle load, and information specific to component identifiers and stages of operations performed by the IoT machine. In an embodiment, the method may further comprise mapping the detected one or more anomalies with one or more events of the IoT machine to obtained mapped data, wherein information pertaining the one or more events is stored in a database. In an embodiment, the method may further comprise predicting one or more conditions pertaining to the IoT machine and one or more tools associated thereof based on the mapped data. In an embodiment, the method may further comprise generating one or more alerts based on the mapped data and the one or more predicted conditions. In an embodiment, the critical data parameters comprise of at least one of one or more control parameters and one or more operation quality parameters. In an embodiment, the step of kernelizing at least a portion of the extracted critical data parameters comprises kernelizing the one or more control parameters to capture one or more complex non-linear relationships between the one or more control parameters and the one or more operation quality parameters. In an embodiment, the method may further comprise tuning, by using an exhaustive grid search technique, one or more hyper parameters associated with the joint optimization for generating the one or more KDLR models. In another aspect, there is provided a system for anomaly detection in industrial Internet of Things (IoT), comprising: a memory storing instructions and one or more modules; one or more communication interfaces; and one or more hardware processors coupled to the memory via the one or more communication interfaces, wherein the one or more hardware processors are configured by the instructions to: receive raw sensor data from one or more sensors pertaining to an Internet of Things (IoT) machine, wherein the one or more sensors are comprised in the IoT machine; extract critical data parameters from the raw sensor data by performing correlation analysis on the raw sensor data, wherein the critical data parameters are specific to performance of the IoT machine; generate one or more unique Kernel Dictionary Learning Regression (KDLR) models corresponding to each operation performed by the IoT machine by kernelizing at least a portion of the extracted critical data parameters by using a Kernel Dictionary Learning Regression (KDLR) technique, wherein each of the one or more unique KDLR models comprises a Kernel dictionary (A), one or more coefficients (Z), and one or more regression weights (w), wherein the one or more generated KDLR models are learnt by performing joint optimization of the Kernel dictionary (A), the one or more coefficients (Z), and one or more regression weights (w) to obtain one or more updated parameters comprising an updated Kernel dictionary (A), one or more updated coefficients (Z), and one or more updated regression weights (w), wherein the Kernel dictionary (A), the one or more coefficients (Z), and one or more regression weights (w) are jointly optimized until a stopping criterion is reached, based on one or more inputs fed to, and one or more corresponding generated outputs by the IoT machine in a training phase, the stopping criterion comprises at least one of a convergence is reached or a maximum iteration is attained; receive an input test sensor data pertaining to one or more corresponding operations performed by the IoT machine; predict, by using the one or more learnt KDLR models, one or more unique signatures specific to one or more critical data parameters from the input test sensor data, wherein the predicted one or more unique signatures are specific to the one or more corresponding operations performed by the IoT machine; and detect one or more anomalies based on a comparison of (i) the predicted one or more unique signatures specific to one or more corresponding operations performed by the IoT machine and (ii) one or more actual signatures generated for an actual test sensor data. In an embodiment, the raw sensor data comprises: at least one of one or more time stamps, one or more statuses of the IoT machine, program and operation information pertaining to the IoT machine, one or more tool-axis positions of the IoT machine, spindle speed, axis feed, servo loads, spindle load, and information specific to component identifiers and stages of operations performed by the IoT machine. In an embodiment, the one or more hardware processors are further configured by the instructions to map the detected one or more anomalies with one or more events of the IoT machine to obtained mapped data, wherein information pertaining the one or more events is stored in a database. In an embodiment, the one or more hardware processors are further configured by the instructions to predict one or more conditions pertaining to the IoT machine and one or more tools associated thereof based on the mapped data. In an embodiment, the one or more hardware processors are further configured by the instructions to generate one or more alerts based on the mapped data and the one or more predicted conditions. In an embodiment, the critical data parameters comprise of at least one of one or more control parameters and one or more operation quality parameters. In an embodiment, the one or more complex non-linear relationships between the one or more control parameters and the one or more operation quality parameters are captured by kernelizing the one or more control parameters. In an embodiment, the one or more hardware processors are further configured by the instructions to tune, by using an exhaustive grid search technique, one or more hyper parameters associated with the joint optimization for generating the one or more KDLR models. In yet another aspect, there is provided one or more non-transitory machine readable information storage mediums comprising one or more instructions which when executed by one or more hardware processors causes a method comprising: receiving raw sensor data from one or more sensors pertaining to an Internet of Things (IoT) machine, wherein the one or more sensors are comprised in the IoT machine; extracting critical data parameters from the raw sensor data by performing correlation analysis on the raw sensor data, wherein the critical data parameters are specific to performance of the IoT machine; generating one or more unique Kernel Dictionary Learning Regression (KDLR) models corresponding to each operation performed by the IoT machine by kernelizing at least a portion of the extracted critical data parameters by using a Kernel Dictionary Learning Regression (KDLR) technique, wherein each of the one or more unique KDLR models comprises a Kernel dictionary (A), one or more coefficients (Z), and one or more regression weights (w), wherein the one or more generated KDLR models are learnt by performing joint optimization of the Kernel dictionary (A), the one or more coefficients (Z), and one or more regression weights (w) to obtain one or more updated parameters comprising an updated Kernel dictionary (A), one or more updated coefficients (Z), and one or more updated regression weights (w), wherein the Kernel dictionary (A), the one or more coefficients (Z), and one or more regression weights (w) are jointly optimized until a stopping criterion is reached, based on one or more inputs fed to, and one or more corresponding generated outputs by the IoT machine in a training phase, the stopping criterion comprises at least one of a convergence is reached or a maximum iteration is attained; receiving an input test sensor data pertaining to one or more corresponding operations performed by the IoT machine; predicting, by using the one or more learnt KDLR models, one or more unique signatures specific to one or more critical data parameters from the input test sensor data, wherein the predicted one or more unique signatures are specific to the one or more corresponding operations performed by the IoT machine; and detecting one or more anomalies based on a comparison of (i) the predicted one or more unique signatures specific to one or more corresponding operations performed by the IoT machine and (ii) one or more actual signatures generated for an actual test sensor data. In an embodiment, the raw sensor data may comprises: at least one of one or more time stamps, one or more statuses of the IoT machine, program and operation information pertaining to the IoT machine, one or more tool-axis positions of the IoT machine, spindle speed, axis feed, servo loads, spindle loads, and information specific to component identifiers and stages of operations performed by the IoT machine. In an embodiment, the method may further comprise mapping the detected one or more anomalies with one or more events of the IoT machine to obtained mapped data, wherein information pertaining the one or more events is stored in a database. In an embodiment, the method may further comprise predicting one or more conditions pertaining to the IoT machine and one or more tools associated thereof based on the mapped data. In an embodiment, the method may further comprise generating one or more alerts based on the mapped data and the one or more predicted conditions. In an embodiment, the critical data parameters comprise of at least one of one or more control parameters and one or more operation quality parameters. In an embodiment, the step of kernelizing at least a portion of the extracted critical data parameters by using a Kernel Dictionary Learning Regression (KDLR) technique to obtain one or more unique KDLR models comprises kernelizing the one or more control parameters to capture one or more complex non-linear relationships between the one or more control parameters and the one or more operation quality parameters. In an embodiment, the method may further comprise tuning, by using an exhaustive grid search technique, one or more hyper parameters associated with the joint optimization for generating the one or more KDLR models. It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the invention, as claimed. BRIEF DESCRIPTION OF THE DRAWINGS The accompanying drawings, which are incorporated in and constitute a part of this disclosure, illustrate exemplary embodiments and, together with the description, serve to explain the disclosed principles: FIG. 1 illustrates an exemplary block diagram of a system for anomaly detection in industrial Internet of Things (IoT) according to an embodiment of the present disclosure. FIG. 2 is an exemplary flow diagram illustrating a method for anomaly detection in industrial Internet of Things (IoT) using the system of FIG. 1 according to an embodiment of the present disclosure. FIG. 3 is a graphical representation depicting correlation analysis of the raw sensor data according to an embodiment of the present disclosure. FIG. 4A is a graphical representation depicting comparative Spindle load estimation results for Pocket Rough operation using different regression methods according to an embodiment of the present disclosure. FIG. 4B is a graphical representation depicting comparative Spindle load estimation results for Pocket Rough operation using different regression methods according to an embodiment of the present disclosure. FIG. 5 is a graphical representation depicting Mean Squared Error (MSE) for spindle load estimate according to an embodiment of the present disclosure. DETAILED DESCRIPTION OF EMBODIMENTS Exemplary embodiments are described with reference to the accompanying drawings. In the figures, the left-most digit(s) of a reference number identifies the figure in which the reference number first appears. Wherever convenient, the same reference numbers are used throughout the drawings to refer to the same or like parts. While examples and features of disclosed principles are described herein, modifications, adaptations, and other implementations are possible without departing from the spirit and scope of the disclosed embodiments. It is intended that the following detailed description be considered as exemplary only, with the true scope and spirit being indicated by the following claims. As mentioned above, data analytics have evolved over time and so is the reliability on analytics. In order to understand the data and make effective use of them, it is necessary to have appropriate data-driven methods to capture the nature of data. With this understanding, one can carry out different inference tasks like, classification, clustering and regression. Restricting to data modeling, many of the existing techniques from the data analysis community can be tried. For any data analysis, it is necessary to identify dependent variables also known as responses or predicands, and independent variables or predictors. The relationship between the predictors and responses is described by a regression function (e.g., refer ‘Wolfgang Hardle, Applied Nonparametric Regression, Cambridge University Press, 1990’). This function approximation approach is useful to model the data, to characterize different states of the data generating source. For example, for Computer Numerical Control (CNC) machines; given labeled data for normal operation, one can model the CNC machine performance (appropriate dependent variables) as a function of several other independent variables which can be from different sensors. The regression function learnt could then be used to assess the performance of CNC machine for an unknown test input. If the estimated and actual response is similar it depicts normal behavior else a change is detected. If the change is significant, it can be associated with abnormal/anomalous behavior with the help of additional information. Since there is no “One fits All” solution for carrying out the modeling addressing different varieties of data, one needs to have multiple techniques. Signal processing can provide systematic framework to arrive at new data-driven models. In the present disclosure, a kernel dictionary learning framework for carrying out regression is proposed. Since its introduction dictionary learning has been used profusely for analysis and synthesis problems especially arising in image processing (e.g., refer (i) ‘I. Tosic and P. Frossard, “Dictionary learning,” IEEE Signal Processing Magazine, vol. 28, no. 2, pp. 27–38, March 2011’, (ii) ‘Chenglong Bao, Hui Ji, Yuhui Quan, Zuowei Shen, undefined, undefined, undefined, and undefined, “Dictionary learning for sparse coding: Algorithms and convergence analysis,” IEEE Transactions on Pattern Analysis & Machine Intelligence, vol. 38, no. 7, pp. 1356–1369, 2016’, (iii) ‘Julien Mairal, Jean Ponce, Guillermo Sapiro, Andrew Zisserman, and Francis R. Bach, “Supervised dictionary learning,” in Advances in Neural Information Processing Systems 21, D. Koller, D. Schuurmans, Y. Bengio, and L. Bottou, Eds., pp. 1033–1040. Curran Associates, Inc., 2009’, (iv) ‘G. Chen and D. Needell, “Compressed sensing and dictionary learning,” Finite Frame Theory: A Complete Introduction to Overcompleteness, vol. 73, 2016’). The basic formulation is given by way of illustrative expression below: X=DZ (1) where X?R^N is the data that is represented by the learnt dictionary of basis D?R^(N*K) containing atoms as its columns and the learnt coefficients Z?R^K. The origin of dictionary learning lies in matrix factorization (e.g., refer ‘Daniel D. Lee and H. Sebastian Seung, “Learning the parts of objects by nonnegative matrix factorization,” Nature, vol. 401, pp. 788–791, 1999’) and sparse coding (e.g., refer ‘Bruno A. Olshausen and David J. Field, “Sparse coding with an overcomplete basis set: A strategy employed by v1?,” Vision Research, vol. 37, no. 23, pp. 3311 – 3325, 1997’). The standard matrix factorization problem can be solved by the Method of Optimal Directions (MOD) (e.g., refer ‘K. Engan, S. O. Aase, and J. Hakon Husoy, “Method of optimal directions for frame design,” in 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258), 1999, vol. 5, pp. 2443–2446 vol.5’) by alternvatively solving for the two variables: (_D,Z^min)?X-DZ?_F^2 (2) For sparse coding problems, where Z is constrained to be sparse, K-SVD algorithm (e.g., refer ‘Michal Aharon, Michael Elad, and Alfred M. Bruckstein, “Ksvd and its non-negative variant for dictionary design,” in Proceedings of the SPIE conference wavelets, 2005, pp. 327–339’) is utilized. This solves for a dictionary using below illustrative expression such that the coefficients are sparse. (_D,Z^min)?X-DZ?_F^2 s.t.?Z?_0=t (3) where ?Z?_0 is the usual l_0 sparsity measure which counts the number of non-zero elements in Z. This optimization results in a sparse representation of data which is learning using maximum t non-zero entries of Z. The unsupervised version of sparse coding has been used profusely for solving inverse problems for example, denoising, deblurring, inpainting, reconstructions and the like (e.g., refer (i) ‘Yann LeCun, “Machine Learning and Pattern Recognition: Unsupervised Learning Sparse Coding,” https://www.cs.nyu.edu/˜yann/2010f-G22-2565-001/diglib/lecture12-sparse-coding.pdf/, 2010’, (ii) ‘Julien Mairal, Francis Bach, Jean Ponce, and Guillermo Sapiro, “Online dictionary learning for sparse coding,” in Proceedings of the 26th Annual International Conference on Machine Learning, New York, NY, USA, 2009, ICML ’09, pp. 689–696, ACM’, and (iii) ‘J. Mairal, M. Elad, and G. Sapiro, “Sparse representation for color image restoration,” Trans. Img. Proc., vol. 17, no. 1, pp. 53–69, Jan. 2008’. Researchers, particularly working in machine learning domains, have used dictionary learning for feature extraction. But instead of using the basic unsupervised formulation discriminative penalties are added to above equation (3) for improved analysis (e.g., refer (i) ‘Bruno A. Olshausen and David J. Field, “Sparse coding with an overcomplete basis set: A strategy employed by v1?,” Vision Research, vol. 37, no. 23, pp. 3311 – 3325, 1997’, and (ii) ‘Ivana Tosic and Pascal Frossard, “Dictionary learning: What is the right representation for my signal?,” IEEE Signal Processing Magazine, vol. 28, no. 2, pp. 27–38, 2011’). In standard dictionary learning, a dense dictionary needs to be learnt from the data. There are two issues with this approach firstly, using limited data leads to overfitting, and secondly large scale problems cannot be handled owing to explicit computations with the dictionary. To address both these issues doubly sparse dictionary learning was implemented (e.g., refer ‘Ron Rubinstein, Michael Zibulevsky, and Michael Elad, “Double sparsity: learning sparse dictionaries for sparse signal approximation,” IEEE Trans. Signal Processing, vol. 58, no. 3, pp. 1553–1564, 2010’). The basic idea was to express the dictionary as an arbitrary sparse linear combination of fixed basis (e.g., wavelet DCT Fourier and the like.). The model is expressed by way of following illustrative equation as: X=?AZ (4) Here, ? is a combination of pre-defined basis, A is the combining weights that picks up appropriate basis from ? to form the dictionary. Here, A and Z both need to be learnt. This is framed as: (_A,Z^min)?X-?AZ?_F^2 s.t.?Z?_0=t and ?A?_0=? (5) The concept of kernel dictionary learning from research (e.g., refer ‘H. V. Nguyen, V. M. Patel, N. M. Nasrabadi, and R. Chellappa, “Kernel dictionary learning,” in 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), March 2012, pp. 2021–2024’) is somewhat related to doubly sparse dictionary learning; both of them express the dictionary as a linear combination of a fixed basis. In kernel dictionary learning, the fixed basis is a non-linear combination of the data. It is essentially a non-linear mapping from R^N to high dimensional feature space F. So, instead of expressing the original data, as in standard sparse dictionary learning, kernel dictionary learning expresses the non-linear version of the data in terms of a dictionary formed by a linear combination of the non-linear version of the data. Mathematically, this is expressed as found in the research by H. V. Nguyen et al (e.g., refer above): f(X)=f(X)AZ (6) where f(X) is the matrix obtained by transforming X to a high dimensional feature space F. The learning algorithm is expressed as: (_A,Z^min)?f(X)-f(X)AZ?_F^2 s.t.?Z?_0=t (7) The optimization problem in above equation (7) is solved using alternate minimization like in the case of dictionary learning. Few attempts have been made that make use of kernel dictionary learning framework for image classification tasks (e.g., refer (i) ‘H. Van Nguyen, V. M. Patel, N. M. Nasrabadi, and R. Chellappa, “Design of non-linear kernel dictionaries for object recognition,” IEEE Transactions on Image Processing, vol. 22, no. 12, pp. 5123–5135, Dec 2013’ and (ii) ‘A. Golts and M. Elad, “Linearized kernel dictionary learning,” IEEE Journal of Selected Topics in Signal Processing, vol. 10, no. 4, pp. 726–739, June 2016’). Results demonstrate that exploiting non-linear sparsity via learning dictionaries in a non-linear feature space provides superior performance compared to their linear counterparts and kernel PCA. Recently, some works have also reported the use of a unified objective function to jointly learn the dictionary and sparse linear classifier/regressor (e.g., refer (i) ‘Z. Jiang, Z. Lin, and L. S. Davis, “Label consistent k-svd: Learning a discriminative dictionary for recognition,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 35, no. 11, pp. 2651–2664, Nov 2013’, (ii) ‘R. Ganti and R. M. Willett, “Sparse Linear Regression With Missing Data,” ArXiv e-prints, Mar. 2015’, and (iii) ‘Ziyu Wang, Jianxiong Liu, and Jing-Hao Xue, “Joint sparse model-based discriminative k-svd for hyperspectral image classification,” Signal Processing, vol. 133, no. Supplement C, pp. 144 – 155, 2017’). In the work by Z. Jiang et al, a label consistent K-SVD algorithm has been presented to learn a discriminative dictionary for sparse coding for object recognition tasks. The work done by R. Ganti et al., has presented a fast method for sparse regression in the presence of missing data. Both these methods had better performance over other sparse coding based techniques as the label information was utilized for learning via joint optimization. Motivated by the method of joint optimization and the need to handle non-linearities in the data, in the present disclosure, Kernel Dictionary Learning framework for Regression (KDLR) is proposed. There remains a need for kernel dictionary learning based regression where the regression formulation is learnt within the dictionary learning framework. This technique is shown to outperform the traditional Linear Regression (LR), Kernel Regression (KR), Least Absolute Shrinkage and Selection Operator (LASSO) and Dictionary Learning based techniques for regression (DLR) especially when the data (times series) exhibits certain complex non-linear evolution. To elaborate on the proposed framework and demonstrate its applicability for regression analysis, the present disclosure implements a kernelized dictionary learning framework for carrying out regression to model signals having a complex nonlinear nature. A joint optimization is carried out where the regression weights are learnt together with the dictionary and coefficients. Relevant formulation and dictionary building steps are provided. To demonstrate the effectiveness of the proposed technique, elaborate experimental results using different real-life datasets are herein described by way of examples. Referring now to the drawings, and more particularly to FIG. 1 through 5, where similar reference characters denote corresponding features consistently throughout the figures, there are shown preferred embodiments and these embodiments are described in the context of the following exemplary system and/or method. FIG. 1 illustrates an exemplary block diagram of a system 100 for anomaly detection in industrial Internet of Things (IoT) according to an embodiment of the present disclosure. The system 100 may also be referred as an Anomaly Detection System (ADS) hereinafter. In an embodiment, the system 100 includes one or more processors 104, communication interface device(s) or input/output (I/O) interface(s) 106, and one or more data storage devices or memory 102 operatively coupled to the one or more processors 104. The memory 102 comprises one or more modules 108 and the database 110. The one or more processors 104 that are hardware processors can be implemented as one or more microprocessors, microcomputers, microcontrollers, digital signal processors, central processing units, state machines, logic circuitries, and/or any devices that manipulate signals based on operational instructions. Among other capabilities, the processor(s) is configured to fetch and execute computer-readable instructions stored in the memory. In an embodiment, the system 100 can be implemented in a variety of computing systems, such as laptop computers, notebooks, hand-held devices, workstations, mainframe computers, servers, a network cloud and the like. The I/O interface device(s) 106 can include a variety of software and hardware interfaces, for example, a web interface, a graphical user interface, and the like and can facilitate multiple communications within a wide variety of networks N/W and protocol types, including wired networks, for example, LAN, cable, etc., and wireless networks, such as WLAN, cellular, or satellite. In an embodiment, the I/O interface device(s) can include one or more ports for connecting a number of devices to one another or to another server. The memory 102 may include any computer-readable medium known in the art including, for example, volatile memory, such as static random access memory (SRAM) and dynamic random access memory (DRAM), and/or non-volatile memory, such as read only memory (ROM), erasable programmable ROM, flash memories, hard disks, optical disks, and magnetic tapes. The database 110 may store information pertaining to raw sensor data obtained from one or more sensors integrated within an IoT machine (e.g., a Computer numerical control (CNC) machine, and the like). Further, the database 110 may store information pertaining to pre-processing of raw sensor data (e.g., filtering non-essential data from the raw sensor data which involves extraction of critical data parameters). Furthermore, the database 110 includes information pertaining to Kernel Dictionary Learning Regression (KDLR) models generation, learning of the KDLR models, and parameters associated thereof. In an embodiment, the parameters include, a Kernel dictionary (A), one or more coefficients (Z), and one or more regression weights (w) which are jointly optimized so as to obtain learnt KDLR models that are used for predicting anomalies in the IoT machines. Moreover, the database 110 stores information pertaining to inputs fed to the system 100 and/or outputs generated by the system specific to the methodology described herein. FIG. 2, with reference to FIG. 1, is an exemplary flow diagram illustrating a method for anomaly detection in industrial Internet of Things (IoT) using the system 100 of FIG. 1 according to an embodiment of the present disclosure. In an embodiment, the system 100 comprises one or more data storage devices or the memory 102 operatively coupled to the one or more hardware processors 104 and is configured to store instructions for execution of steps of the method by the one or more processors 104. The steps of the method of the present disclosure will now be explained with reference to the components of the system 100 as depicted in FIG. 1, and the flow diagram of FIG. 2. In an embodiment of the present disclosure, at step 202, the one or more hardware processors 104 receives raw sensor data from one or more sensors pertaining to an Internet of Things (IoT) machine (e.g., a CNC machine), wherein the one or more sensors are comprised in the IoT machine. In an embodiment, the raw sensor data may comprises but are not limited to, at least one of one or more time stamps, one or more statuses of the IoT machine, program and operation information pertaining to the IoT machine, one or more tool-axis positions of the IoT machine, spindle speed, axis feed, servo loads, spindle load, and information specific to component identifiers and stages of operations performed by the IoT machine. Below Table 1 is sample information pertaining to the IoT machine (raw sensor data collected from one or more sensors comprised in the CNC machine) illustrated by way of example: Table 1 Time Isrunning Motion Axis1_MachinePosition Feed_data Speed_Data Axis 1 Servo load Axis 2 Servo load Axis 3 Servo load Spindle 1 – load 0:0:0 3 1 -394698 1282 2499 11 16 38 0 0:0:1 3 0 -390165 323 2499 4 4 40 0 0:0:2 3 1 -392766 388 2499 4 4 42 0 0:0:3 3 1 -300841 2000 2499 11 3 40 0 … … … … … … … … … … 0:0:16 3 1 -35000 2000 2499 0 10 41 0 In an embodiment of the present disclosure, at step 204, the one or more hardware processors 104 extract critical data parameters from the raw sensor data by performing correlation analysis on the raw sensor data, wherein the critical data parameters are specific to performance of the IoT machine. In an embodiment of the present disclosure, the correlation analysis comprises identifying one or more relationships between one or more control parameters and one or more operation quality parameters from the extracted critical data parameters. In an embodiment, the critical data parameters may comprise but are not limited to, one or more control parameters, one or more operation quality parameters, or combinations thereof. FIG. 3, with reference to FIGS. 1 through 2, is a graphical representation depicting correlation analysis of raw sensor data according to an embodiment of the present disclosure. FIG. 3 depicts an output parameter (e.g., spindle load) and 2 candidate input parameters for predicting spindle load for an operation (e.g., say N7) wherein the candidate input parameters are ‘Fz’ (feed rate along z-axis) and Axis3_DistanceToGo’ (distance to be traveled along z-axis). It is evident by the graphical representation of FIG. 3 that ‘Spindle load’ likely has a correlation with ‘Fz’ but very unlikely with ‘Axis3_DistanceToGo’. This observation was further confirmed upon calculating Pearson’s correlation coefficient wherein Pearons_corr_coef( "Spindle Load","Fz") = 0.31 and Pearons_corr_coef( "Spindle Load","Axis3_DistanceToGo" ) = 0.03. The steps 202 and 204 are described by way of following use case scenario: Given a multi-variate data of N samples, let X? R^(L*N) represent independent variables of feature vector length L and y? R^N represent dependent variable. The present disclosure and system 100 associated thereof incorporates a ridge regression penalty into the kernel dictionary learning framework for carrying out a joint optimization where the dictionary atoms, coefficients and the regression weights are learnt together. In an embodiment the raw sensor data comprising Spindle Speed, axis speed and the like are referred as independent variables. Kernelization takes care of the non-linearities in the system and hence a simple linear regression formulation is sufficient after the transformation. Mathematically, the proposed formulation is given as: (_A,Z,w^min)?f(X)-f(X)AZ?_F^2+??y-wZ?_2^2 +µ?w?_2^2 (8) where f(X)=[f(x_1 ),…f(x_N )], A?R^(N*K) is the atom representation dictionary, Z?R^(k*N) are the coefficients and w?R^K are the regression weights. It is to be noted that in the above equation (8), sparsity term is not included and the present disclosure attempts the problem of generating KDLR regression models. Like any machine learning technique, the proposed technique of the present disclosure has a training phase where, the dictionary atoms, coefficients and regression weights are learnt a test where, the learnt dictionary and regression weights are used for estimating the response variable. Referring back to steps of FIG. 2, in an embodiment of the present disclosure, at step 206, the one or more hardware processors 104 generating one or more unique Kernel Dictionary Learning Regression (KDLR) models corresponding to each operation performed by the IoT machine by kernelizing at least a portion of the extracted critical data parameters by using a Kernel Dictionary Learning Regression (KDLR) technique. As mentioned above, each of the one or more unique KDLR models comprises a Kernel dictionary (A), one or more coefficients (z), and one or more regression weights (w) wherein the one or more generated KDLR models are learnt by performing joint optimization of the Kernel dictionary (A), the one or more coefficients (z), and the one or more regression weights (w) to obtain one or more updated parameters comprising an updated Kernel dictionary (A), one or more updated coefficients (z), and one or more updated regression weights (w).In an embodiment, the Kernel dictionary (A), the one or more coefficients (z), and one or more regression weights (w) are jointly optimized until a stopping criterion is reached, based on one or more inputs fed to, and one or more corresponding generated outputs by the IoT machine in a training phase. In an embodiment the stopping criterion comprises (i) a convergence is reached, (ii) a maximum iteration is attained, or (iii) combinations thereof. Convergence refers to a value (e.g., pre-defined value/threshold) that the Kernel dictionary (A), the one or more coefficients (z), and one or more regression weights (w) should reach. In an embodiment, the one or more inputs fed to the IoT machine (e.g., the CNC machine) may comprise, but are not limited to, tool position, feed rate, and the like and wherein the one or more outputs generated by the IoT machine may comprise but are not limited to, servo load, spindle load, and the like. In an embodiment of the present disclosure, the step of kernelizing at least a portion of the extracted critical data parameters by using a Kernel Dictionary Learning Regression (KDLR) technique to obtain one or more unique KDLR models comprises kernelizing one or more complex non-linear relationships between the one or more control parameters and the one or more operation quality parameters. The step of kernelizing comprises transforming the original data (raw sensor data) to high dimensional space (not shown in FIGS). Different kernels (e.g., Gaussian, uniform, tricube, cosine kernels, and the like) can be used to transform the raw sensor data to higher dimensional space, thereby identifying one or more complex non-linear relationships. Training Phase: An illustrative example of the training phase is described below where joint optimization is performed for the Kernel dictionary (A), the one or more coefficients (z), and one or more regression weights to solve equation (8) by solving sub-problems as depicted in below example expressions: A?(_A^min)?f(X)-f(X)AZ?_F^2 (9) Z?(_Z^min)?f(X)-f(X)AZ?_F^2+??y-wZ?_2^2 (10) w?(_w^min)?y-wZ?_2^2+µ?w?_2^2 (11) Solving for A using equation (9) results as given below: A=Z^T ?(ZZ^T)?^(-1) (12) The update for Z is obtained by taking the derivative of the expression in (10) and equating it to 0. After mathematical manipulations, following modified normal equation is arrived: A^T K(X,X)A+?w^T w)Z=A^T K(X,X)+?w^T y (13) Here, K(X,X)?R^(N*N) is the kernel matrix of finite dimension whose elements are computed from: k(x_i,x_j )=f(x_j )^T f(x_j ) ?_i,j=1,….,N. Above equation (13) has an analytic solution, but for large volume of data, the kernel matrix is huge so it is not recommended to invert explicitly. However, a few steps of Conjugate Gradient (CG) can be used to solve equation (13) instead. The update for the regression weights upon solving the sub-problems for A and Z, is given as: w(?ZZ^T+µI)=?yZ^T (14) where I is an all ones matrix of size K*K. Upon completing the training and once the KDLR models are learnt with the updated parameters (e.g., an updated Kernel dictionary (A), one or more updated coefficients (Z), and one or more updated regression weights (w), at step 208, the one or more hardware processors 104 receive (during a test phase) an input test sensor data pertaining to one or more corresponding operations performed by the IoT machine and one or more unique signatures specific to one or more critical data parameters from the input test sensor data are predicted by using the one or more learnt KDLR models at step 210. The predicted one or more unique signatures are specific to the one or more corresponding operations performed by the IoT machine. In an embodiment of the present disclosure, at step 212, the one or more hardware processors 104 detect one or more anomalies based on a comparison of (i) the predicted one or more unique signatures specific to one or more corresponding operations performed by the IoT machine and (ii) one or more actual signatures generated for an actual test sensor data. The detected one or more anomalies are mapped with one or more events of the IoT machine to obtained mapped data, wherein information pertaining the one or more events are stored in the database 108. In an embodiment of the present disclosure, in case the anomalies are detected in an IoT machine such as a CNC machine, the one or more events may comprise but are not limited to, tool change, alarms, maintenance data, and the like. Using the mapped data, the one or more hardware processors 104 further predict one or more conditions pertaining to the IoT machine (e.g., the CNC machine). In an embodiment, the one or more conditions that are predicted comprise but are not limited to, tool and machine conditions. For example, conditions of the machine may comprise but are not limited to, tool wear, tool breakage, machine consuming more power for certain operations which may be due to faults with different motors, such as axis servo motors, spindle motors etc., changes due to change in raw material of the workpiece, change in cutting tool material, external environmental conditions, and the like. Based on the mapped data and the one or more predicted conditions, the one or more hardware processors 104 are further configured by the instructions to generating one or more alerts. For example, alerts may comprise but are not limited to, notifications to instruct operator(s) when to change tool(s) so that quality of end product(s) is/are not affected, check for tool condition, other factors which could have led to change as described above and the like. Further the one or more hardware processors 104 are configured by the instructions to tune, by using an exhaustive grid search technique (known in the art technique), one or more hyper parameters (e.g., ?,and µ) associated with the joint optimization for generating the one or more KDLR models. An illustrative example of the test phase is described below: Test Phase: During testing, given a new test sample (e.g., the input test sensor data) x_test, corresponding dependent variables or output y ^_test (e.g., one or more unique signatures specific to one or more critical data parameters from the input test sensor data) by first computing corresponding feature z_test. The model is expressed as following example equation: f?(x?_test)=f(X)Az_test (15) Note that the updated kernel dictionary A does not change from the training phase; it is still defined by the linear combination of the non-linear version of training data. The solution for z_test is formulated as: (_z_test^min)?f(x_test )-f(X)Az_test ?_F^2 (16) Following the derivation as before, the following is observed: A^T K(X,X)Az_test=A^T K?(x_test,X)?^T (17) where, K(x_test,X)=[k(x_test,x_1 ),… k(x_test,x_N )]. Either equation (17) can be solved by computing the inverse (for sub-problems / small problems) or solve it efficiently using CG. Once the feature z_test is obtained, it is multiplied by the learnt regression weights to get the y ^_test. y ^_test=wz_test (18) The pseudo code of the proposed KDLR technique is presented as example below: Input: Set of training data, X= X_train,y=y_train, and K (size of dictionary), parameters (?,µ), and kernel function k, test data x_test Output: Learnt dictionary X, weight vector w, estimated output y ^_test Initialization: Set Z_0 to random matrix with real number between 0 and 1 drawn from a uniform distribution, w_0=y/Z and A_0=O, iteration i=1 1: Procedure 2: loop: Repeat until convergence (or fixed number of iterations Matrix r) 3: A_i?Z_(i-1)^T ?(Z_(i-1) Z_(i-1)^T)?^(-1) 4: Normalize each column in A_i to a unit norm 5: Z_i? update using A_i and w_(i-1) using equation (13) 6: w_i??yZ_i^T ?(?Z_i Z_i^T+µI)?^(-1) 7: i ?i+1 8: if ?A_i-A_(i-1) ?_F

Documents

Application Documents

#	Name	Date
1	201821014787-STATEMENT OF UNDERTAKING (FORM 3) [18-04-2018(online)].pdf	2018-04-18
2	201821014787-REQUEST FOR EXAMINATION (FORM-18) [18-04-2018(online)].pdf	2018-04-18
3	201821014787-FORM 18 [18-04-2018(online)].pdf	2018-04-18
4	201821014787-FORM 1 [18-04-2018(online)].pdf	2018-04-18
5	201821014787-FIGURE OF ABSTRACT [18-04-2018(online)].jpg	2018-04-18
6	201821014787-DRAWINGS [18-04-2018(online)].pdf	2018-04-18
7	201821014787-COMPLETE SPECIFICATION [18-04-2018(online)].pdf	2018-04-18
8	201821014787-FORM-26 [22-05-2018(online)].pdf	2018-05-22
9	201821014787-Proof of Right (MANDATORY) [23-05-2018(online)].pdf	2018-05-23
10	Abstract1.jpg	2018-08-11
11	201821014787-ORIGINAL UNDER RULE 6 (1A)-300518.pdf	2018-08-11
12	201821014787-FER.pdf	2020-07-23
13	201821014787-OTHERS [23-01-2021(online)].pdf	2021-01-23
14	201821014787-FER_SER_REPLY [23-01-2021(online)].pdf	2021-01-23
15	201821014787-COMPLETE SPECIFICATION [23-01-2021(online)].pdf	2021-01-23
16	201821014787-CLAIMS [23-01-2021(online)].pdf	2021-01-23
17	201821014787-US(14)-HearingNotice-(HearingDate-20-12-2023).pdf	2023-11-09
18	201821014787-FORM-26 [14-12-2023(online)].pdf	2023-12-14
19	201821014787-FORM-26 [14-12-2023(online)]-1.pdf	2023-12-14
20	201821014787-Correspondence to notify the Controller [14-12-2023(online)].pdf	2023-12-14
21	201821014787-Written submissions and relevant documents [01-01-2024(online)].pdf	2024-01-01
22	201821014787-PatentCertificate05-01-2024.pdf	2024-01-05
23	201821014787-IntimationOfGrant05-01-2024.pdf	2024-01-05

Search Strategy

1	Search_Strategy_Amended_201821014787AE_12-10-2021.pdf
2	Search_Strategy_201821014787E_16-07-2020.pdf