Abstract: Disclosed are a system, method and apparatus for classification of data in a machine learning system. In one aspect, a method for classification of data in a machine learning system through one or more computer processors is disclosed. Further, generating, through one or more computer processors, a data classifier using a first dataset and determining an accuracy value of the data classifier to achieve a predefined model accuracy threshold. Still further, iterating, through one or more computer processors, calibration of the first dataset based on a set of parameters till the accuracy value matches or exceeds the predefined model accuracy threshold value. Further, the calibration comprises a user input to indicate a correctness of a presented subset of data from a second dataset and using the above to generate an enhanced data classifier for the classification of data.
WE CLAIM:
1. A computer implemented method for classification of data in a machine learning system comprising:
generating a data classifier using a first dataset;
determining an accuracy value of the data classifier to achieve a predefined model accuracy threshold;
iterating calibration of the first dataset based on a set of parameters till the accuracy value matches or exceeds the predefined model accuracy threshold value; wherein the calibration comprises at least a user input to indicate a correctness of a presented subset of data from a second dataset;
generating, using the calibration, an enhanced data classifier for the classification of data.
2. The method of claim 1, wherein the data classifier is generated based on a training data available for the first dataset.
3. The method of claim 1, wherein the calibration is performed based on the set of parameters comprising one or more of a certainty score, a click utilization value, a certainty threshold value, a model accuracy value, an average machine learning time and an average annotation time or a combination thereof.
4. The method of the claim 4, wherein the certainty threshold based on set of parameters is dynamically adjusted until the predefined model accuracy threshold is achieved
5. The method of claim 1 wherein the user input received over the presented subset of data is used to generate the click utilization and the average annotation time
23
6. The method of claim 1, wherein the presented subset of data is generated from the second dataset based on the certainty threshold value and a computed candidate vector distance.
7. The method of claim 1, wherein the iterating of calibration further comprises;
coupling the presented subset of data with the user input and the training data till the predefined model accuracy threshold is achieved.
8. A data classification system comprising:
a processor; and
a memory coupled to the processor configured to be capable of executing programmed instructions comprising and stored in the memory to;
generate a data classifier using a first dataset;
determine an accuracy value of the data classifier to achieve a predefined model accuracy threshold;
iterate a calibration of the first dataset based on a set of parameters till the accuracy value matches or exceeds the predefined model accuracy threshold value; wherein the calibration comprises at least a user input to indicate an /correctness of a presented subset of data from a second dataset;
generate, using the calibration, an enhanced data classifier for the classification of data.
9. The system of claim 9, wherein the wherein the processor is further
configured to be capable of executing the stored programmed instructions to
generated the data classifier based on a training data available for the first dataset.
24
10. The system of claim 9, wherein the processor is further configured to be capable of executing the stored programmed instructions to perform the calibration is performed based on the set of parameters comprising one or more of a certainty score, a click utilization value, a certainty threshold value, a model accuracy value, an average machine learning time and an average annotation time or a combination thereof.
11. The system of the claim 12, wherein the processor is further configured to be capable of executing the stored programmed instructions to dynamically adjust the certainty threshold based on set of parameters until the predefined model accuracy threshold is achieved
12. The system of claim 9, wherein the processor is further configured to be capable of executing the stored programmed instructions to generate the click utilization and the average annotation time, based on the user input received over the presented subset of data.
13. The system of claim 9 wherein the processor is further configured to be capable of executing the stored programmed instructions to generate the presented subset of data from the second dataset based on the certainty threshold value and a computed candidate vector distance.
14. The system of claim 9, wherein the processor is further configured to be capable of executing the stored programmed instructions to iterate of calibration, further comprising;
coupling the presented subset of data with the user input and the training data till the predefined model accuracy threshold is achieved.
Dated this on April 02,2019 ^_^
Swetha S N
Of K& S Partners
25 Agent for the Ap,
IN/PA-2123
| # | Name | Date |
|---|---|---|
| 1 | 201941013239-Response to office action [12-12-2023(online)].pdf | 2023-12-12 |
| 1 | 201941013239-STATEMENT OF UNDERTAKING (FORM 3) [04-02-2019(online)].pdf | 2019-02-04 |
| 2 | 201941013239-FORM 1 [04-02-2019(online)].pdf | 2019-02-04 |
| 2 | 201941013239-FER_SER_REPLY [07-03-2022(online)].pdf | 2022-03-07 |
| 3 | 201941013239-PETITION UNDER RULE 137 [07-03-2022(online)].pdf | 2022-03-07 |
| 3 | 201941013239-DRAWINGS [04-02-2019(online)].pdf | 2019-02-04 |
| 4 | 201941013239-FER.pdf | 2021-10-17 |
| 4 | 201941013239-DECLARATION OF INVENTORSHIP (FORM 5) [04-02-2019(online)].pdf | 2019-02-04 |
| 5 | 201941013239-FORM 18 [11-08-2020(online)].pdf | 2020-08-11 |
| 5 | 201941013239-COMPLETE SPECIFICATION [04-02-2019(online)].pdf | 2019-02-04 |
| 6 | 201941013239-FORM-26 [21-05-2019(online)].pdf | 2019-05-21 |
| 6 | 201941013239-Form 1 (Submitted on date of filing) [09-07-2020(online)].pdf | 2020-07-09 |
| 7 | Correspondence by Agent_Form 26_27-05-2019.pdf | 2019-05-27 |
| 7 | 201941013239-Power of Attorney [09-07-2020(online)].pdf | 2020-07-09 |
| 8 | 201941013239-Request Letter-Correspondence [09-07-2020(online)].pdf | 2020-07-09 |
| 8 | 201941013239-Proof of Right (MANDATORY) [15-10-2019(online)].pdf | 2019-10-15 |
| 9 | 201941013239-Request Letter-Correspondence [09-07-2020(online)].pdf | 2020-07-09 |
| 9 | 201941013239-Proof of Right (MANDATORY) [15-10-2019(online)].pdf | 2019-10-15 |
| 10 | 201941013239-Power of Attorney [09-07-2020(online)].pdf | 2020-07-09 |
| 10 | Correspondence by Agent_Form 26_27-05-2019.pdf | 2019-05-27 |
| 11 | 201941013239-FORM-26 [21-05-2019(online)].pdf | 2019-05-21 |
| 11 | 201941013239-Form 1 (Submitted on date of filing) [09-07-2020(online)].pdf | 2020-07-09 |
| 12 | 201941013239-FORM 18 [11-08-2020(online)].pdf | 2020-08-11 |
| 12 | 201941013239-COMPLETE SPECIFICATION [04-02-2019(online)].pdf | 2019-02-04 |
| 13 | 201941013239-FER.pdf | 2021-10-17 |
| 13 | 201941013239-DECLARATION OF INVENTORSHIP (FORM 5) [04-02-2019(online)].pdf | 2019-02-04 |
| 14 | 201941013239-PETITION UNDER RULE 137 [07-03-2022(online)].pdf | 2022-03-07 |
| 14 | 201941013239-DRAWINGS [04-02-2019(online)].pdf | 2019-02-04 |
| 15 | 201941013239-FORM 1 [04-02-2019(online)].pdf | 2019-02-04 |
| 15 | 201941013239-FER_SER_REPLY [07-03-2022(online)].pdf | 2022-03-07 |
| 16 | 201941013239-STATEMENT OF UNDERTAKING (FORM 3) [04-02-2019(online)].pdf | 2019-02-04 |
| 16 | 201941013239-Response to office action [12-12-2023(online)].pdf | 2023-12-12 |
| 1 | SearchHistoryE_06-08-2021.pdf |
| 2 | AmendedSearchAE_13-09-2022.pdf |