Sign In to Follow Application
View All Documents & Correspondence

A System And Method For Optical Character Recognition

Abstract: A method and a device (102) for creating and training machine learning models is disclosed. In an embodiment, a method for training a machine learning model for identifying entities from data includes creating (302) a first plurality of clusters from a first plurality of data samples in a first dataset (204) and a second plurality of clusters from a second plurality of data samples in a second dataset (206). The method further includes determining (304) a rank for each of the first plurality of clusters and a rank for each of the second plurality of clusters (306). The method includes retraining (308) the machine learning model using at least one of the first plurality of clusters weighted based on the rank determined for each of the first plurality of clusters and at least one of the second plurality of clusters weighted based on the rank determined for each of the second Dluralitv of clusters.

Get Free WhatsApp Updates!
Notices, Deadlines & Correspondence

Patent Information

Application #
Filing Date
28 September 2018
Publication Number
14/2020
Publication Type
INA
Invention Field
ELECTRONICS
Status
Email
Parent Application

Applicants

L&T TECHNOLOGY SERVICES LIMITED
DLF IT SEZ PARK, 2ND FLOOR - BLOCK 3, 1/124, MOUNT POONAMALLEE ROAD, RAMAPURAM, CHENNAI - 600 089.

Inventors

1. MRIDUL BALARAMAN
B 206,SVS PALMS2, CHENNAPANHALI MAIN ROAD, DODANEKUNDI BANGALORE, KARANATAKA, INDIA, 560037.
2. MADHUSUDAN SINGH
B-603, AJMERA STONE PARK, 1ST CROSS, ELETRONIC CITY-1 BANGALORE, KARANATAKA, INDIA, 560100.
3. VIDYA SURESH
#40, RAMACHANDRA GARDENS,2ND CROSS, SHANTHI LAYOUT, RAMAMURTHY NAGAR, BANGALORE, KARNATAKA, INDIA, 560016.
4. BHUPINDER SINGH
2597,TYPE 2, DMW COLONY, PATIALA, PUNJAB, INDIA, 147003.
5. KARTIK NIVRITTIKADAM
509, NARAYANADRI BLOCK, S.V.R.S. BRINDAVANAM, VENKATESHWARA COLONY, SAROOR NAGAR, HYDERABAD, TELANGANA, INDIA 500 035.

Specification

1. A method for training of character models for a plurality of characters for optical character recognition (OCR), the method comprising:
storing at least one character model for each character in a database, each character model being trained on a first set and a second set to provide a probability of occurrence of a character in an image data wherein the first set contains a set of images of the character associated with a respective character model, and the second set contains a set of images of characters other than the character associated with the respective character model; and for each character model of the plurality of character models,
selecting a plurality of images from the first set of a character by:
generating clusters from the set of images in the first set using a clustering algorithm;
on prior presence of a character model corresponding to the character, ranking clusters of the first set based on an average probability of occurrence of images in each cluster and on absence of a character model corresponding to the character, providing a predefined rank to each cluster; and
selecting one or more images from each first set cluster weighted by rank of the cluster, selecting a plurality of images from a second set of the character by:
generating clusters from the set of images in the second set using a clustering algorithm;
on prior presence of a character model corresponding to the character, ranking clusters of the second set based on an average probability of occurrence of images in each cluster and on absence of a character model corresponding to

the character, ranking each cluster using a distance metric from the first set cluster; and
selecting one or more images from each cluster in the second set weighted by rank of the cluster.
training a new character model for the character based on the selected images from the first set and second set of the character; and
updating the database with the new character model.
2. The method as claimed in claim 1 wherein the image data is obtained by applying an OCR process to a text of a document.
3. The method as claimed in claim 1 wherein the database is updated with the new trained model on absence of a prior model corresponding to the character.
4. The method as claimed in claim 2 wherein the database is updated with the new trained model if a performance of the new character model is greater than performance of a prior model.
5. The method as claimed in claim 1 wherein the first set contains a set of samples of the character of varied font, style and size.
6. A system for optical character recognition (OCR) for a plurality of characters using a plurality of character models in a database, the system comprising:
a data store configured to store at least one character model for each character in a
database, each character model being trained on a first set and a second set to provide a
11

probability of occurrence of a character in an image data wherein the first set contains a set of images of the character associated with a respective character model, and the second set contains a set of images of characters other than the character associated with the respective character model; and
an image selector module configured to perform:
for each character model of the plurality of character models,
selecting a plurality of images from the first set of a character by:
generating clusters from the set of images in the first set using a clustering algorithm;
on prior presence of a character model corresponding to the character, ranking clusters of the first set based on an average probability of occurrence of images in each cluster and on absence of a character model corresponding to the character, providing a predefined rank to each cluster; and
selecting one or more images from each first set cluster weighted by rank of the cluster, selecting a plurality of images from a second set of the character by:
generating clusters from the set of images in the second set using a clustering algorithm;
on prior presence of a character model corresponding to the character, ranking clusters of the second set based on an average probability of occurrence of images in each cluster and on absence of a character model corresponding to the character, ranking each cluster using a distance metric from the first set cluster; and
selecting one or more images from each cluster in the second set weighted by rank of the cluster; and

a training module configured to train a new character model for the character based on the selected images from the first set and second set of the character.
7. The system as claimed in claim 5 further comprising updating the database with the new character model.

Documents

Application Documents

# Name Date
1 201841036688-AMENDED DOCUMENTS [04-02-2025(online)].pdf 2025-02-04
1 201841036688-CLAIMS [27-12-2021(online)].pdf 2021-12-27
1 201841036688-Correspondence to notify the Controller [20-03-2025(online)].pdf 2025-03-20
1 201841036688-US(14)-HearingNotice-(HearingDate-09-12-2024).pdf 2024-11-12
1 Form5_As Filed_28-09-2018.pdf 2018-09-28
2 201841036688-AMENDED DOCUMENTS [04-02-2025(online)].pdf 2025-02-04
2 201841036688-CLAIMS [27-12-2021(online)].pdf 2021-12-27
2 201841036688-COMPLETE SPECIFICATION [27-12-2021(online)].pdf 2021-12-27
2 201841036688-FORM 13 [04-02-2025(online)].pdf 2025-02-04
2 Form3_As Filed_28-09-2018.pdf 2018-09-28
3 201841036688-COMPLETE SPECIFICATION [27-12-2021(online)].pdf 2021-12-27
3 201841036688-CORRESPONDENCE [27-12-2021(online)].pdf 2021-12-27
3 201841036688-FORM 13 [04-02-2025(online)].pdf 2025-02-04
3 201841036688-MARKED COPIES OF AMENDEMENTS [04-02-2025(online)].pdf 2025-02-04
3 Form1_As Filed_28-09-2018.pdf 2018-09-28
4 201841036688-CORRESPONDENCE [27-12-2021(online)].pdf 2021-12-27
4 201841036688-FER_SER_REPLY [27-12-2021(online)].pdf 2021-12-27
4 201841036688-MARKED COPIES OF AMENDEMENTS [04-02-2025(online)].pdf 2025-02-04
4 201841036688-RELEVANT DOCUMENTS [04-02-2025(online)].pdf 2025-02-04
4 Description Provisional_As Filed_28-09-2018.pdf 2018-09-28
5 Correspondence by Applicant_As Filed_28-09-2018.pdf 2018-09-28
5 201841036688-US(14)-HearingNotice-(HearingDate-09-12-2024).pdf 2024-11-12
5 201841036688-RELEVANT DOCUMENTS [04-02-2025(online)].pdf 2025-02-04
5 201841036688-OTHERS [27-12-2021(online)].pdf 2021-12-27
5 201841036688-FER_SER_REPLY [27-12-2021(online)].pdf 2021-12-27
6 Form1_After Filing_12-03-2019.pdf 2019-03-12
6 201841036688-US(14)-HearingNotice-(HearingDate-09-12-2024).pdf 2024-11-12
6 201841036688-OTHERS [27-12-2021(online)].pdf 2021-12-27
6 201841036688-FER.pdf 2021-10-17
6 201841036688-CLAIMS [27-12-2021(online)].pdf 2021-12-27
7 201841036688-CLAIMS [27-12-2021(online)].pdf 2021-12-27
7 201841036688-COMPLETE SPECIFICATION [27-12-2021(online)].pdf 2021-12-27
7 201841036688-Correspondence_12-03-2020.pdf 2020-03-12
7 201841036688-FER.pdf 2021-10-17
7 Correspondence By Applicant_Form1_12-03-2019.pdf 2019-03-12
8 201841036688-COMPLETE SPECIFICATION [27-12-2021(online)].pdf 2021-12-27
8 201841036688-CORRESPONDENCE [27-12-2021(online)].pdf 2021-12-27
8 201841036688-Correspondence_12-03-2020.pdf 2020-03-12
8 201841036688-Form18_Examination request_12-03-2020.pdf 2020-03-12
8 Form2 Title Page_Complete_27-09-2019.pdf 2019-09-27
9 201841036688-CORRESPONDENCE [27-12-2021(online)].pdf 2021-12-27
9 201841036688-FER_SER_REPLY [27-12-2021(online)].pdf 2021-12-27
9 201841036688-Form18_Examination request_12-03-2020.pdf 2020-03-12
9 Correspondence by Agent_Certified Copy of Priority Document_09-10-2019.pdf 2019-10-09
9 Form-1_After Provisional_27-09-2019.pdf 2019-09-27
10 201841036688-FER_SER_REPLY [27-12-2021(online)].pdf 2021-12-27
10 201841036688-OTHERS [27-12-2021(online)].pdf 2021-12-27
10 Correspondence by Agent_Certified Copy of Priority Document_09-10-2019.pdf 2019-10-09
10 Correspondence by Agent_Form-3_09-10-2019.pdf 2019-10-09
10 Drawing_After Provisional_27-09-2019.pdf 2019-09-27
11 Form3_As Filed_09-10-2019.pdf 2019-10-09
11 Description Complete_After Provisional_27-09-2019.pdf 2019-09-27
11 Correspondence by Agent_Form-3_09-10-2019.pdf 2019-10-09
11 201841036688-OTHERS [27-12-2021(online)].pdf 2021-12-27
11 201841036688-FER.pdf 2021-10-17
12 201841036688-Correspondence_12-03-2020.pdf 2020-03-12
12 201841036688-FER.pdf 2021-10-17
12 Abstract_After Provisional_27-09-2019.pdf 2019-09-27
12 Correspondence by Applicant_After Provisional_27-09-2019.pdf 2019-09-27
12 Form3_As Filed_09-10-2019.pdf 2019-10-09
13 Claims_After Provisional_27-09-2019.pdf 2019-09-27
13 Abstract_After Provisional_27-09-2019.pdf 2019-09-27
13 201841036688-Form18_Examination request_12-03-2020.pdf 2020-03-12
13 201841036688-Correspondence_12-03-2020.pdf 2020-03-12
14 201841036688-Form18_Examination request_12-03-2020.pdf 2020-03-12
14 Abstract_After Provisional_27-09-2019.pdf 2019-09-27
14 Claims_After Provisional_27-09-2019.pdf 2019-09-27
14 Correspondence by Agent_Certified Copy of Priority Document_09-10-2019.pdf 2019-10-09
14 Correspondence by Applicant_After Provisional_27-09-2019.pdf 2019-09-27
15 Form3_As Filed_09-10-2019.pdf 2019-10-09
15 Description Complete_After Provisional_27-09-2019.pdf 2019-09-27
15 Correspondence by Applicant_After Provisional_27-09-2019.pdf 2019-09-27
15 Correspondence by Agent_Form-3_09-10-2019.pdf 2019-10-09
15 Correspondence by Agent_Certified Copy of Priority Document_09-10-2019.pdf 2019-10-09
16 Correspondence by Agent_Form-3_09-10-2019.pdf 2019-10-09
16 Description Complete_After Provisional_27-09-2019.pdf 2019-09-27
16 Drawing_After Provisional_27-09-2019.pdf 2019-09-27
16 Form3_As Filed_09-10-2019.pdf 2019-10-09
17 Form3_As Filed_09-10-2019.pdf 2019-10-09
17 Form-1_After Provisional_27-09-2019.pdf 2019-09-27
17 Drawing_After Provisional_27-09-2019.pdf 2019-09-27
17 Abstract_After Provisional_27-09-2019.pdf 2019-09-27
17 Correspondence by Agent_Certified Copy of Priority Document_09-10-2019.pdf 2019-10-09
18 Claims_After Provisional_27-09-2019.pdf 2019-09-27
18 Form-1_After Provisional_27-09-2019.pdf 2019-09-27
18 Form2 Title Page_Complete_27-09-2019.pdf 2019-09-27
18 Abstract_After Provisional_27-09-2019.pdf 2019-09-27
18 201841036688-Form18_Examination request_12-03-2020.pdf 2020-03-12
19 201841036688-Correspondence_12-03-2020.pdf 2020-03-12
19 Claims_After Provisional_27-09-2019.pdf 2019-09-27
19 Correspondence by Applicant_After Provisional_27-09-2019.pdf 2019-09-27
19 Correspondence By Applicant_Form1_12-03-2019.pdf 2019-03-12
19 Form2 Title Page_Complete_27-09-2019.pdf 2019-09-27
20 Form1_After Filing_12-03-2019.pdf 2019-03-12
20 Description Complete_After Provisional_27-09-2019.pdf 2019-09-27
20 Correspondence By Applicant_Form1_12-03-2019.pdf 2019-03-12
20 Correspondence by Applicant_After Provisional_27-09-2019.pdf 2019-09-27
20 201841036688-FER.pdf 2021-10-17
21 201841036688-OTHERS [27-12-2021(online)].pdf 2021-12-27
21 Correspondence by Applicant_As Filed_28-09-2018.pdf 2018-09-28
21 Description Complete_After Provisional_27-09-2019.pdf 2019-09-27
21 Drawing_After Provisional_27-09-2019.pdf 2019-09-27
21 Form1_After Filing_12-03-2019.pdf 2019-03-12
22 Form-1_After Provisional_27-09-2019.pdf 2019-09-27
22 Drawing_After Provisional_27-09-2019.pdf 2019-09-27
22 Description Provisional_As Filed_28-09-2018.pdf 2018-09-28
22 Correspondence by Applicant_As Filed_28-09-2018.pdf 2018-09-28
22 201841036688-FER_SER_REPLY [27-12-2021(online)].pdf 2021-12-27
23 201841036688-CORRESPONDENCE [27-12-2021(online)].pdf 2021-12-27
23 Description Provisional_As Filed_28-09-2018.pdf 2018-09-28
23 Form-1_After Provisional_27-09-2019.pdf 2019-09-27
23 Form1_As Filed_28-09-2018.pdf 2018-09-28
23 Form2 Title Page_Complete_27-09-2019.pdf 2019-09-27
24 201841036688-COMPLETE SPECIFICATION [27-12-2021(online)].pdf 2021-12-27
24 Correspondence By Applicant_Form1_12-03-2019.pdf 2019-03-12
24 Form1_As Filed_28-09-2018.pdf 2018-09-28
24 Form2 Title Page_Complete_27-09-2019.pdf 2019-09-27
24 Form3_As Filed_28-09-2018.pdf 2018-09-28
25 201841036688-CLAIMS [27-12-2021(online)].pdf 2021-12-27
25 Correspondence By Applicant_Form1_12-03-2019.pdf 2019-03-12
25 Form1_After Filing_12-03-2019.pdf 2019-03-12
25 Form3_As Filed_28-09-2018.pdf 2018-09-28
25 Form5_As Filed_28-09-2018.pdf 2018-09-28
26 201841036688-US(14)-HearingNotice-(HearingDate-09-12-2024).pdf 2024-11-12
26 Correspondence by Applicant_As Filed_28-09-2018.pdf 2018-09-28
26 Form1_After Filing_12-03-2019.pdf 2019-03-12
26 Form5_As Filed_28-09-2018.pdf 2018-09-28
27 Description Provisional_As Filed_28-09-2018.pdf 2018-09-28
27 Correspondence by Applicant_As Filed_28-09-2018.pdf 2018-09-28
27 201841036688-RELEVANT DOCUMENTS [04-02-2025(online)].pdf 2025-02-04
28 Form1_As Filed_28-09-2018.pdf 2018-09-28
28 Description Provisional_As Filed_28-09-2018.pdf 2018-09-28
28 201841036688-MARKED COPIES OF AMENDEMENTS [04-02-2025(online)].pdf 2025-02-04
29 201841036688-FORM 13 [04-02-2025(online)].pdf 2025-02-04
29 Form1_As Filed_28-09-2018.pdf 2018-09-28
29 Form3_As Filed_28-09-2018.pdf 2018-09-28
30 201841036688-AMENDED DOCUMENTS [04-02-2025(online)].pdf 2025-02-04
30 Form3_As Filed_28-09-2018.pdf 2018-09-28
30 Form5_As Filed_28-09-2018.pdf 2018-09-28
31 201841036688-Correspondence to notify the Controller [20-03-2025(online)].pdf 2025-03-20
31 Form5_As Filed_28-09-2018.pdf 2018-09-28

Search Strategy

1 search-opticalcharacterrecognitionE_10-06-2021.pdf