Abstract: A method and system of classifying documents is disclosed. The method includes determining line-text data for each of a plurality of lines of the document image using a text extraction technique. A set of unique keywords in the document image is determined based on a predefined list of keywords in the line-text data for each of the plurality of lines. For each keyword, two forward nodes are determined as next two subsequent keywords in the set of unique keywords based on determination of a shortest distance between the corresponding keyword and the next two subsequent keywords and based on the pre-defined reading sequence. Weights of each of the two forward nodes are determined based on the shortest distance and an angle of each of the two forward nodes with the corresponding keyword. A cluster is determined based on the feature matrix using a machine learning clustering model. [To be published with FIG. 1]
Description:PLEASE REFER THE ATTACHMENT , Claims:PLEASE REFER THE ATTACHMENT
| # | Name | Date |
|---|---|---|
| 1 | 202341039663-STATEMENT OF UNDERTAKING (FORM 3) [09-06-2023(online)].pdf | 2023-06-09 |
| 2 | 202341039663-REQUEST FOR EXAMINATION (FORM-18) [09-06-2023(online)].pdf | 2023-06-09 |
| 3 | 202341039663-PROOF OF RIGHT [09-06-2023(online)].pdf | 2023-06-09 |
| 4 | 202341039663-POWER OF AUTHORITY [09-06-2023(online)].pdf | 2023-06-09 |
| 5 | 202341039663-FORM 18 [09-06-2023(online)].pdf | 2023-06-09 |
| 6 | 202341039663-FORM 1 [09-06-2023(online)].pdf | 2023-06-09 |
| 7 | 202341039663-DRAWINGS [09-06-2023(online)].pdf | 2023-06-09 |
| 8 | 202341039663-DECLARATION OF INVENTORSHIP (FORM 5) [09-06-2023(online)].pdf | 2023-06-09 |
| 9 | 202341039663-COMPLETE SPECIFICATION [09-06-2023(online)].pdf | 2023-06-09 |
| 10 | 202341039663-Form 1 (Submitted on date of filing) [08-12-2023(online)].pdf | 2023-12-08 |
| 11 | 202341039663-Covering Letter [08-12-2023(online)].pdf | 2023-12-08 |
| 12 | 202341039663-FORM 3 [18-04-2024(online)].pdf | 2024-04-18 |