Abstract: The present disclosure relates to method and system for classification of web browsing history by classification system. The classification system receives web browsing history from web browser associated with user, where web browsing history comprises details about one or more web pages browsed by user, extracts one or more keywords from each of one or more web pages browsed by user based on trained keyword dataset, determines a plurality of classifications for each of the one or more web pages based on one or more keywords, generates relevancy matrix between one or more keywords of web pages and corresponding plurality of classifications and identifies a classification from plurality of classifications for each of one or more webpages based on relevancy matrix, where snapshot of classification is stored in non-volatile storage unit of web browser. The use of non-volatile storage unit in present disclosure provides no restriction on storage space. FIG.1
Claims:WE CLAIM:
1. A method for classification of web browsing history, the method comprising:
receiving, by a classification system (101), the web browsing history from a web browser (115) associated with a user, wherein the web browsing history comprises details about one or more web pages browsed by the user;
extracting, by the classification system (101), one or more keywords from each of the one or more web pages browsed by the user based on a trained keyword dataset;
determining, by the classification system (101), a plurality of classifications for each of the one or more web pages based on the one or more keywords;
generating, by the classification system (101), a relevancy matrix between each of the one or more keywords of the one or more web pages and the corresponding plurality of classifications; and
identifying, by the classification system (101), a classification from the plurality of classifications for each of the one or more webpages based on the relevancy matrix, wherein a snapshot of the classification is stored in a non-volatile storage unit (117) of the web browser (115).
2. The method as claimed in claim 1, further comprising identifying a classification for the one or more web pages browsed by the user based on the snapshot of the classification stored in the non-volatile storage unit of the web browser.
3. The method as claimed in claim 1, wherein the details about the one or more web pages comprises metadata of the web page, data regarding header, Uniform Resource Locator (URL), title and time stamp in the one or more web pages browsed by the user.
4. The method as claimed in claim 1, wherein the web browsing history is received from the web browser at pre-defined time intervals.
5. The method as claimed in claim 1, wherein extracting the one or more keywords comprises:
providing, by the classification system, the details about the one or more web pages to a trained keyword extraction model, wherein the trained keyword extraction model is trained using a plurality of training web page dataset; and
extracting, by the classification system, the one or more keywords for each of the one or more web pages based on analysis of the trained keyword extraction model.
6. The method as claimed in claim 1, wherein the plurality of classifications associated with each of the one or more webpages are determined by identifying common characteristics between the one or more keywords and parameters of each web browsing path using a trained classification dataset.
7. The method as claimed in claim 1, wherein the trained classification dataset comprises one or more classification identified previously based on the web browsing history of the user.
8. The method as claimed in claim 1 further comprising adding a user defined keyword to the trained keyword dataset.
9. The method as claimed in claim 1, wherein the relevancy matrix is generated by calculating a cosine similarity score between the one or more keywords of the one or more web pages and the corresponding plurality of classifications.
10. The method as claimed in claim 1, wherein the snapshot of the classification comprises details about time stamp, URL, title, cosine similarity score and relevancy matrix for each of the one or more web pages.
11. A classification system (101) for classification of web browsing history, comprising:
a processor (113); and
a memory (111) communicatively coupled to the processor (113), wherein the memory (111) stores processor instructions, which, on execution, causes the processor (113) to:
receive the web browsing history from a web browser (115) associated with a user, wherein the web browsing history comprises details about one or more web pages browsed by the user;
extract one or more keywords from each of the one or more web pages browsed by the user based on a trained keyword dataset;
determine a plurality of classifications for each of the one or more web pages based on the one or more keywords;
generate a relevancy matrix between each of the one or more keywords of the one or more web pages and the corresponding plurality of classifications; and
identify a classification from the plurality of classifications for each of the one or more webpages based on the relevancy matrix, wherein a snapshot of the classification is stored in a non-volatile storage unit (117) of the web browser (115).
12. The classification system (101) as claimed in claim 11, wherein the processor identifies a classification for the one or more web pages browsed by the user based on the snapshot of the classification stored in the non-volatile storage unit of the web browser.
13. The classification system (101) as claimed in claim 11, wherein the details about the one or more web pages comprises metadata of the web page, data regarding header, Uniform Resource Locator (URL), title and time stamp in the one or more web pages browsed by the user.
14. The classification system (101) as claimed in claim 11, wherein the processor receives the web browsing history from the web browser at pre-defined time intervals.
15. The classification system (101) as claimed in claim 11, wherein the processor extracts the one or more keywords by:
providing the details about the one or more web pages to a trained keyword extraction model, wherein the trained keyword extraction model is trained using a plurality of training web page dataset; and
extracting the one or more keywords for each of the one or more web pages based on analysis of the trained keyword extraction model.
16. The classification system (101) as claimed in claim 11, wherein the processor determines the plurality of classifications associated with each of the one or more webpages by identifying common characteristics between the one or more keywords and parameters of each web browsing path using a trained classification dataset.
17. The classification system (101) as claimed in claim 11, wherein the trained classification dataset comprises one or more classification identified previously based on the web browsing history of the user.
18. The classification system (101) as claimed in claim 11, wherein the processor adds a user defined keyword to the trained keyword dataset.
19. The classification system (101) as claimed in claim 11, wherein the processor generates the relevancy matrix by calculating a cosine similarity score between the one or more keywords of the one or more web pages and the corresponding plurality of classifications.
20. The classification system (101) as claimed in claim 11, wherein the snapshot of the classification comprises details about time stamp, URL, title, cosine similarity score and relevancy matrix for each of the one or more web pages.
Dated this 9th day of March, 2017
R Ramya Rao
Of K&S Partners
Agent for the Applicant
, Description:TECHNICAL FIELD
The present subject matter is related in general to classification system, more particularly, but not exclusively, to a method and system for classification of web browsing history.
| # | Name | Date |
|---|---|---|
| 1 | Power of Attorney [09-03-2017(online)].pdf | 2017-03-09 |
| 2 | Form 5 [09-03-2017(online)].pdf | 2017-03-09 |
| 3 | Form 3 [09-03-2017(online)].pdf | 2017-03-09 |
| 4 | Form 18 [09-03-2017(online)].pdf_255.pdf | 2017-03-09 |
| 5 | Form 18 [09-03-2017(online)].pdf | 2017-03-09 |
| 6 | Form 1 [09-03-2017(online)].pdf | 2017-03-09 |
| 7 | Drawing [09-03-2017(online)].pdf | 2017-03-09 |
| 8 | Description(Complete) [09-03-2017(online)].pdf_254.pdf | 2017-03-09 |
| 9 | Description(Complete) [09-03-2017(online)].pdf | 2017-03-09 |
| 10 | REQUEST FOR CERTIFIED COPY [10-03-2017(online)].pdf | 2017-03-10 |
| 11 | 201741008293-REQUEST FOR CERTIFIED COPY [14-07-2017(online)].pdf | 2017-07-14 |
| 12 | 201741008293-Proof of Right (MANDATORY) [12-12-2017(online)].pdf | 2017-12-12 |
| 13 | Correspondence by Agent_Form 1_15-12-2017.pdf | 2017-12-15 |
| 14 | abstract 201741008293 .jpg | 2017-12-19 |
| 15 | 201741008293-FER.pdf | 2020-04-21 |
| 16 | 201741008293-RELEVANT DOCUMENTS [21-10-2020(online)].pdf | 2020-10-21 |
| 17 | 201741008293-RELEVANT DOCUMENTS [21-10-2020(online)]-1.pdf | 2020-10-21 |
| 18 | 201741008293-PETITION UNDER RULE 137 [21-10-2020(online)].pdf | 2020-10-21 |
| 19 | 201741008293-PETITION UNDER RULE 137 [21-10-2020(online)]-1.pdf | 2020-10-21 |
| 20 | 201741008293-OTHERS [21-10-2020(online)].pdf | 2020-10-21 |
| 21 | 201741008293-Information under section 8(2) [21-10-2020(online)].pdf | 2020-10-21 |
| 22 | 201741008293-FORM 3 [21-10-2020(online)].pdf | 2020-10-21 |
| 23 | 201741008293-FER_SER_REPLY [21-10-2020(online)].pdf | 2020-10-21 |
| 24 | 201741008293-DRAWING [21-10-2020(online)].pdf | 2020-10-21 |
| 25 | 201741008293-CORRESPONDENCE [21-10-2020(online)].pdf | 2020-10-21 |
| 26 | 201741008293-COMPLETE SPECIFICATION [21-10-2020(online)].pdf | 2020-10-21 |
| 27 | 201741008293-CLAIMS [21-10-2020(online)].pdf | 2020-10-21 |
| 28 | 201741008293-ABSTRACT [21-10-2020(online)].pdf | 2020-10-21 |
| 29 | 201741008293-US(14)-HearingNotice-(HearingDate-22-12-2022).pdf | 2022-11-21 |
| 30 | 201741008293-POA [01-12-2022(online)].pdf | 2022-12-01 |
| 31 | 201741008293-FORM 13 [01-12-2022(online)].pdf | 2022-12-01 |
| 32 | 201741008293-Correspondence to notify the Controller [01-12-2022(online)].pdf | 2022-12-01 |
| 33 | 201741008293-AMENDED DOCUMENTS [01-12-2022(online)].pdf | 2022-12-01 |
| 34 | 201741008293-Written submissions and relevant documents [06-01-2023(online)].pdf | 2023-01-06 |
| 35 | 201741008293-PatentCertificate09-03-2023.pdf | 2023-03-09 |
| 36 | 201741008293-IntimationOfGrant09-03-2023.pdf | 2023-03-09 |
| 1 | srchE_13-03-2020.pdf |