Sign In to Follow Application
View All Documents & Correspondence

Method And System For Analysing Domain Specific Documents

Abstract: A method (600) and system (100) of analysing domain-specific documents is disclosed. A processor (104) determines one or more attributes in a first text data extracted from the plurality of domain-specific documents based on a first set of queries (400A) using a first generative machine learning (ML) model. A second set of queries (400B) are determined based portions of the first text data corresponding to the attributes using a second generative ML model. A third set of queries (400C) are determined based on domain type of the domain-specific documents and a second text data extracted for a co0rresponding domain-specific document from the domain-specific documents. A fourth set of queries (400D) are determined based on an iterative comparison between the second set of queries (400B) and the third set of queries (400C) until the fourth set of queries (400D) are determined as about same as second set of queries (400B). [Fig.1]

Get Free WhatsApp Updates!
Notices, Deadlines & Correspondence

Patent Information

Application #
Filing Date
28 November 2023
Publication Number
22/2025
Publication Type
INA
Invention Field
COMPUTER SCIENCE
Status
Email
Parent Application

Applicants

L&T TECHNOLOGY SERVICES LIMITED
DLF IT SEZ Park, 2nd Floor – Block 3, 1/124, Mount Poonamallee Road, Ramapuram, Chennai - 600 089, Tamil Nadu, India.

Inventors

1. PINAK PANI GOGOI
C-3, Vars Residency, 3rd Main, Bhuvaneshwari Nagar, Bangalore, Karnataka, India – 560093.
2. ASHWIN KANTH
S-9, Sheeba Nilaya, 1st Cross Road, Telecom Layout, Thanisandra, Bangalore, Karnataka, India – 560077.
3. PRAKHAR SRIVASTAVA
# 4-C, Jawahar Lal Nehru Road, Tagore Town, Prayagraj, Uttar Pradesh – 211002, India.
4. MADHUSUDAN SINGH
B-603, Ajmera Stone Park, 1st Cross, Neeladri Road, Electronic City-1, Bangalore, Karnataka, India - 560100.

Specification

1. A method (600) of analysing a plurality of domain-specific documents, the method (600)
comprising:
determining (602), by a processor (104), one or more attributes in a first text
data extracted from the plurality of domain-specific documents based on a first set of
queries (400A) using a first generative machine learning model,
determining (604), by the processor (104), a second set of queries (400B) based
on one or more portions of the first text data corresponding to the one or more attributes
using a second generative machine learning model;
for each of a portion of the plurality of domain-specific documents:
determining (606), by the processor (104), a third set of queries (400C) based
on a domain type of the plurality of domain-specific documents and a second text data
extracted for a corresponding domain-specific document from the portion of the
plurality of domain-specific documents, using a third generative machine learning
model,
wherein the domain type is determined based on a subject matter
analysis of the plurality of domain-specific documents; and
determining (608), by the processor (104), a fourth set of queries (400D) based
on a comparison between the second set of queries (400B) and the third set of queries
(400C),
wherein the first set of queries (400A) are iteratively updated based on
the corresponding fourth set of queries (400D) determined for each of the
corresponding domain-specific document from the portion of the plurality of
domain-specific documents; and
determining (610), by the processor (104), a final set of queries corresponding to the
plurality of domain-specific documents based on the fourth set of queries (400D) determined
as about same as the second set of queries (400B).
2. The method (600) as claimed in claim 1, comprising:
analysing, by the processor (104), the plurality of domain-specific documents by:
querying, by the processor (104), a fourth generative machine learning model
using the final set of queries; and
16
displaying, by the processor (104), an answer text data output by the fourth
generative machine learning model for each query of the final set of queries based on
the first text data.
3. The method (600) as claimed in claim 1, wherein the second set of queries (400B) are
determined based on validation of the one or more portions of the first text data corresponding
to the one or more attributes based on a user input.
4. The method (600) as claimed in claim 1, wherein the first text data and the second text data
are extracted using a text extraction technique.
5. The method (600) as claimed in claim 1, wherein the comparison between the second set of
queries (400B) and the third set of queries (400C) is based on a comparison of character
embeddings and position vector of each character in a query text data of each query of the
second set of queries (400B) and the third set of queries (400C).
6. A system (100) of analysing a plurality of domain-specific documents, comprising:
a processor (104); and
a memory (106) communicably coupled to the processor, wherein the memory (106)
stores processor-executable instructions, which when executed by the processor (104), cause
the processor (104) to:
determine a first set of queries (400A) based on one or more attributes in a first text
data extracted from the plurality of domain-specific documents based on a first set of queries
(400A) using a first generative machine learning model,
determine a second set of queries (400B) based on one or more portions of the first text
data corresponding to the one or more attributes using a second generative machine learning
model;
for each of a portion of the plurality of domain-specific documents:
determine a third set of queries (400C) based on a domain type of the plurality
of domain-specific documents and a second text data extracted for a corresponding
domain-specific document from the portion of the plurality of domain-specific
documents,
wherein the domain type is determined based on a subject matter
analysis of the plurality of domain-specific documents; and
17
determine a fourth set of queries (400D) based on a comparison between the
second set of queries (400B) and the third set of queries (400C),
wherein the first set of queries (400A) are iteratively updated based on
the corresponding fourth set of queries (400D) determined for each of the
corresponding domain-specific document from the portion of the plurality of
domain-specific documents; and
determine a final set of queries corresponding to the plurality of domain-specific
documents based on the fourth set of queries (400D) determined as about same as the
second set of queries (400B).
7. The system (100) as claimed in claim 6, wherein the processor (104) is configured to:
analyze the plurality of domain-specific documents based on:
query a fourth generative machine learning model using the final set of queries;
and
display an answer text data output by the fourth generative machine learning
model for each query of the final set of queries based on the first text data.
8. The system (100) as claimed in claim 6, wherein the second set of queries (400B) are
determined based on validation of the one or more portions of the first text data corresponding
to the one or more attributes based on a user input.
9. The system (100) as claimed in claim 6, wherein the first text data and the second text data
are extracted using a text extraction technique.
10. The system (100) as claimed in claim 6, wherein the comparison between the second set of
queries (400B) and the third set of queries (400C) is based on a comparison of character
embeddings and position vector of each character in a query text data of each query of the
second set of queries (400B) and the third set of queries (400C).

Documents

Application Documents

# Name Date
1 202341080898-STATEMENT OF UNDERTAKING (FORM 3) [28-11-2023(online)].pdf 2023-11-28
2 202341080898-REQUEST FOR EXAMINATION (FORM-18) [28-11-2023(online)].pdf 2023-11-28
3 202341080898-PROOF OF RIGHT [28-11-2023(online)].pdf 2023-11-28
4 202341080898-POWER OF AUTHORITY [28-11-2023(online)].pdf 2023-11-28
5 202341080898-FORM 18 [28-11-2023(online)].pdf 2023-11-28
6 202341080898-FORM 1 [28-11-2023(online)].pdf 2023-11-28
7 202341080898-DRAWINGS [28-11-2023(online)].pdf 2023-11-28
8 202341080898-DECLARATION OF INVENTORSHIP (FORM 5) [28-11-2023(online)].pdf 2023-11-28
9 202341080898-COMPLETE SPECIFICATION [28-11-2023(online)].pdf 2023-11-28