Abstract: The invention relates to a method for processing a user document for identifying and reporting deviations in a business document from standards and policies of organizations. In one embodiment, the method receives at least one primary document or a query text from at least one user for scrutinizing or check over one or more information. Reads the primary document to process the information to extract characteristics or features associated with the information into one or more useful text. Searches in a repository of secondary documents based on the one or more extracted text in the form of keywords. Retrieves one or more most suitable secondary document draft from the repository, wherein retrieving the secondary document is based on sentence to sentence matching or paragraph to paragraph matching or contextual matching with the primary document. Compares the primary document text with the secondary document text in order to highlight one or more sections where a business approval is required and provides a customized relevant and contextual match draft by highlighting suitable changes on the primary document text as per the secondary document text along with comments of the user, if any, in a track change mode. FIG. 2
Claims:We Claim:
1. A computer-implemented method for processing a user document for identifying and reporting deviations in a business document from standards and policies of organizations, comprising:
receiving at least one primary document or a query text from at least one user for scrutinizing or check over one or more information;
reading the primary document to process the information to extract characteristics or features associated with the information into one or more useful text;
searching in a repository of secondary documents based on the one or more extracted text in the form of keywords;
retrieving one or more most suitable secondary document draft from the repository, wherein retrieving the secondary document is based on sentence to sentence matching or paragraph to paragraph matching or contextual matching with the primary document;
comparing the primary document text with the secondary document text in order to highlight one or more sections where a business approval is required; and
providing a customized relevant and contextual match draft by highlighting suitable changes on the primary document text as per the secondary document text along with comments of the user, if any, in a track change mode.
2. The method of claim 1, further comprising:
storing the draft in the repository with a identification number and name to the draft for future reference.
3. The method of claim 1, wherein the secondary document includes one or more of vetted or standard documents in respective subjects or domains which is used for reference in business operations.
4. The method of claim 1, wherein the customized draft generated based upon one of the stored plurality of secondary documents, the customized draft including a plurality of clauses and business terms of the operations.
5. The method of claim 1, wherein the customized draft that do conform to the one or more standards or policies of the organization.
6. The method of claim 1, further comprising:
identifying and classifying the nature of the primary document in order to retrieve the relevant secondary document from the associated repository.
7. The method of claim 1, wherein processing of the primary document is by a natural language process.
8. A system for processing a user document for identifying and reporting deviations in a business document from standards and policies of organizations, the system comprising:
at least one processor;
at least one data storage device coupled to the at least one processor;
a plurality of processes spawned by said at least one processor, the processes including processing logic for:
receiving at least one primary document or a query text from at least one user for scrutinizing or check over one or more information;
reading the primary document to process the information and to extract characteristics or features associated with the information into one or more useful text;
searching in a repository of secondary documents based on the one or more extracted text in the form of keywords;
retrieving one or more most suitable secondary document draft from the repository, wherein retrieving the secondary document is based on sentence to sentence matching or paragraph to paragraph matching or contextual matching with the primary document;
comparing the primary document text with the secondary document text in order to highlight one or more sections where a business approval is required; and
providing a customized relevant and contextual match draft by highlighting suitable changes on the primary document text as per the secondary document text along with comments of the user, if any, in a track change mode.
9. The system of claim 8, wherein the system is configured to be implemented on a cloud network.
10. The system of claim 8, further comprises executable program instructions in a memory to be executed for identifying and reporting deviations in a business document from standards and policies of organizations.
11. A system of claim 8, further comprising:
a graphical user interface enabled through a Web browser for at least one user to input one or more query and information characterizing the type of documents to be retrieved, wherein the query and the information represent a document search profile.
, Description:FORM 2
THE PATENTS ACT, 1970
(39 of 1970)
&
THE PATENTS RULES, 2003
COMPLETE SPECIFICATION
(See section 10, rule 13)
“SureMatch”
By
Impactsure Technologies LLP
An Indian Company
C 804, Runwal Heights, LBS Road, Opp to Nimal Life Styles,
Mulund West, Mumbai, Maharashtra 400080
The following specification particularly describes the invention and the manner in which it is to be performed.
Field of the Invention
The invention relates generally to document search, document review, document comparison, efficient document modification, and/or merging of documents, and to various scenarios based on reviewing, comparing, efficiently modifying, and/or merging documents.
Background of the Invention
Financial Institutions (FIs) or Banks conventionally have many financial or business documents that they use to define financial or business relationships with their customers, vendors and partners. Most FIs or banks have business practices that author standard clauses and templates for standard business documents (such as, for example, contracts, licenses, purchase orders, quotes, lease agreements and the like) that are approved for use in various types of contracts. When banks negotiate contracts or author other types of business documents, they try to use the standard language whenever possible, both to promote uniformity in their business relations and to reduce legal liability by using only approved and vetted language. Because some deals or other bank undertakings do not lend themselves to standard boilerplate language, the need remains for authoring custom, non-standard clauses for use in financial documents. Often this is done as a result of negotiations carried out by a representative of the bank or company as he or she drives the deal forward. Non-standard clauses, however, must be carefully reviewed by business and legal personnel to ensure that they are correct and appropriate for the situation, primarily to evaluate risk and potential revenue recognition implications.
In general, both standard and non-standard financial or business documents are often stored as electronic documents on a file server, and sometimes as hard copies in filing cabinets. It is, however, difficult for FIs or banks to fully understand the overall impact of such non-standard contract language in these contracts, both for individual documents and in the aggregate across all business documents. It is also difficult for FIs or banks to analyze how often standard clauses are being modified, and which standard clauses are being most often modified or to analyze the frequency with which new custom clauses are being drafted and included in financial or business documents.
Indeed, there exists no comprehensive mechanism for analyzing the language of such financial or business documents to identify and assess risk and to improve the FI or bank’s library of standard clauses. In addition, if a FI or bank finds an issue with the language in a particular standard clause of a business document, it is very difficult to determine which other business documents may have used that problematic clause and are, therefore, potentially impacted. Moreover, there exists no mechanism to enable FIs or banks to evaluate risk exposure across their contract base (e.g., the ability to identify all contracts having non-standard liability clauses, intellectual property clauses, or contracts using a particular standard clause whose language has some issues). Also, new types of clauses may need to be included in the standard templates, as new clauses are often drafted ad hoc during the contracting process.
To summarize the above issues, the FIs or banks does not have any solution approach to re-use already vetted documents, further the existing mechanism does not provide any viable solution due to limited keyword search. Furthermore, in the existing mechanism has the problem in searching through multiple files for contextual language which leads to time consuming due to repetitive process.
Summary of the Invention
The inventive concepts presented herein are illustrated in a number of different embodiments, each showing one or more concepts, though it should be understood that, in general, the concepts are not mutually exclusive and may be used in combination even when not so illustrated.
Accordingly, in one aspect of the present invention provides a computer-implemented method for processing a user document for identifying and reporting deviations in a business document from standards and policies of organizations. The method includes the steps of receiving at least one primary document or a query text from at least one user for scrutinizing or check over one or more information, reading the primary document to process the information to extract characteristics or features associated with the information into one or more useful text, searching in a repository of secondary documents based on the one or more extracted text in the form of keywords, retrieving one or more most suitable secondary document draft from the repository, wherein retrieving the secondary document is based on sentence to sentence matching or paragraph to paragraph matching or contextual matching with the primary document, comparing the primary document text with the secondary document text in order to highlight one or more sections where the a business approval is required and providing a customized relevant and contextual match draft by highlighting suitable changes on the primary document text as per the secondary document text along with comments of the user, if any, in a track change mode.
In another aspect of the present invention provides a system for processing a user document for identifying and reporting deviations in a business document from standards and policies of organizations. The system includes at least one processor, at least one data storage device coupled to the at least one processor, a plurality of processes spawned by said at least one processor, the processes including processing logic for: receiving at least one primary document or a query text from at least one user for scrutinizing or check over one or more information, reading the primary document to process the information and to extract characteristics or features associated with the information into one or more useful text, searching in a repository of secondary documents based on the one or more extracted text in the form of keywords, retrieving one or more most suitable secondary document draft from the repository, wherein retrieving the secondary document is based on sentence to sentence matching or paragraph to paragraph matching or contextual matching with the primary document, comparing the primary document text with the secondary document text in order to highlight one or more sections where the a business approval is required and providing a customized relevant and contextual match draft by highlighting suitable changes on the primary document text as per the secondary document text along with comments of the user, if any, in a track change mode.
To further clarify advantages and features of the present invention, a more particular description of the invention will be rendered by reference to specific embodiments thereof, which is illustrated in the appended figures. It is appreciated that these figures depict only typical embodiments of the invention and are therefore not to be considered limiting of its scope.
Brief description of the drawings
The invention will be described and explained with additional specificity and detail with the accompanying figures in which:
FIG. 1 is a top-level block diagram illustrating an exemplary embodiment of a document processing system that processes electronic documents.
FIG. 2 is a flow chart of a computer-implemented method for processing a user document for identifying and reporting deviations in a business document from standards and policies of organizations, according to one embodiment of the present invention.
FIG. 3 shows process flow of the method of FIG. 2, in accordance with one embodiment of the present invention.
FIG. 4 is a block diagram representing an exemplary non-limiting computing system or operating environment in which one or more aspects of various embodiments described herein can be implemented
Further, skilled artisans will appreciate that elements in the figures are illustrated for simplicity and may not have been necessarily been drawn to scale. Furthermore, in terms of the construction of the device, one or more components of the device may have been represented in the figures by conventional symbols, and the figures may show only those specific details that are pertinent to understanding the embodiments of the present invention so as not to obscure the figures with details that will be readily apparent to those of ordinary skill in the art having benefit of the description herein.
Description of the Invention
For the purpose of promoting an understanding of the principles of the invention, reference will now be made to the embodiment illustrated in the figures and specific language will be used to describe the same. It will nevertheless be understood that no limitation of the scope of the invention is thereby intended, such alterations and further modifications in the illustrated system, and such further applications of the principles of the invention as illustrated therein being contemplated as would normally occur to one skilled in the art to which the invention relates.
It will be understood by those skilled in the art that the foregoing general description and the following detailed description are exemplary and explanatory of the invention and are not intended to be restrictive thereof. The terms "comprises", "comprising", or any other variations thereof, are intended to cover a non-exclusive inclusion, such that a process or method that comprises a list of steps does not include only those steps but may include other steps not expressly listed or inherent to such process or method. Similarly, one or more devices or sub-systems or elements or structures or components proceeded by "comprises... a" does not, without more constraints, preclude the existence of other devices or other sub-systems or other elements or other structures or other components or additional devices or additional sub-systems or additional elements or additional structures or additional components. Appearances of the phrase "in an embodiment", "in another embodiment" and similar language throughout this specification may, but do not necessarily, all refer to the same embodiment.
It will be understood that when an element or layer is referred to as being “on,” “connected to,” or “coupled to” another element or layer, it can be directly on, connected to, or coupled to the other element or layer or intervening elements or layers that may be present. As used herein, the term “and/or” includes any and all combinations of one or more of the associated listed items.
Spatially relative terms, such as “detection,” or “capture, and the like, may be used herein for ease of description to describe one element or feature’s relationship to another element(s) or feature(s) as illustrated in the figures. It will be understood that the spatially relative terms are intended to encompass different orientations of the structure in use or operation in addition to the orientation depicted in the figures.
Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. The system, methods, and examples provided herein are illustrative only and not intended to be limiting.
Embodiments of the present invention will be described below in detail with reference to the accompanying figures.
The present invention method provides a process compliance by complying with standard vetted language always. Further, the method provides quick results along with relevant and contextual within specific kinds of documents. Also, it provides results by determining correlation and anti-correlation information regarding the common presence or absence of specific pairs of provisions or pairs of documents. Furthermore, the method provides recognition functionality so that provisions in previously unsurveyed documents may be recognized as similar or identical to common textual patterns determined by the survey functionality. Also, the method reduces the search time & improve processing time by over 65%, where the automation of the method allows focused vetting for more documents. Furthermore, the method is capable of configuring multiple parameters for matching to give high confidence results, i.e. match more closely to common textual patterns determined by the survey functionality. Moreover, the method integrates with a document management system.
FIG. 1, the document processing system 100 is shown as receiving one or more incoming documents 110 and provides corresponding processed documents 120. The incoming documents 110 can be provided from any conventional document source. Exemplary incoming documents 110 can include documents that are received an electronically generated document, as a printed document, a facsimile document, an emailed document, and/or a mailed document. The document processing system 100 can comprise any conventional type of processing system, such as a computer system.
Each incoming document 110 includes at least one section of textual content and can be received in any conventional manner, including in a paper form and/or an electronic form. If provided as a paper-based document, the incoming document 110 preferably is imaged to convert the paper-based document into an imaged document having a suitable electronic form. The paper-based document can be converted into the imaged document in any conventional manner, such as via commercially-available scanning software. Many categories of documents, such as legal, accounting, and/or tax documents, often include one or more common passages with the same, or similar, textual content. For example, standard form contracts often include a plurality of boilerplate (or miscellaneous) provisions. The boilerplate provisions can be set forth at different locations and/or in different orders in the different contracts. The document processing system 100 therefore can facilitate the review of the incoming document 110 by identifying and classifying common text content between the incoming document 110 and at least one reference document, such as a second incoming document.
FIG. 2 is a flow chart of a computer-implemented method for processing a user document for identifying and reporting deviations in a business document from standards and policies of organizations, according to one embodiment of the present invention.
At step 210, the method receives at least one primary document or a query text from at least one user for scrutinizing or check over one or more information.
At step 220, the method reads the primary document to process the information to extract characteristics or features associated with the information into one or more useful text. In an embodiment, the method process the primary document is by a natural language process.
At step 230, the method searches in a repository of secondary documents based on the one or more extracted text in the form of keywords.
At step 240, the method retrieves one or more most suitable secondary document draft from the repository, wherein retrieving the secondary document is based on sentence to sentence matching or paragraph to paragraph matching or contextual matching with the primary document. In one embodiment the secondary document includes one or more of vetted or standard documents in respective subjects or domains which is used for reference in business operations. In an example embodiment, the method identifies and classifies the nature of the primary document in order to retrieve the relevant secondary document from the associated repository
At step 250, the method compares the primary document text with the secondary document text in order to highlight one or more sections where a business approval is required.
At step 260, the method provides a customized relevant and contextual match draft by highlighting suitable changes on the primary document text as per the secondary document text along with comments of the user, if any, in a track change mode. Moreover, the method further stores the draft in the repository with a identification number and name to the draft for future reference. In one embodiment, the customized draft generated based upon one of the stored plurality of secondary documents, the customized draft including a plurality of clauses and business terms of the operations. Moreover, the customized draft that do conform to the one or more standards or policies of the organization. FIG. 3 shows a system process flow of the method of FIG. 2, in accordance with one embodiment of the present invention. In an example operation, the system receives input image at 310. In an example, the input image may be or may include word file, PNG file, JPEG file, PDF file which are received by uploading the same by an user. It is also possible that, the input image may be read directly from a file repository or a read file from a ticket. Upon receipt of the input image, the system image cleansing process 315 using a image process engine 360. The system further extracts the text 320 by extraction AI engine- neural networks 365, where characteristics or features associated of the received file information into one or more useful text. Further, the system classifies at 325 using a classification engine 370. In an embodiment the system identifies and classifies the nature of the received file in order to retrieve the relevant one or more similar documents from the associated repository. Upon classifying, the system compares 335 the entire document with two or more already vetted document in the repository 380. Upon comparison, the file will be assigned with a name and a set of keywords at 340. Further, at 345, the system does the language match-clause with the documents which are selected to compare from the repository. The language match-clause is executed by the natural language processing (NLP) engine 385. Furthermore, the system compare the best match at 350 in order to create a fresh or required documents or draft at 355 and further stores the same in a template/dictionary. In an example embodiment, the system prepare a final document which can be provided to the customer after carrying out required changes in the draft post business approval for any onerous and other clauses which are required to be taken up with business as per pre-approved models. To highlight the changes that are made in the document includes comments of the users in the vetted documents in the repository. To have inbuilt functions of “Dictionary” of Legal Terminology to enable the user to understand the meaning of the same. In an example embodiment, in order to identify and highlight important clauses in the document like “Assignment”, “Auto Renewal”, “Open Ended BG”, “Arbitration BG” etc.
FIG. 4 thus illustrates an example of a suitable computing system environment 400 in which one or more of the embodiments may be implemented, although as made clear above, the computing system environment 400 is only one example of a suitable computing environment and is not intended to suggest any limitation as to the scope of use or functionality of any of the embodiments. The computing environment 400 is not to be interpreted as having any dependency or requirement relating to any one or combination of components illustrated in the exemplary operating environment 400.
With reference to FIG. 4, an exemplary remote device for implementing one or more embodiments herein can include a general purpose computing device in the form of a computer 410. Components of computer 410 may include, but are not limited to, a processing unit 420, a system memory 430, and a system bus 421 that couples various system components including the system memory to the processing unit 420.
Computer 410 typically includes a variety of computer readable media and can be any available media that can be accessed by computer 410. The system memory 430 may include computer storage media in the form of volatile and/or non-volatile memory such as read only memory (ROM) and/or random access memory (RAM). By way of example, and not limitation, memory 430 may also include an operating system, application programs, other program modules, and program data. In this regard, computer-readable media can be any available media that can be accessed by the computer including removable and non-removable media.
By way of example, and not limitation, computer-readable media can comprise computer storage media and/or communication media. Computer storage media can include both volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information such as computer-readable instructions, data structures, program modules or other data. Computer storage media includes, but is not limited to, RAM, ROM, Electrically Erasable Programmable Read-Only Memory (EEPROM), flash memory, universal serial bus (USB) drives, or other memory technology, compact disk (CD), digital versatile disk (DVD) or other optical disk storage, magnetic cassettes, magnetic tape, magnetic disk storage, or other magnetic storage devices, or any other medium which can be used to store information and which can be accessed by the computer.
Communication media typically embodies computer-readable instructions, data structures, program modules or other data in a modulated data signal such as a carrier wave or other transport mechanism, and includes any information delivery media. The term “modulated data signal” means a signal that has one or more of its characteristics set or changed in such a manner as to encode information in the signal. By way of example, and not limitation, communication media includes wired media such as a wired network or direct-wired connection, and wireless media such as acoustic, RF, infrared and other wireless media. Combinations of the any of the above should also be included within the scope of computer-readable media. A user may enter commands and information into the computer 410 through input devices 440. A monitor or other type of display device is also connected to the system bus 421 via an interface, such as output interface 450. In addition to a monitor, computers may also include other peripheral output devices such as speakers and a printer, which may be connected through output interface 450.
The computer 410 may operate in a networked or distributed environment using logical connections to one or more other remote computers, such as remote computer 470. The remote computer 470 may be a personal computer, a server, a router, a network PC, a peer device or other common network node, or any other remote media consumption or transmission device, and may include any or all of the elements described above relative to the computer 410. The logical connections depicted in FIG. 4 include a network 471, such local area network (LAN) or a wide area network (WAN), but may also include other networks/buses. Such networking environments are commonplace in homes, offices, enterprise-wide computer networks, intranets and the Internet.
As mentioned above, while exemplary embodiments have been described in connection with various computing devices, networks and memory architectures, the underlying concepts may be applied to any network system and any computing device or storage system in which it is desirable to search, review, compare, modify, replace, and/or merge data associated with media content as described herein.
There are multiple ways of implementing one or more of the embodiments described herein, e.g., an appropriate API, tool kit, driver code, operating system, control, standalone or downloadable software object, executable code, etc. which enables applications and services to use or access the various described embodiments from any platform. Embodiments may be contemplated from the standpoint of an API (or other software object), as well as from a software or hardware object that facilitates provision of disclosed techniques in accordance with one or more of the described embodiments. Various implementations and embodiments described herein may have aspects that are wholly in hardware, partly in hardware and partly in software, as well as in software.
The word “exemplary” is used herein to mean serving as an example, instance, or illustration. For the avoidance of doubt, the subject matter disclosed herein is not limited by such examples. In addition, any aspect or design described herein as “exemplary” is not necessarily to be construed as preferred or advantageous over other aspects or designs, nor is it meant to preclude equivalent exemplary structures and techniques known to those of ordinary skill in the art. Furthermore, to the extent that the terms “includes,” “has,” “contains,” and other similar words are used in either the detailed description or the claims, for the avoidance of doubt, such terms are intended to be inclusive in a manner similar to the term “comprising” as an open transition word without precluding any additional or other elements.
As mentioned, the various techniques described herein may be implemented in connection with hardware or software or, where appropriate, with a combination of both. As used herein, the terms “component,” “system” and the like are likewise intended to refer to a computer-related entity, either hardware, a combination of hardware and software, software, or software in execution. For example, a component may be, but is not limited to being, a process running on a processor, a processor, an object, an executable, a thread of execution, a program, and/or a computer. By way of illustration, both an application running on computer and the computer can be a component. One or more components may reside within a process and/or thread of execution and a component may be localized on one computer and/or distributed between two or more computers.
The aforementioned systems have been described with respect to interaction between several components. It can be appreciated that such systems and components can include those components or specified sub-components, some of the specified components or sub-components, and/or additional components, and according to various permutations and combinations of the foregoing. Sub-components can also be implemented as components communicatively coupled to other components rather than included within parent components (hierarchical). Additionally, it is noted that one or more components may be combined into a single component providing aggregate functionality or divided into several separate sub-components, and any one or more middle layers, such as a management layer, may be provided to communicatively couple to such sub-components in order to provide integrated functionality. Any components described herein may also interact with one or more other components not specifically described herein but generally known by those of skill in the art.
In view of the exemplary systems described supra, methodologies that may be implemented in accordance with the disclosed subject matter can be appreciated with reference to the flowcharts of the various figures. While for purposes of simplicity of explanation, the methodologies are shown and described as a series of blocks, it is to be understood and appreciated that the claimed subject matter is not limited by the order of the blocks, as some blocks may occur in different orders and/or concurrently with other blocks from what is depicted and described herein. Where non-sequential, or branched, flow is illustrated via flowchart, it can be appreciated that various other branches, flow paths, and orders of the blocks, may be implemented which achieve the same or a similar result. Moreover, not all illustrated blocks may be required to implement the methodologies described hereinafter.
While in some embodiments, a client side perspective is illustrated, it is to be understood for the avoidance of doubt that a corresponding server perspective exists, or vice versa. Similarly, where a method is practiced, a corresponding device can be provided having storage and at least one processor configured to practice that method via one or more components.
While specific language has been used to describe the disclosure, any limitations arising on account of the same are not intended. As would be apparent to a person in the art, various working modifications may be made to the method in order to implement the inventive concept as taught herein.
While the various embodiments have been described in connection with the preferred embodiments of the various figures, it is to be understood that other similar embodiments may be used or modifications and additions may be made to the described embodiment for performing the same function without deviating there from.
Still further, one or more aspects of the above described embodiments may be implemented in or across a plurality of processing chips or devices, and storage may similarly be affected across a plurality of devices. Therefore, the various embodiments should not be limited to any single embodiment, but rather should be construed in breadth and scope in accordance with the appended claims.
| # | Name | Date |
|---|---|---|
| 1 | 202121010380-STATEMENT OF UNDERTAKING (FORM 3) [11-03-2021(online)].pdf | 2021-03-11 |
| 2 | 202121010380-FORM FOR SMALL ENTITY(FORM-28) [11-03-2021(online)].pdf | 2021-03-11 |
| 3 | 202121010380-FORM FOR SMALL ENTITY [11-03-2021(online)].pdf | 2021-03-11 |
| 4 | 202121010380-FORM 1 [11-03-2021(online)].pdf | 2021-03-11 |
| 5 | 202121010380-EVIDENCE FOR REGISTRATION UNDER SSI(FORM-28) [11-03-2021(online)].pdf | 2021-03-11 |
| 6 | 202121010380-EVIDENCE FOR REGISTRATION UNDER SSI [11-03-2021(online)].pdf | 2021-03-11 |
| 7 | 202121010380-DRAWINGS [11-03-2021(online)].pdf | 2021-03-11 |
| 8 | 202121010380-DECLARATION OF INVENTORSHIP (FORM 5) [11-03-2021(online)].pdf | 2021-03-11 |
| 9 | 202121010380-COMPLETE SPECIFICATION [11-03-2021(online)].pdf | 2021-03-11 |
| 10 | Abstract1.jpg | 2022-06-24 |