Sign In to Follow Application
View All Documents & Correspondence

A Method And System For Refinement Of Indic Language Translation

Abstract: A method and system is provided for suggesting context sensitive grammatical error correction by generating one or more word forms at real time from a source text to a translated text used for the refinement of Indic language translation. A translated text and a source text is provided as an input which is then parsed into word units based on the space between each word. The word units are further processed to generate a set of most relevant word forms at run time based upon the application of rules like part of speech, root form ending, context recognition and other factors. The disclosure can accept manual and machine translated text as input and can be used to refine the language translations through a browser based user interface, including mobiles. The disclosure also explains method for context sensitive corrections suggestions provided to the user which are automatically determined in real-time.

Get Free WhatsApp Updates!
Notices, Deadlines & Correspondence

Patent Information

Application #
Filing Date
04 April 2016
Publication Number
40/2017
Publication Type
INA
Invention Field
COMPUTER SCIENCE
Status
Email
iprdel@lakshmisri.com
Parent Application
Patent Number
Legal Status
Grant Date
2023-11-30
Renewal Date

Applicants

TATA CONSULTANCY SERVICES LIMITED
Nirmal Building, 9th Floor, Nariman Point, Mumbai - 400021, Maharashtra, India

Inventors

1. JOSHI, Shrikant
Tata Consultancy Services Limited Tata Research Development & Design Centre, 54-B, Hadapsar Industrial Estate, Hadapsar, Pune - 411 013 Maharashtra, India
2. KALRA, Kanika
Tata Consultancy Services Limited Tata Research Development & Design Centre, 54-B, Hadapsar Industrial Estate, Hadapsar, Pune - 411 013 Maharashtra, India
3. SRIRAMAN, Anand
Tata Consultancy Services Limited Tata Research Development & Design Centre, 54-B, Hadapsar Industrial Estate, Hadapsar, Pune - 411 013 Maharashtra, India
4. KARANDE, Shirish Subhash
Tata Consultancy Services Limited Tata Research Development & Design Centre, 54-B, Hadapsar Industrial Estate, Hadapsar, Pune - 411 013 Maharashtra, India
5. LODHA, Sachin Premsukh
Tata Consultancy Services Limited Tata Research Development & Design Centre, 54-B, Hadapsar Industrial Estate, Hadapsar, Pune - 411 013 Maharashtra, India
6. KUMAR, Rahul
Tata Consultancy Services Limited Tata Research Development & Design Centre, 54-B, Hadapsar Industrial Estate, Hadapsar, Pune - 411 013 Maharashtra, India

Specification

Claims:1. A computer implemented method for suggesting context sensitive grammatical error corrections to an output text of a translation, the method comprising processor implemented steps of:
receiving, a source text and a translated text as an input by a data collector module;
parsing the source text and the translated text into word units using a tokenizer module;
aligning each of the word units of translated text with the corresponding word unit of the source text using an aligner module;
tagging each of the word units of translated text and source text according to at least one factor wherein the at least one factor comprises a grammatical category, a gender, a number and a case using a morphological analyzer module;
performing a linguistic processing on the tagged word units using a language engine component, wherein the linguistic processing comprises:
extracting morphosyntactic rules of a target language by using the synchronous context free grammar technique of a rule extractor module, and
generating one or more word forms of the word units of translated text considering the tag of the word units using a morphological generator module; and
suggesting, using an user interface, the generated one or more word forms, wherein the generated one or more word forms to be displayed in the form of drop-down lists.

2. The method as claimed in claim 1, wherein the aligning further comprises comparing the word units of the translated text with the source text to determine which word in the translated text corresponds to which word in the source text.

3. The method as claimed in claim 1, wherein the extracting morphosyntactic rule further comprises extracting and associating at least one morphological rule, to the tagged word unit.

4. The method as claimed in claim 1, wherein the morphosyntactic rule comprises of associating the tagged word units to at least one of a extracted feature, wherein the extracted feature further comprises at least one of a root form ending, part of speech, numerals, case and gender.

5. The method as claimed in claim 1 ,wherein the generating one or more word forms further comprises providing alternate syntactical word forms, wherein the word forms correspond to change in the extracted feature of the word units at run time.

6. The method as claimed in claim 5, wherein the generating one or more word forms further comprises:
extracting an interrelationship of the said word units to generate a syntactical set of word forms;
tagging the said word units to a second factor based on the interrelationship;
comparing the change between the second factor and previous factor of the word units; and
identifying and associating the most appropriate rule to the tagged word units.
7. A system for suggesting context sensitive grammatical error corrections to an output text of a translation, the system comprising:
a processor;
a memory communicatively coupled with the processor, the memory comprising:
a pre-processor component to receive a source text and a translated text as an input, wherein the pre-processor component comprises:
a data collector module to receive the source text and a translated text;
a tokenizer module to parse the source text and translated text into word units;
an aligner module to align the tokenized word units of translated text corresponding to word units of source text and sending an alignment information to a morphological generator module;
a morphological analyzer module generates output by analyzing and tagging the word units of source text and translated text according to at least one of a grammatical category, a gender, a number and a case;
a language engine component to perform a linguistic processing of the received output of the pre-processor component, wherein the language engine component comprises:
a rule engine to create and store the morphosyntactic rules;
a rule extractor module to extract morphosyntactic rules of a target language by using the synchronous context free grammar technique;
a morphological generator module to generate one or more word forms of the word units of translated text considering the tag of the word units and alignment information sent from the aligner module; and
an user interface to suggest the generated one or more word forms, wherein the word forms to be displayed in the form of drop-down lists.

8. The system claimed in claim 7, wherein the output text of translation comprises a translated text output from manual or automated translation.

9. The system as claimed in claim 7, wherein the rules created by the rule engine module are based on part of speech and root form ending of the word units.

10. The system as claimed in claim 7, wherein the rule engine further configured to perform:
a comparison of a grammatically correct word forms and incorrect word forms of the target language to determine application of the morphological rule;
storing of the determined morphological rules; and

suggesting the list of one or more word forms subsequent to the comparison through the user interface as a drop-down list.
, Description:As Attahced

Documents

Application Documents

# Name Date
1 Form 5 [04-04-2016(online)].pdf 2016-04-04
2 Form 3 [04-04-2016(online)].pdf 2016-04-04
3 Form 18 [04-04-2016(online)].pdf 2016-04-04
4 Drawing [04-04-2016(online)].pdf 2016-04-04
5 Description(Complete) [04-04-2016(online)].pdf 2016-04-04
6 Other Patent Document [16-09-2016(online)].pdf 2016-09-16
7 ABSTRACT.jpg 2018-08-11
8 201621011932-POWER OF ATTORNEY-(02-5-2016).pdf 2018-08-11
9 201621011932-Form 1-220916.pdf 2018-08-11
10 201621011932-Correspondence-220916.pdf 2018-08-11
11 201621011932-CORRESPONDENCE-(02-5-2016).pdf 2018-08-11
12 201621011932-FER.pdf 2020-04-30
13 201621011932-OTHERS [21-10-2020(online)].pdf 2020-10-21
14 201621011932-FER_SER_REPLY [21-10-2020(online)].pdf 2020-10-21
15 201621011932-DRAWING [21-10-2020(online)].pdf 2020-10-21
16 201621011932-COMPLETE SPECIFICATION [21-10-2020(online)].pdf 2020-10-21
17 201621011932-CLAIMS [21-10-2020(online)].pdf 2020-10-21
18 201621011932-US(14)-HearingNotice-(HearingDate-02-11-2023).pdf 2023-10-11
19 201621011932-Correspondence to notify the Controller [17-10-2023(online)].pdf 2023-10-17
20 201621011932-FORM-26 [31-10-2023(online)].pdf 2023-10-31
21 201621011932-Written submissions and relevant documents [17-11-2023(online)].pdf 2023-11-17
22 201621011932-PatentCertificate30-11-2023.pdf 2023-11-30
23 201621011932-IntimationOfGrant30-11-2023.pdf 2023-11-30

Search Strategy

1 US20150046147A1E_28-04-2020.pdf
2 US20120316862A1E_28-04-2020.pdf
3 2020-04-2816-14-17E_28-04-2020.pdf

ERegister / Renewals

3rd: 11 Jan 2024

From 04/04/2018 - To 04/04/2019

4th: 11 Jan 2024

From 04/04/2019 - To 04/04/2020

5th: 11 Jan 2024

From 04/04/2020 - To 04/04/2021

6th: 11 Jan 2024

From 04/04/2021 - To 04/04/2022

7th: 11 Jan 2024

From 04/04/2022 - To 04/04/2023

8th: 11 Jan 2024

From 04/04/2023 - To 04/04/2024

9th: 11 Jan 2024

From 04/04/2024 - To 04/04/2025

10th: 02 Apr 2025

From 04/04/2025 - To 04/04/2026