Sign In to Follow Application
View All Documents & Correspondence

English To Marathi Transliteration

Abstract: ABSTRACT SMS has provided us with a capacity of conversing with each other in minimal number of words. This cutting short of words has created new short slangs and a new SMS lingo. This has reduced efforts and time consumed for typing complete words also it has cut cost. But due to improper standardization of such slang the message are not properly understood by everyone. It is easier for a person to understand the messages or text if it is in his/ her native language script. This work emphasizes on the conversion of SMS slangs to meaningful and proper text messages. This can be achieved by the conversion of SMS slangs to proper standard word and then transliteration for the word to Marathi. The work mainly focuses only on the Marathi language but could be further used for other languages also. Following invention is described in detail with the help of Figure 1 of sheet 1 showing the flow diagram for the proposed system.

Get Free WhatsApp Updates!
Notices, Deadlines & Correspondence

Patent Information

Application #
Filing Date
08 March 2016
Publication Number
36/2017
Publication Type
INA
Invention Field
COMPUTER SCIENCE
Status
Email
sjgawande@gmail.com
Parent Application

Applicants

1. G. H. R. Labs and Research Centre
345, Shradha House, Kingsway, Nagpur, Maharashtra-440001
2. G. H. Raisoni College of engineering.
CRPF Gate No. 3, Digdoh Hills, Hingna Road, Nagpur, Maharashtra-440016.

Inventors

1. Krutika Jayant Sapkal
G. H. Raisoni College of engineering, CRPF Gate No. 3, Digdoh Hills, Hingna Road, Nagpur, Maharashtra-440016.
2. Urmila Shrawankar
G. H. Raisoni College of engineering, CRPF Gate No. 3, Digdoh Hills, Hingna Road, Nagpur, Maharashtra-440016.

Specification

Claims:CLAIMS
We claim:-

1. A system and methodology for English to Marathi Transliteration which converts the short hands short SMS language words to the proper words that can be understood by everyone by converting the SMS language to Standard Normal text and then to Marathi wherein first SMS language is translated to words to the normal language and then these words are transliterated to Marathi.

2. In the methodology as claimed in claim 1:
a) firstly the input is taken as the SMS received;
b) then all the SMS slangs that are present in the text message are traced;
c) in this step all the words that are not in their standard format are found;
d) Here we use the tentative database that is created by the survey for collection of words done;
e) After tracing out the slang words we find their proper complete word from the database;
f) But if this word is no found we forward it as it is;
g) If the word is found then the word is replaced by its root word;
h) Next we apply transliteration to the complete SMS text that is converted to normal form.
, Description:FORM 2
THE PATENT ACT 1970
(39 OF 1970)
AND
The patent rules, 2003
COMPLETE SPECIFICATION
(See section 10: rule 13)

1. TITLE OF INVENTION
English to Marathi Transliteration

2 APPLICANTS
Name Nationality Address
G. H. R. Labs and Research Centre Indian 345, Shradha House, Kingsway, Nagpur, Maharashtra-440001
G. H. Raisoni College of engineering. Indian CRPF Gate No. 3, Digdoh Hills, Hingna Road, Nagpur, Maharashtra-440016.

3. PREAMBLE TO THE DESCRIPTION

COMPLETE
Following specification particularly describes the invention and the manner in which it is to be performed.

4. DESCRIPTION.

Technical field of invention:

Present invention in general relates to design and develop a system and methodology for English to Marathi Transliteration and in particular to convert the short hands short SMS language words to the proper words.

Prior art:

There are many techniques that are used for transliteration but the prior condition for all of these transliteration techniques is that the words must be present in the standard form. The transliteration of the SMS language to the normal standard output language does not give the desired output. So the technique of WX notations where the SMS language words are converted to the Roman Numerals and then the words are transliterated to the given output language. For the translation of the SMS language words to Standard words we have many techniques such as the Probabilistic n-Gram technique and the bi gram techniques. But the flaw of these techniques is that there can be two or more possible outputs for one single word which causes lot of errors.

US 7177794 B2 discloses a system and method for writing Indian languages using English alphabet. A system and method for writing Indian languages using the English writing scheme is provided that includes specifying a script using the English alphabet to represent the various characters and character combinations in various Indian languages. The specified script follows the writing conventions of English. This script is based on how the Indian languages are spoken and rules are specified to facilitate mapping the sounds represented in English characters to the native language in its written form. This common method is intended for writing Hindi and related languages, such as Sanskrit, Marathi, and Gujarathi, and Bengali and somewhat distant, but closely related Dravidian languages, such as Malayalam, Tamil, Kannada, and Telegu.

US 7805290 B2 discloses method, apparatus, and program for transliteration of documents in various Indian languages. A transliteration mechanism is provided that allows a user to view a text in one Indian language, to highlight a word or phrase, and to easily transliterated the selected word or phrase into a target language or script. The mechanism may be an application, an applet, or a plug-in to another application, such as a Web browser. The target language and/or script may be stored in a user profile. Preferably, the source language may be any known Indian language in any known script.

WO 2001020435 A2 describes use of English phonetics to write non-roman characters. The invention is directed to a system, method, and apparatus which permits users to create text documents using a particular non-Roman character set, with a Roman character keyboard. A user creating a document in a non-Roman character set phonetically types the sounds of the words using the Roman characters into an input graphical user interface. The typed characters are searched for in a transliteration database which correlates clusters of Roman characters which mimic the non-Roman characters to the respective non-Roman characters. The non-Roman character corresponding to the typed characters is then printed in an output graphical user interface.

US 8731901 B2 discloses context aware back-transliteration and translation of names and common phrases using web resources. Described within are systems and methods for transliterating and translating source non-Romanized language text strings from a plurality of electronic sources to Romanized target language text strings by converting the source non-Romanized language text strings to a standard document encoding format, splitting the source non-Romanized language text strings into smaller units, transforming the smaller units into entity profiles, processing the entities profiles with data from external databases, translating the entities in the entity profiles into a Romanized target language, and outputting the entities into a plurality of data formats for external systems.

US 8005664 B2 describes system, method to generate transliteration and method for generating decision tree to obtain transliteration. The transliteration scheme developed is contingent upon two languages, namely the keyboard language and the target language. While the invention is independent of the actual languages in question, save to the extent that they must be phonetic or pseudo-phonetic and the keyboard language must possess a script, it is nonetheless convenient to cast the invention within the framework of two specific languages. In this regard, English has been chosen as the keyboard language, and Hindi (a language widely spoken in India) has been chosen as the target language. An understanding of the current invention is possible without any knowledge of Hindi by a practitioner skilled in the art.

Mobile electronic device and associated method enabling transliteration of a text input is disclosed in US 8275601 B2. An improved mobile electronic device enables the inputting of text in one alphabet, Traditional Chinese in the present example, by transliteration of inputs in another alphabet, BoPoMoFo in the present example. Since some of the inputs can be ambiguous, transliteration of an ambiguous input is delayed until a detection of a finalization event, thus avoiding the outputting of unnecessary transliteration results, and thus advantageously avoiding a distraction to the user.

US 20130246042 A1 discloses transliteration device, transliteration program, computer-readable recording medium on which transliteration program is recorded, and transliteration method. A transliteration device comprises a generation part generating rewriting tables corresponding to K different languages and including multiple rewriting probabilities that an original segment can be rewritten as a transliterated segment for transliteration and transliteration tables corresponding to the K languages. The transliteration device further comprises an update part saving the transliteration probability that an original spelling string originating from a language corresponding to the used rewritten language is transliterated to a target spelling string in the transliteration able corresponding to the language, and so updating the K rewriting tables as to maximize the expected value of a likelihood function calculating the likelihood of the K transliteration tables.

Machine Learning for Transliteration is disclosed in US 20080221866 A1. Methods, systems, and apparatus, including computer program products, for performing transliteration between text in different scripts. In one aspect, a method includes generating a transliteration model based on statistical information derived from parallel text having first text in an input script and corresponding second text in an output script; and using the transliteration model to transliterate input characters in the input script to output characters in the output script. In another aspect, a method includes performing word level transliterations. In another aspect, a method includes using an entry-aligned dictionary of source and target script pairs, in which, whenever a particular source word is mapped to multiple target words, the dictionary includes an entry for each target word including the same source word repeated in each entry. In another aspect, a method includes using phonetic scores of words in different scripts to identify corresponding parallel text.

The above mentioned systems suffer various drawbacks therefore to avoid the drawbacks of the existing system there is need to develop a cost effective and efficient method and system for language transliteration. Hence the present invention develops a method and system for English to Marathi Transliteration.

Object:

1. Primary object of the present invention is to provide a system and methodology for English to Marathi Transliteration.

2. Another object of the present invention is to convert the short hands short SMS language words to the proper words that can be understood by everyone.

3. Yet another object of the present invention is to convert regional language short words into same regional language full form.

4. Yet another object of the present invention is to reduce the time for typing SMS.
5. Yet another object of the present invention is to optimize the primary memory of SMS storage.

6. Yet another object of the present invention is to reduce the cost of per SMS by optimizing the words.

7. Yet another object of the present invention is to maintaining the semantics of the regional language.

8. Yet another object of the present invention is that the problem of miscommunication can be overcome by converting the SMS into standard format.

Other objects, features and advantages will become apparent from detail description and appended claims to those skilled in art.

STATEMENT:

Accordingly following invention provide a system and methodology for English to Marathi Transliteration. The system converts the short hands short SMS language words to the proper words that can be understood by everyone. This problem of miscommunication can be overcome by converting the SMS into Standard format. It is easier to understand a SMS when it is in Standard format. This work focuses on converting the SMS language to Standard Normal text and then to Marathi. In this technique we have first translated the SMS language words to the normal language and then these words are transliterated to Marathi.
Firstly the input is taken as the SMS received. Then all the SMS slangs that are present in the text message are traced. In this step all the words that are not in their standard format are found. Here we use the tentative database that is created by the survey for collection of words done. After tracing out the slang words we find their proper complete word from the database. But if this word is no found we forward it as it is. If the word is found then the word is replaced by its root word. Next we apply transliteration to the complete SMS text that is converted to normal form.

BRIEF DESCRIPTION OF DRAWING:

This invention is described by way of example with reference to the following drawing where,

Figure 1 of sheet 1 shows the flow diagram for the proposed system.
Where,
1 denotes Start
2 denotes Input SMS text
3 denotes Trace each word
4 denotes Search for the word in database
5 denotes Found
6 denotes Not found
7 denotes Replace the word with the root word
8 denotes Make a new entry
9 denotes Assemble all the words together
10 denotes Convert to Marathi
11 denotes Display both the outputs
12 denotes Stop.

In order that the manner in which the above-cited and other advantages and objects of the invention are obtained, a more particular description of the invention briefly described above will be referred, which are illustrated in the appended drawing. Understanding that these drawing depict only typical embodiment of the invention and therefore not to be considered limiting on its scope, the invention will be described with additional specificity and details through the use of the accompanying drawing.

Detailed description:

The use of SMS lingo is done to communicate with each other so that the efforts and time required for typing a message is reduced. But this SMS Lingo is not standard or follows any specific convention so it is free for users to use as per their wish which causes a lot of problems while conveying a message to a novice user. This problem of miscommunication can be overcome by converting the SMS into Standard format. It is easier to understand a SMS when it is in Standard format. This work focuses on converting the SMS language to Standard Normal text and then to Marathi.

Methodology:-
1. Take the input as the SMS received.
2. Then trace all the SMS slangs that are present in the text message. In this step all the words that are not in their standard format are found. Here we use the tentative database that is created by the survey for collection of words done.
3. After tracing out the slang words we then find their proper complete word from the database. But if this word is no found we forward it as it is.
For this step we have formed a clustered database that contains all the probable entries of the words.
4. If the word is found then the word is replaced by its root word.
5. Next we apply transliteration to the complete SMS text that is converted to normal form. Thus we get the output in Devanagari. This transliteration is done using the concept of internationalization from Java. The final output is in this format.

Additional advantages and modification will readily occur to those skilled in art. Therefore, the invention in its broader aspect is not limited to specific details and representative embodiments shown and described herein. Accordingly various modifications may be made without departing from the spirit or scope of the general invention concept as defined by the appended claims and their equivalents.

Adv. Swapnil Gawande (IN/PA/1587)

CLAIMS
We claim:-

1. A system and methodology for English to Marathi Transliteration which converts the short hands short SMS language words to the proper words that can be understood by everyone by converting the SMS language to Standard Normal text and then to Marathi wherein first SMS language is translated to words to the normal language and then these words are transliterated to Marathi.

2. In the methodology as claimed in claim 1:
a) firstly the input is taken as the SMS received;
b) then all the SMS slangs that are present in the text message are traced;
c) in this step all the words that are not in their standard format are found;
d) Here we use the tentative database that is created by the survey for collection of words done;
e) After tracing out the slang words we find their proper complete word from the database;
f) But if this word is no found we forward it as it is;
g) If the word is found then the word is replaced by its root word;
h) Next we apply transliteration to the complete SMS text that is converted to normal form.

Adv. Swapnil Gawande (IN/PA/1587)

G. H. Raisoni College of engineering. Sheet 1/1

Figure 1

Adv. Swapnil Gawande (IN/PA/1587)

ABSTRACT

SMS has provided us with a capacity of conversing with each other in minimal number of words. This cutting short of words has created new short slangs and a new SMS lingo. This has reduced efforts and time consumed for typing complete words also it has cut cost. But due to improper standardization of such slang the message are not properly understood by everyone. It is easier for a person to understand the messages or text if it is in his/ her native language script. This work emphasizes on the conversion of SMS slangs to meaningful and proper text messages. This can be achieved by the conversion of SMS slangs to proper standard word and then transliteration for the word to Marathi. The work mainly focuses only on the Marathi language but could be further used for other languages also. Following invention is described in detail with the help of Figure 1 of sheet 1 showing the flow diagram for the proposed system.

Documents

Application Documents

# Name Date
1 Power of Attorney [08-03-2016(online)].pdf 2016-03-08
2 Form 5 [08-03-2016(online)].pdf 2016-03-08
3 Form 3 [08-03-2016(online)].pdf 2016-03-08
4 Drawing [08-03-2016(online)].pdf 2016-03-08
5 Description(Complete) [08-03-2016(online)].pdf 2016-03-08
6 201621007971-FORM 18 [28-01-2018(online)].pdf 2018-01-28
7 201621007971-Power of Attorney-300916.pdf 2018-08-11
8 201621007971-OTHERS-300916.pdf 2018-08-11
9 201621007971-Form 5-300916.pdf 2018-08-11
10 201621007971-Form 3-300916.pdf 2018-08-11
11 201621007971-Correspondence-300916.pdf 2018-08-11
12 201621007971-FER_SER_REPLY [03-12-2020(online)].pdf 2020-12-03
13 201621007971-COMPLETE SPECIFICATION [03-12-2020(online)].pdf 2020-12-03
14 201621007971-FER.pdf 2021-10-18
15 201621007971-US(14)-HearingNotice-(HearingDate-06-12-2023).pdf 2023-11-15
16 201621007971-Proof of Right [05-12-2023(online)].pdf 2023-12-05
17 201621007971-FORM-8 [05-12-2023(online)].pdf 2023-12-05
18 201621007971-FORM-26 [05-12-2023(online)].pdf 2023-12-05
19 201621007971-FORM 3 [05-12-2023(online)].pdf 2023-12-05
20 201621007971-ENDORSEMENT BY INVENTORS [05-12-2023(online)].pdf 2023-12-05
21 201621007971-Correspondence to notify the Controller [05-12-2023(online)].pdf 2023-12-05
22 201621007971-US(14)-ExtendedHearingNotice-(HearingDate-07-12-2023).pdf 2023-12-06
23 201621007971-Correspondence to notify the Controller [07-12-2023(online)].pdf 2023-12-07
24 201621007971-Written submissions and relevant documents [22-12-2023(online)].pdf 2023-12-22

Search Strategy

1 2021-03-0814-47-51AE_08-03-2021.pdf
2 2020-08-2414-19-26E_24-08-2020.pdf