Sign In to Follow Application
View All Documents & Correspondence

Generating Large Units Of Graphonemes With Mutual Information Criterion For Letter To Sound Conversion

Abstract: A method and apparatus are provided for segmenting works into into component parts. Under the invention, mutual information scores for pairs of graphoneme units found in a set of words are determined. Each graphoneme unit includes at least one letter. The graphoneme unit includes at least one letter. The graphoneme units are combined based on the mutual information score. This forms a new graphoneme unit. Under one aspect of the invention, a syllable n-gram model is trained based on words that have been segmented into syllables using mutual information. The syllable n-gram model is used to segment a phonetic representation of a new word into syllables. Similarly, an inventory of morphemes is formed using mutual information and a morpheme n-gram is trained that can be used to segment a new word into a sequence of morphemes.

Get Free WhatsApp Updates!
Notices, Deadlines & Correspondence

Patent Information

Application #
Filing Date
07 March 2005
Publication Number
52/2006
Publication Type
INA
Invention Field
COMPUTER SCIENCE
Status
Email
Parent Application

Applicants

MICROSOFT CORPORATION
BUSINESS AT ONE MICROSOFT WAY, REDMOND, WASHINGTON 98052,UNITED SATATES OF AMERICA,

Inventors

1. LI JIANG
ONE MICROSOFT WAY, REDMOND, WA 98052, UNITED SATES OF AMERICA
2. MEI-YUH HWANG
ONE MICROSOFT WAY, REDMOND, WA 98052, UNITED SATES OF AMERICA

Specification

Documents