Abstract: A method of training a capitalization model for automatically capitalizing text, the method comprising: collecting training documents that meet limitations that require the training documents to be associated with a particular user; and using the collected training documents to train the capitalization model.