Sample passages can be used to generate a good training file.
This will eventually affect the creation of repository.
A good training file should be long and phonetically balanced.
You may use open literature to generate the training file.
Such literature is available at Project Gutenberg.