email from David Gelbart
adding more acoustic model parameters
The general rule I have seen with ASR systems is that, as the amount of training data increases, it eventually becomes necessary to add more acoustic model parameters in order to get the full benefit of the additional data. On the other hand, using too many acoustic model parameters may cause overfitting (in other words, the system starts modeling quirks of the training data to the point where the system's performance on non-training data is worsened).
Thus, you may need to periodically tune the number of acoustic model parameters you are using. I suppose the easiest way to do this is to create a test set which does not overlap with the training set, and measure word recognition accuracy on the test set for various acoustic model sizes.
use more Gaussians in the Gaussian mixtures
One way to increase the number of parameters is to use more Gaussians in the Gaussian mixtures. (One way to do this in HTK is to add one or more additional mixup stages. This has the advantage that you can use your test set to compare recognition accuracy before and after the mixup, so that you can obtain your recognition accuracy numbers without having to retrain a system from scratch each time.)
move from monophones to triphones
Another way to increase the number of parameters is to move from monophones to triphones (unless you are using triphones already).
reduce the amount of state-tying
Another way is to reduce the amount of state-tying.