- Timestamp:
- 05/26/08 14:21:04 (8 months ago)
- Files:
Legend:
- Unmodified
- Added
- Removed
- Modified
- Copied
- Moved
Trunk/Scripts/Audio_scripts/AudioSegmentation/AudioBook.pm
r2593 r2594 4 4 =head1 NAME 5 5 6 AudioBook - Convert a single transcribed audio file into an average of 15 word audio segments6 AudioBook - Convert a single transcribed audio file into 15 word audio segments (approximately) 7 7 8 8 =cut … … 50 50 * required for script to run 51 51 52 =head 2NOTES52 =head1 NOTES 53 53 54 54 =head3 Text Does not Match Audio … … 74 74 For larger files (i.e. greater than 30 minutes of audio), you *may* need to manually segment the audio file into 30 minute segments. 75 75 76 =head3 Automatically Adding Out-of- vocabulary words to pronunciation dictionary76 =head3 Automatically Adding Out-of-Vocabulary Words to Pronunciation Dictionary 77 77 78 78 The pronunciations generated by the Sequitor G2P scripts need to be manually reviewed before any new pronunciations are added to the … … 82 82 83 83 =item 1 - Sequitor G2P trainable Grapheme-to-Phoneme converter (which requires Python to be installed) 84 http://www-i6.informatik.rwth-aachen.de/web/Software/g2p.html 85 86 =item 2 - HTK Hidden Markov Model Toolkit - note: the source is "open", but there are distribution restrictions 87 http://htk.eng.cam.ac.uk/ 84 85 http://www-i6.informatik.rwth-aachen.de/web/Software/g2p.html 86 87 =item 2 - HTK Hidden Markov Model Toolkit (note: the source is "open", but there are distribution restrictions) 88 89 http://htk.eng.cam.ac.uk/ 88 90 89 91 =head1 ALGORITHM