After finishing converting the paragraph text files into sentence text files, I moved on to work on the audio files. This was a bit trickier
than the previous task, because trimming audio files is a multi-step process. The command I am using to trim audio is:
sox source.wav destination.wav trim start duration
The '.wav' files are easy to specify, but the timings are a little more complicated. They are extracted from a '.wrd' file, which corresponds to the 'paragraph.txt' file. I am modifying a script that performed a similar task, to iterate through the word and text files, to check that they match up. Once everything is aligned, I'll access the timings of the words following a period. I'll then use the sox command to trim accordingly, and will add padding where necessary.
than the previous task, because trimming audio files is a multi-step process. The command I am using to trim audio is:
sox source.wav destination.wav trim start duration
The '.wav' files are easy to specify, but the timings are a little more complicated. They are extracted from a '.wrd' file, which corresponds to the 'paragraph.txt' file. I am modifying a script that performed a similar task, to iterate through the word and text files, to check that they match up. Once everything is aligned, I'll access the timings of the words following a period. I'll then use the sox command to trim accordingly, and will add padding where necessary.