sole
Molten Core Raider
- 339
- 1,204
Thanks for the .zip, working great. As far as corner cases, ultimately I'd like to have control over them during transcription. Using your example, "Farewell cruel world..." I would like to have a tag marker for the automated count, e.g., "Farewell cruel world...[sentence]" where I deliberately manipulate the transcription to get a more accurate automated response. Ideally, [sentence] would just be a character not found in written world languages, like, "d??" ... "Farewell cruel world...d??". In this case the program would know to associate that symbol with a sentence break. Words are a lot trickier... do you count all characters? do you omit spaces? what if there are ~8 spaces in a row... (this is where the project would become very involved). I have to note every minute detail in this regard, but I'm extremely curious if a simple word/sentence counting/averaging program would aid in judging initial veracity.
For now I think words separated by a space, and a "d??" to separate sentences would be a great way to test out what, if any, correlations there are between automated and manual counting.
Thanks again Tuco, can't wait to test it out.
For now I think words separated by a space, and a "d??" to separate sentences would be a great way to test out what, if any, correlations there are between automated and manual counting.
Thanks again Tuco, can't wait to test it out.