Improve punctuation so it follows the rules of grammar
Shannon Wedge
I would say at least half of the edits I make to my transcripts are fixes for issues that Descript itself introduces, because it does bizarre things with punctuation and randomly capitalizes words like "you," "somebody," "some," and "that" after commas.
Can you stop it from adding a comma every time a presenter pauses to gather their thoughts? I suspect that's why we see extra commas, and commas in the wrong places, like after an "and," "or," or "of" instead of before one. Regardless of whether that's the cause, grammatically those commas need to come before those words, not after them.
A bigger issue is that it throws in commas and periods in places that would never have them. There is zero reason there should ever be a period after the words "the," "and," or "or," because humans don't end sentences with these words. Yet the transcripts are littered with them, and each one needs to be removed.
There are only a few cases where a comma or period belongs after the word "to," and there should never be one after "a," yet transcripts have tons of both. You also can't fix them with search and replace, since you decided to only allow the whole-word filter on words of 3 letters/characters or longer! Yes, I can search for them without selecting the match-whole-word filter, but when I'm looking to replace "a," it also picks up every "trauma," which comes up quite often considering the topic of our lectures.
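For anyone stuck with the same cleanup in the meantime, here is a rough sketch of what a whole-word match on short words could look like outside Descript. It assumes the transcript has been exported as plain text, and the function name `strip_punct_after_word` is just something I made up for illustration. It removes every comma or period after the given word, so it is too blunt for "to," where a few cases are legitimate.

```python
import re

def strip_punct_after_word(text: str, word: str) -> str:
    """Remove a comma or period that directly follows a standalone word.

    Hypothetical helper, not a Descript feature. The \\b word boundary
    means "a," matches only the word "a", not the end of "trauma,".
    """
    pattern = re.compile(rf"\b({re.escape(word)})[,.](?=\s)", re.IGNORECASE)
    return pattern.sub(r"\1", text)

sample = "She had a, history of trauma, and a. plan"
cleaned = strip_punct_after_word(sample, "a")
```

The point is only that a word-boundary match works fine on one-letter words, so the 3-character minimum on the whole-word filter doesn't seem like a technical necessity.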
Also, why does it separate prefixes from their words? We don't write about "re experiencing" things or "non compliance" or "co workers," but Descript renders prefixes that way, and all of those need to be fixed too. It also insists on adding a space after the period in numbers with decimals, and both a comma and a space in all numbers in the thousands or higher, which is not how humans write either.
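These patterns are regular enough that they could be sketched as a few substitutions. The following is a minimal example under my own assumptions, not anything Descript provides: the transcript is plain text, the prefix list is only the three examples above, prefixes are rejoined with a hyphen (some style guides would close them up instead), and the helper names are hypothetical. The decimal rule would also wrongly join a sentence that ends in a digit and is followed by one starting with a digit, so the output still needs a human pass.

```python
import re

# Illustrative prefix list only; a real one would need to be longer.
PREFIXES = ("re", "non", "co")

def fix_prefixes(text: str) -> str:
    # "co workers" -> "co-workers" (hyphen is an assumed style choice)
    alternation = "|".join(PREFIXES)
    return re.sub(rf"\b({alternation}) (?=[a-z])", r"\1-", text)

def fix_decimals(text: str) -> str:
    # "2. 5" -> "2.5": digit, period, whitespace, digit
    return re.sub(r"(?<=\d)\.\s+(?=\d)", ".", text)

def fix_thousands(text: str) -> str:
    # "1, 200" -> "1,200": only when exactly three digits follow
    return re.sub(r"(?<=\d),\s+(?=\d{3}\b)", ",", text)

def clean_transcript(text: str) -> str:
    return fix_thousands(fix_decimals(fix_prefixes(text)))

sample = "a dose of 2. 5 mg for 1, 200 co workers"
cleaned = clean_transcript(sample)
```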
More recently, within the past couple of months, a bunch of periods have been showing up in inappropriate places in the middle of sentences (not immediately before or after a pause, mind you, but within fluidly spoken sentences). Even Descript seems to realize they are not correct, because it doesn't capitalize the word after these periods. Ironically, that makes them harder to spot, given that the marker indicating a pause between words looks somewhat similar to a period.
I've been using this program for 3+ years, and it's frustrating how little attention transcript issues like these still get. The AI features are great, but maybe you could get AI to fix the punctuation issues while you're at it?