Yeah it's weird. I have these in depth technical conversations with every guest where it's great, I love this part. The frequency of verbal ticks and filler content really takes an edit from "this isn't too bad" to "what the fuck am I doing with my life?" all based on how many times you need to remove filler content within the first 5 minutes of editing a 90 minute show.
I'm kind of surprised that wedding producer openly said that. My philosophy has always been the opposite. One of my main goals of the show is to make the guest walk away thinking this was the best podcast experience they ever had from start to finish as well as do everything I can to make them come off as good as possible.
I rarely cut content but most episodes have hundreds of manual edits to remove filler content and create a more concise flow by removing long pauses because my 2nd main goal is to optimize for the listener. I keep the edits organic at the same time by leaving in some filler content and subtle things like a deep inhale or a sigh because there's a lot of meaning around that when it comes to sentiment and tone, the same can be said for sometimes leaving in an extra 500ms pause to amplify the meaning behind something. At the same time, sometimes filler content gets left in because it flowed too quickly into the next word so cutting it sounds too unnatural as if it clipped.
This is why I think it's a crazy hard problem to get a machine to be able to make decisions like this.
I do use separate recordings (we each record our track locally), it definitely helps eliminate the few cases where we talk over each other or being able to lower the volume of a laugh so it doesn't overpower what the other person said while still keeping it in because it's a good part of a conversation and a snort or laugh can easily be the difference between a listener wondering if the guest was offended or happily agreeing with something.
I'm kind of surprised that wedding producer openly said that. My philosophy has always been the opposite. One of my main goals of the show is to make the guest walk away thinking this was the best podcast experience they ever had from start to finish as well as do everything I can to make them come off as good as possible.
I rarely cut content but most episodes have hundreds of manual edits to remove filler content and create a more concise flow by removing long pauses because my 2nd main goal is to optimize for the listener. I keep the edits organic at the same time by leaving in some filler content and subtle things like a deep inhale or a sigh because there's a lot of meaning around that when it comes to sentiment and tone, the same can be said for sometimes leaving in an extra 500ms pause to amplify the meaning behind something. At the same time, sometimes filler content gets left in because it flowed too quickly into the next word so cutting it sounds too unnatural as if it clipped.
This is why I think it's a crazy hard problem to get a machine to be able to make decisions like this.
I do use separate recordings (we each record our track locally), it definitely helps eliminate the few cases where we talk over each other or being able to lower the volume of a laugh so it doesn't overpower what the other person said while still keeping it in because it's a good part of a conversation and a snort or laugh can easily be the difference between a listener wondering if the guest was offended or happily agreeing with something.