I feel like Descript doesn't work well when you're editing footage with other noises or the video isn't talking based. eg. for a vlog, there are talking sections and non-talking sections where Descript wouldn't be helpful for editing
i think there is overlap with descript, however i do think descript is transcript-first, and they focus on that level of control. I think video should be edited with a timeline as the core driver. I'm sure the right answer is somewhere in the middle, but I think having a robust timeline is important for video editing