The most interesting thing is imo that people often say one thing, but put a similar-sounding word or a homophone in the subtitles, and the filter seems to trust the user-supplied subtitles.
I hope nobody trains speach-to-text systems on a tiktok dataset.
I hope nobody trains speach-to-text systems on a tiktok dataset.