The internet is saturated with audio and video content. There are over 720,000 hours of new videos uploaded to YouTube alone every day! In many situations, this content may contain abusive language like hate speech, or excessive profanity, which could lead to a serious breach in brand safety, user trust, regulation, or even law.
To moderate audio and video content on the internet today, large teams of people are required to manually go through this content in order to flag anything that might be abusive or in violation of a platform’s policies. For example, Facebook employs tens of thousands of people to manually review posts to their platform, and to flag those posts that include hate speech.
Today, AssemblyAI is excited to help solve this problem with a brand new feature, Content Safety Detection, which is now globally available (GA). With Content-Safety Detection, transcriptions can be automatically classified with over 17 labels such as "hate speech," "profanity," "NSFW," and "pornography." The full list of labels can be found in the API documentation. This feature is backed by the same state of the art deep learning research our team applies to our top rated Speech-to-Text API.
Any developer can now easily, and automatically, find sensitive content in audio/video files, without the need for any human in the loop.
At a glance
- Transcribe and classify audio/video that includes Hate Speech, NSFW content, etc. with a single API call
- See exactly where in the transcription text potentially unsafe content was found, along with the timestamp for where the flagged content occurred in the source audio or video file
- Powered by the latest Deep Learning research, not traditional black-lists of words
Example use cases
- Video and Media platforms - adding content warnings to controversial content
- Call Centers - flagging calls that contain hate speech or profanity
- Community Trust & Safety - automatically moderating "high risk" user generated content
- Brand Safety - provide safer options for brands to advertise on a platform
How does Content Safety Detection work?
When the Content Safety Detection feature is enabled, the API will automatically classify your transcription text with one or more labels, such as "NSFW," "profanity," or "hate speech," and will include this information in the API's JSON response.
In the API response, specific sections of a transcription that are flagged by our Content Safety model will be shown. Also within this response is the exact timestamp the flagged text was spoken in the audio file, as well as the confidence score for the Content Safety label that was detected.
The JSON below is an example response from the API for a TED talk on global warming:
As you can see above, the API has flagged a section of the transcription as “disasters” (the label for Natural Disasters) where wildfires caused by global warming are talked about.
You'll also see that the API offers an overall “summary” score for the entire transcription text. This helps show how relevant each predicted label is in reference to the entire transcription text.
For example; let’s say a 45-minute audio file contains just one profanity. While the confidence of the “profanity” label for that word might be 99%, that word may be just one in 50,000 words. Therefore, the confidence score for “profanity” in the summary section of the JSON response would be quite low. The summary scores look at a combination of the frequency and confidence of each predicted label across the entire transcription.
Getting started with Content Safety Detection
Content Safety Detection is now available for all developers simply by adding a flag to your usual transcription request to the `/v2/transcript` API endpoint.
In return, the API will send back a JSON response structured similarly to the response below.
AssemblyAI is very excited to release Content Safety Detection as globally available! We would love to hear your thoughts and ideas on any of the awesome applications that you are building with the AssemblyAI API!