Identifying hate speech in audio or video files
Our Content Moderation model can help you ensure that your content is safe and appropriate for all audiences.
The model pinpoints sensitive discussions in spoken data and reports how severe each one is.
In this guide, we'll learn how to use the Content Moderation model, and look at an example response to understand its structure.
Before we begin, make sure you have an AssemblyAI account and an API key. You can sign up for a free account and get your API key from your dashboard.
The complete source code for this guide can be viewed here.
Create a new file and import the necessary libraries for making an HTTP request.
Set up the API endpoint and headers. The headers should include your API token.
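A minimal sketch of this step might look like the following. The base URL and the authorization header name are assumptions about the AssemblyAI REST API and should be checked against the API reference. The ASSEMBLYAI_API_KEY environment variable and the build_headers helper are hypothetical names chosen for illustration.

```python
import os

# Base URL of the AssemblyAI REST API (assumed here; confirm in the API reference).
BASE_URL = "https://api.assemblyai.com/v2"


def build_headers(api_token):
    """Return the headers sent with every JSON request (hypothetical helper)."""
    return {
        "authorization": api_token,
        "content-type": "application/json",
    }


# Read the token from the environment rather than hard-coding it.
# ASSEMBLYAI_API_KEY is a hypothetical variable name chosen for this sketch.
headers = build_headers(os.environ.get("ASSEMBLYAI_API_KEY", "YOUR_API_TOKEN"))
```

Keeping the token out of the source file makes it less likely to end up in version control.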
Upload your local file to the AssemblyAI API.
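The upload step could be sketched as below. This version uses urllib from the Python standard library so that it runs without extra installs; the guide's own source may use a different HTTP library. The /v2/upload path and the upload_url response key are assumptions about the API, and build_upload_request and upload_file are hypothetical helper names.

```python
import json
import urllib.request

UPLOAD_ENDPOINT = "https://api.assemblyai.com/v2/upload"  # assumed endpoint path


def build_upload_request(path, api_token):
    """Build (but do not send) a POST request carrying the raw file bytes."""
    with open(path, "rb") as audio_file:
        data = audio_file.read()
    return urllib.request.Request(
        UPLOAD_ENDPOINT,
        data=data,
        headers={
            "authorization": api_token,
            "content-type": "application/octet-stream",
        },
        method="POST",
    )


def upload_file(path, api_token):
    """Send the upload request and return the upload_url from the JSON reply."""
    request = build_upload_request(path, api_token)
    with urllib.request.urlopen(request) as response:
        return json.load(response)["upload_url"]
```

Splitting request construction from sending makes it possible to inspect the request without touching the network. Note that this sketch reads the whole file into memory, which may not suit very large recordings.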
Use the upload_url returned by the AssemblyAI API to create a JSON payload containing the audio_url parameter and the content_safety parameter set to True.
Make a POST request to the AssemblyAI API endpoint with the payload and headers.
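As a rough sketch of the payload and the POST request, assuming the transcript endpoint lives at /v2/transcript and that the reply carries the transcription ID under an id key (both are assumptions, as are the helper names):

```python
import json
import urllib.request

TRANSCRIPT_ENDPOINT = "https://api.assemblyai.com/v2/transcript"  # assumed endpoint path


def build_transcript_request(upload_url, api_token):
    """Build the POST request that asks for a transcript with content safety on."""
    payload = {
        "audio_url": upload_url,   # URL returned by the upload step
        "content_safety": True,    # enables the Content Moderation model
    }
    return urllib.request.Request(
        TRANSCRIPT_ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "authorization": api_token,
            "content-type": "application/json",
        },
        method="POST",
    )


def submit_transcript(upload_url, api_token):
    """Send the request and return the transcription ID (assumed key: id)."""
    request = build_transcript_request(upload_url, api_token)
    with urllib.request.urlopen(request) as response:
        return json.load(response)["id"]
```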
After making the request, you'll receive an ID for the transcription. Use it to poll the API every few seconds to check the status of the transcript job. Once the status is completed, you can retrieve the transcript from the API response, using the content_safety_labels key to view the results.
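The polling loop described above can be sketched as follows. The completed status comes from this guide; the error status, the transcript URL pattern, and the names wait_for_transcript and make_fetcher are assumptions made for illustration. The fetch function and the sleep function are passed in so the loop can be exercised without network access.

```python
import json
import time
import urllib.request


def make_fetcher(transcript_id, api_token):
    """Return a function that GETs the transcript JSON (URL pattern assumed)."""
    url = f"https://api.assemblyai.com/v2/transcript/{transcript_id}"

    def fetch():
        request = urllib.request.Request(url, headers={"authorization": api_token})
        with urllib.request.urlopen(request) as response:
            return json.load(response)

    return fetch


def wait_for_transcript(fetch_transcript, interval_seconds=3.0,
                        max_attempts=100, sleep=time.sleep):
    """Poll until the job is completed, fails, or the attempt budget runs out."""
    for _ in range(max_attempts):
        transcript = fetch_transcript()
        status = transcript.get("status")
        if status == "completed":
            return transcript
        if status == "error":  # assumed failure status
            raise RuntimeError(f"Transcription failed: {transcript.get('error')}")
        sleep(interval_seconds)  # wait a few seconds before asking again
    raise TimeoutError(f"Transcript not ready after {max_attempts} attempts")
```

A call such as wait_for_transcript(make_fetcher(transcript_id, api_token)) would then return the finished transcript, whose content_safety_labels key holds the moderation results. The attempt limit is a design choice in this sketch so that a stuck job cannot loop forever.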
Understanding the response
In the JSON response, there'll be an additional key called content_safety_labels that contains information about any sensitive content detected. The full text is contained in the text key, and each problematic utterance has its own timestamp. The entire audio is assigned a summary and a severity_score_summary for each category of unsafe content. Each label is returned with a confidence score and a severity score.
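To make the structure concrete, here is a small sketch that walks a response and collects the flagged utterances. The keys content_safety_labels, text, timestamp, summary and severity_score_summary come from the description above. The nested names (results, labels, label, confidence, severity, start, end, and the low/medium/high buckets) are assumptions about the exact layout, and the sample values are invented purely to exercise the function.

```python
def flagged_segments(transcript, min_confidence=0.5):
    """Collect one entry per label whose confidence meets the threshold."""
    safety = transcript.get("content_safety_labels") or {}
    segments = []
    for result in safety.get("results", []):  # assumed: one item per flagged utterance
        timestamp = result.get("timestamp", {})
        for label in result.get("labels", []):
            if label.get("confidence", 0.0) >= min_confidence:
                segments.append({
                    "label": label.get("label"),
                    "confidence": label.get("confidence"),
                    "severity": label.get("severity"),
                    "start": timestamp.get("start"),
                    "end": timestamp.get("end"),
                    "text": result.get("text"),
                })
    return segments


# Hypothetical response; every value below is invented for illustration.
sample_transcript = {
    "status": "completed",
    "text": "Example transcript text.",
    "content_safety_labels": {
        "results": [
            {
                "text": "Example transcript text.",
                "labels": [
                    {"label": "hate_speech", "confidence": 0.9, "severity": 0.7},
                    {"label": "profanity", "confidence": 0.2, "severity": 0.1},
                ],
                "timestamp": {"start": 250, "end": 4100},
            }
        ],
        "summary": {"hate_speech": 0.9},
        "severity_score_summary": {
            "hate_speech": {"low": 0.1, "medium": 0.2, "high": 0.7}
        },
    },
}

for segment in flagged_segments(sample_transcript):
    print(segment["label"], segment["severity"], segment["start"], segment["end"])
```

The min_confidence threshold is a choice made for this sketch, not part of the API; tune it to how cautious your moderation workflow needs to be.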