The Key Phrases model can accurately identify significant words and phrases in your transcription, enabling you to extract the most pertinent concepts or highlights from your audio or video file.
In the Analyzing highlights of call center recordings guide, the client uploads an audio file and configures the API request to use key phrase extraction by including the
You can explore the full JSON response here:
Understanding the response
The JSON object above contains all information about the transcription. Depending on which Models are used to analyze the audio, the attributes of this object will vary. For example, in the quickstart above we did not enable Summarization, which is reflected by the
summarization: false key-value pair in the JSON above. Had we activated Summarization, then the
summary_model key values would contain the file summary (and additional details) rather than the current
To access the Key Phrases information, we use the
The reference table below lists all relevant attributes along with their descriptions, where we've called the JSON response object
results. Object attributes are accessed via dot notation, and arbitrary array elements are denoted with
results.words[i].text refers to the
text attribute of the
i-th element of the
words array in the JSON
|boolean||Whether Key Phrases was enabled in the transcription request|
|object||The results of the Key Phrases model|
|string||Is either |
|array||A temporally-sequential array of Key Phrases|
|number||The total number of times the i-th key phrase appears in the audio file|
|number||The total relevancy to the overall audio file of this key phrase - a greater number means more relevant|
|string||The text itself of the key phrase|
|number||The starting time of the j-th appearance of the i-th key phrase|
|number||The ending time of the j-th appearance of the i-th key phrase|
Frequently Asked Questions
The Key Phrases model uses natural language processing and machine learning algorithms to analyze the frequency and distribution of words and phrases in your transcription. The algorithm identifies key phrases based on their relevancy score, which takes into account factors such as the number of times a phrase occurs, the distance between occurrences, and the overall length of the transcription.
The Key Phrases model is designed to identify important phrases and words in your transcription, whereas the Topic Detection model is designed to categorize your transcription into predefined topics. While both models use natural language processing and machine learning algorithms, they have different goals and approaches to analyzing your text.
Yes, the Key Phrases model can handle misspelled or unrecognized words to some extent. However, the accuracy of the detection may depend on the severity of the misspelling or the obscurity of the word. It's recommended to provide high-quality, relevant audio files with accurate transcriptions for the best results.
Some limitations of the Key Phrases model include its limited understanding of context, which may lead to inaccuracies in identifying the most important phrases in certain cases, such as text with heavy use of jargon or idioms. Additionally, the model assigns higher scores to words or phrases that occur more frequently in the text, which may lead to an over-representation of common words and phrases that may not be as important in the context of the text. Finally, the Key Phrases model is a general-purpose algorithm that can't be easily customized or fine-tuned for specific domains, meaning it may not perform as well for specialized texts where certain keywords or concepts may be more important than others.
To optimize the performance of the Key Phrases model, it's recommended to provide high-quality, relevant audio files with accurate transcriptions, to review and adjust the model's configuration parameters, such as the confidence threshold for key phrase detection, and to refer to the list of identified key phrases to guide the analysis. It may also be helpful to consider adding additional training data to the model or consulting with AssemblyAI support for further assistance.