
Voiceprint and Speaker Management

The voiceprint library is the core feature that lets Owl Meeting know who is speaking. Once voice samples have been recorded for each person, the system can automatically identify speakers and label them by name during file transcription, and can even use the most suitable recognition model for each person.

Last Updated: 2026-04-21 · Document Language: English

1. Adding a Speaker

  1. Open [Speakers] from the left toolbar.
  2. Click [Add Speaker] and fill in the name (required) and remarks (optional).
  3. Assign a recognition model to the speaker. When [Smart] is turned on in File Transcription, the system automatically uses the model specified here to recognize this speaker's voice.
  Figure: Voiceprint Library Management Interface
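
The fields and the [Smart] behavior described in the steps above can be pictured with a small sketch. This is an illustration only, not Owl Meeting's actual data model: the names Speaker, pick_model, and default_model are hypothetical, and falling back to a default model when [Smart] is off or no model is assigned is an assumption.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Speaker:
    # Hypothetical record mirroring the [Add Speaker] form fields.
    name: str                    # required
    remarks: str = ""            # optional
    model: Optional[str] = None  # recognition model assigned to this speaker

    def __post_init__(self) -> None:
        # The name is the only required field in the form.
        if not self.name.strip():
            raise ValueError("speaker name is required")


def pick_model(speaker: Speaker, smart_enabled: bool, default_model: str) -> str:
    """Return the model to use for this speaker's voice.

    Assumption: when [Smart] is off, or the speaker has no assigned model,
    a general default model is used instead.
    """
    if smart_enabled and speaker.model:
        return speaker.model
    return default_model
```

For example, pick_model(Speaker(name="Alice", model="model-a"), True, "general") returns "model-a", while the same call with smart_enabled set to False returns "general".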

2. Adding Voiceprint Samples

  1. Select a speaker and click [Add Audio].
  2. Select an audio file in which the speaker's voice is clearly audible.
  3. Set the start/end time in the cropping window, then click audition to preview the clip and confirm.
  4. Select the [Voiceprint Language]: choose [Chinese] for Chinese samples and [English] for English samples. For other languages, choose based on the language family.
  5. Click save. The system automatically extracts the voiceprint features and associates them with the speaker.
  Figure: Voiceprint Sample Addition and Cropping

Best Practices for Sample Collection

3. Daily Maintenance

4. How Voiceprint Library Takes Effect in Transcription

The voiceprint library mainly takes effect in Offline File Transcription. For the transcription results to display speaker names automatically, all of the following conditions must be met:

  1. Select [Speaker] as the segmentation method.
  2. Turn on the [Identity Mark] switch.
  3. The [Voiceprint Language] in the File Transcription settings matches the language selected when the samples were added.

Once these conditions are met, the speaker tags in the recognition result are automatically replaced with the real names entered in the voiceprint library.
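
The three conditions combine as a simple logical AND, which the following sketch makes explicit. It is a hypothetical illustration rather than Owl Meeting's implementation: TranscriptionSettings, its field names, and the string values such as "speaker" and "english" are assumed for the example.

```python
from dataclasses import dataclass


@dataclass
class TranscriptionSettings:
    # Hypothetical settings object; field names are illustrative only.
    segmentation: str         # segmentation method, e.g. "speaker"
    identity_mark: bool       # state of the [Identity Mark] switch
    voiceprint_language: str  # e.g. "chinese" or "english"


def names_will_be_shown(settings: TranscriptionSettings, sample_language: str) -> bool:
    """Return True only when all three conditions hold at the same time."""
    return (
        settings.segmentation == "speaker"
        and settings.identity_mark
        and settings.voiceprint_language == sample_language
    )
```

If any one of the three checks fails, for example samples added as [Chinese] while the transcription is set to [English], the function returns False, which corresponds to speaker names not being filled in.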

5. FAQ and Troubleshooting

Suggestion: When setting up the library, add 1-2 clear voice samples for each regular participant. Once the voiceprint library is in place, all subsequent file transcriptions can identify speakers automatically without repeated configuration.