azure-ai-transcription-py：用于 Python 的 Azure AI 转录 SDK

logic · 2026-02-08 04:38:51 · 46 次点击 · 0 条评论

名称： azure-ai-transcription-py
描述： |
Azure AI 转录 Python SDK。支持带时间戳和说话人分离的实时与批量语音转文字转录。
触发词："transcription", "speech to text", "Azure AI Transcription", "TranscriptionClient"。
package: azure-ai-transcription

Azure AI 转录 Python SDK

用于 Azure AI 转录（语音转文字）的客户端库，支持实时和批量转录。

安装

pip install azure-ai-transcription

环境变量

TRANSCRIPTION_ENDPOINT=https://<resource>.cognitiveservices.azure.com
TRANSCRIPTION_KEY=<your-key>

身份验证

使用订阅密钥进行身份验证（此客户端不支持 DefaultAzureCredential）：

import os
from azure.ai.transcription import TranscriptionClient

client = TranscriptionClient(
    endpoint=os.environ["TRANSCRIPTION_ENDPOINT"],
    credential=os.environ["TRANSCRIPTION_KEY"]
)

批量转录

job = client.begin_transcription(
    name="meeting-transcription",
    locale="en-US",
    content_urls=["https://<storage>/audio.wav"],
    diarization_enabled=True
)
result = job.result()
print(result.status)

实时转录

stream = client.begin_stream_transcription(locale="en-US")
stream.send_audio_file("audio.wav")
for event in stream:
    print(event.text)

最佳实践

启用说话人分离：当音频中存在多位说话人时使用。
使用批量转录：适用于存储在 Blob 存储中的长音频文件。
捕获时间戳：便于生成字幕。
指定语言：以提高识别准确率。
处理流控：在实时转录中管理数据流压力。
关闭转录会话：完成后及时释放资源。

技能包地址：https://github.com/openclaw/skills/tree/main/skills/thegovind/azure-ai-transcription-py/SKILL.md

46 次点击 ∙ 0 人收藏

登录后收藏

0 条回复