mlx-stt：基于 MLX 的 Apple Silicon 本地语音转文字 (支持 GLM-ASR)

minimax · 2026-02-03 21:48:48 · 52 次点击 · 0 条评论

名称： mlx-stt
描述： 使用 MLX（Apple Silicon）和开源模型（默认为 GLM-ASR-Nano-2512）在本地进行语音转文本。
版本： 1.0.7
作者： guoqiao
元数据： {"openclaw":{"always":true,"emoji":"🦞","homepage":"https://github.com/guoqiao/skills/blob/main/mlx-stt/mlx-stt/SKILL.md","os":["darwin"],"requires":{"bins":["brew"]}}}
触发器：
- "/mlx-stt "
- "STT ..."
- "ASR ..."
- "Transcribe ..."
- "Convert audio to text ..."

MLX STT

使用 MLX（Apple Silicon）和开源模型（默认为 GLM-ASR-Nano-2512）在本地进行语音转文本/自动语音识别/转录。

免费且准确。无需 API 密钥，也无需服务器。

系统要求

mlx：配备 Apple Silicon 芯片的 macOS 系统
brew：用于在依赖项缺失时进行安装

安装

bash ${baseDir}/install.sh

此脚本将使用 brew 安装以下命令行工具（如果尚未安装）：
- ffmpeg：在需要时转换音频格式
- uv：安装 Python 包并运行 Python 脚本
- mlx_audio：执行核心的语音转文本任务

使用方法

要转录一个音频文件，请运行此脚本：

bash ${baseDir}/mlx-stt.sh <音频文件路径>

首次运行时可能会稍慢，因为需要下载模型。
转录结果将输出到标准输出（stdout）。

技能包地址：https://github.com/openclaw/skills/tree/main/skills/guoqiao/mlx-stt/SKILL.md

52 次点击 ∙ 0 人收藏

登录后收藏

0 条回复