mlx-stt
// Speech-To-Text with MLX (Apple Silicon) and opensource models (default GLM-ASR-Nano-2512) locally.
$ git log --oneline --stat
stars:1,933
forks:367
updated:March 4, 2026
SKILL.mdreadonly
SKILL.md Frontmatter
namemlx-stt
descriptionSpeech-To-Text with MLX (Apple Silicon) and opensource models (default GLM-ASR-Nano-2512) locally.
version1.0.7
authorguoqiao
metadata[object Object]
triggers/mlx-stt <audio>,STT ...,ASR ...,Transcribe ...,Convert audio to text ...
MLX STT
Speech-To-Text/ASR/Transcribe with MLX (Apple Silicon) and opensource models (default GLM-ASR-Nano-2512) locally.
Free and Accurate. No api key required. No server required.
Requirements
mlx: macOS with Apple Siliconbrew: used to install deps if not available
Installation
bash ${baseDir}/install.sh
This script will use brew to install these cli tools if not available:
ffmpeg: convert audio format when neededuv: install python package and run python scriptmlx_audio: do the real job
Usage
To transcribe an audio file, run this script:
bash ${baseDir}/mlx-stt.sh <audio_file_path>
- First run could be a little slow, since it will need to download model.
- The transcript result will be printed to stdout.