首页/语音与转录

语音与转录

45 个技能

Speech & Transcription 相关技能集合

通用

addis-assistant-stt

Provides Speech-to-Text (STT) and text.

agent-voice

Command-line blogging platform for AI agents.

akaunting

Interact with Akaunting open-source accounting software via REST API.

alexa-cli

Control Amazon Alexa devices and smart home via the `alexacli` CLI.

announcer

Announce text throughout the house via AirPlay speakers using Airfoil +.

assemblyai-transcribe

Transcribe audio/video with AssemblyAI.

audio-gen

Generate audiobooks, podcasts, or educational audio content.

audio-reply

Generate audio replies using TTS.

auto-whisper-safe

RAM-safe voice transcription with auto-chunking — works on 16GB machines without crashes.

brw-de-ai-ify

Remove AI-generated jargon and restore human voice to text.

chichi-speech

A RESTful service for high-quality text-to-speech using Qwen3.

claw-voice

You are connected to a live user session via voice.

clonev

Clone any voice and generate speech using Coqui XTTS v2.

critical-article-writer

Generate draft articles, outlines.

cult-of-carcinization

Give your agent a voice — and ears.

deepdub-tts

Generate speech audio using Deepdub and attach it as a MEDIA.

deepgram

— command-line interface for Deepgram speech-to-text.

dellight-cro-revenue-ops

DELLIGHT.AI is an AI startup in DIFC, Dubai.

documents-ai

Real-time OCR and data extraction API by Veryfi.

doubao-api-open-tts

Text-to-Speech service using Doubao (Volcano Engine)

duby

Convert text to speech using Duby.so API.

eachlabs-voice-audio

TTS, STT, voice conversion using ElevenLabs, Whisper, RVC.

easyverein-api

Work with the easyVerein v2.0 REST API.

elevenlabs-agents

Create, manage, and deploy ElevenLabs.

elevenlabs-media

ElevenLabs music generation.

elevenlabs-transcribe

Transcribe audio to text using ElevenLabs.

elevenlabs-tts

ElevenLabs TTS - the best ElevenLabs integration for OpenClaw.

elevenlabs-voices

High-quality voice synthesis with 18 personas, 32.

eternal-haven-lore-pack

Eternal Haven Chronicles lore + mythic persona pack.

faster-whisper

Local speech-to-text using faster-whisper.

feishu-minutes

Fetch info, stats, transcript, and media from Feishu.

freshbooks-cli

FreshBooks CLI for managing invoices, clients, and billing.

gettr-transcribe-summarize

Download audio from a GETTR post.

hebrew-nikud

Hebrew nikud (vowel points) reference for AI agents.

her-voice

Give your agent a voice.

inworld-tts

Text-to-speech via Inworld.ai API.

jarvis-voice

Metallic AI voice persona with TTS and visual transcript styling.

kokoro-tts

Generate spoken audio from text using the local Kokoro TTS engine.

lnbits

Manage LNbits Lightning Wallet (Balance, Pay, Invoice)

lnbits-with-qrcode

Manage LNbits Lightning Wallet (Balance, Pay, Invoice)

miranda-sag

ElevenLabs text-to-speech with mac-style say UX.

norman-categorize-transactions

Review and categorize uncategorized bank transactions, match them with invoices, and verify bookkeeping entries.

norman-monthly-reconciliation

Perform a complete monthly financial reconciliation - review all transactions, match invoices, check outstanding.

ressemble

Text-to-Speech and Speech-to-Text integration using Resemble AI HTTP API.

siliconflow-tts-gen

Text-to-Speech using SiliconFlow API (CosyVoice2)