Image & Video Generation

169 skills

A collection of skills related to Image & Video Generation

Designers, Content Creators

aada

Create and send fun, personality-rich promotional messages from one agent to the Moltbook audience.

ace-music

Generate AI music using ACE-Step 1.5 via ACE Music's free API.

acorn-prover

Verify and write proofs using the Acorn theorem prover for mathematical and cryptographic formalization.

adobe-automator

Universal Adobe application automation via ExtendScript bridge.

afame

Generate diverse creative illustrations via OpenAI Images API.

age-transformation

Transform faces across ages using each::sense AI.

agentchan

The anonymous imageboard built for AI agents.

agentos-mesh

Enables real-time communication between AI agents.

agents-skill-podcastifier

Turn incoming text (email/newsletter) into a short TTS podcast with chunking + ffmpeg concat.

ai-avatar-generation

Generate AI avatars from photos or text descriptions using each::sense.

ai-headshot-generation

Generate professional AI headshots from casual photos using each::sense AI.

ai-persona-engine

Build emotionally intelligent AI personas for voice and chat roleplay using actor-direction prompts instead.

ai-video-gen

End-to-end AI video generation - create videos from text.

aikek

Access AIKEK APIs for crypto/DeFi research and image generation.

aiusd

AIUSD trading and account management skill.

aiusd-skills

AIUSD trading and account management skill.

album-cover-generation

Generate professional music album covers using each::sense AI.

algorithmic-art

Creating algorithmic art using p5.js with seeded randomness.

apipick-china-phone-checker

Validate Chinese mobile phone numbers using the apipick China Phone Checker API.

art-philosophy

Auto-learns your visual language.

ascii-art-generator

Create ASCII art and text-based visualizations for artistic expression, technical diagrams, or conceptual.

atxp

Access ATXP paid API tools for web search, AI image generation, music creation, ...

beauty-generation-api

FREE AI image generation service for creating.

best-image

Best quality AI image generation (~$0.12-0.20/image)

best-image-generation

Best quality AI image generation (~$0.12-0.20/image)

bex-nano-banana-pro

Generate or edit images via Gemini 3 Pro Image on Replicate.

breeze

Interact with the Breeze yield aggregator through the x402 payment-gated HTTP API.

cad-agent

Rendering server for AI agents doing CAD work.

calorie-visualizer

Local calorie logging and visual reporting (auto-refreshes and returns report image after each log)

canva-connect

Manage Canva designs, assets, and folders via the Connect API.

canvs

Create and manipulate collaborative whiteboards and diagrams using Canvs.io tools.

captions

Extract closed captions and subtitles from YouTube videos.

catalog

Simple studio catalog (hello world)

cavas-skill

Create beautiful visual art in .png and .pdf documents using design philosophy.

chart-image

Generate publication-quality chart images from data.

chart-splat

Generate beautiful charts via the Chart Splat API.

cheapest-image

Possibly the cheapest AI image generation (~$0.0036/image)

cheapest-image-generation

Possibly the cheapest AI image generation (~$0.0036/image)

checksum

A CLI utility for generating and verifying cryptographic file checksums (MD5, SHA1, SHA256)

clinkding

Manage linkding bookmarks - save URLs, search, tag, organize.

color-palette

Extract a color palette from an image and return HEX/RGB values with optional swatch image.

coloring-page

Turn an uploaded photo into a printable black-and-white coloring.

comfy-cli

Install, manage, and run ComfyUI instances.

comfyui

Send a workflow request to ComfyUI and return image results.

comfyui-imagegen

Generate images via ComfyUI API (localhost:8188) using Flux2 workflow.

cubistic-bot-runner

Run a polite Cubistic painter bot (public participation) using the Cubistic HTTP API (PoW challenge + /act).

cybercentry-private-data-verification

Cybercentry Private Data Verification on ACP - Real-time Zero-Knowledge Proof generation and text integrity.

data-viz

Create data visualizations from the command line.

depth-map-generation

Generate depth maps from images using each::sense AI.

didit-age-estimation

Integrate Didit Age Estimation standalone API to estimate a person's age from a facial image.

didit-passive-liveness

Integrate Didit Passive Liveness standalone API to verify a user is physically present.

digiforma

Query Digiforma training management platform via GraphQL API.

dxf-to-image

Convert DXF to PNG, JPG, or SVG for sharing.

e2ee

End-to-end encrypted messaging for AI agents.

eachlabs-face-swap

Swap faces between images using EachLabs AI.

eachlabs-fashion-ai

Generate fashion imagery, virtual try-on, runway videos.

eachlabs-image-edit

Edit, transform, upscale images using 200+ AI models.

eachlabs-image-generation

Generate images with Flux, GPT Image, Gemini, Imagen.

eachlabs-video-edit

Edit videos with lip sync, translation, subtitles.

eachlabs-video-generation

Generate videos from text/images using AI models.

emotionwise

Analyze text for emotions and sarcasm using the EmotionWise API (28 labels, EN/ES).

enginemind-eft

EFT — Emotional Framework Translator.

Excalidraw Flowchart

Create Excalidraw flowcharts from descriptions.

fal-ai

Generate images, videos, and audio via fal.ai API (FLUX, SDXL, Whisper, etc.).

fal-text-to-image

Generate, remix, and edit images using fal.ai's AI.

ffmpeg-video-editor

Generate FFmpeg commands from natural.

figma

Professional Figma design analysis and asset export.

find-stl

Search and download ready-to-print 3D model files (STL/3MF/ZIP)

foam-notes

Work with Foam note repositories.

gambling

Play casino games (dice, coinflip, roulette) on Agent Casino with real cryptocurrency.

gamma

Generate AI-powered presentations, documents, and social posts using Gamma.app.

generate-news-article

Generate individual Markdown articles from SerpAPI Google search results with images.

geo-blocking

Skills for geographic restrictions and regional compliance.

gifhorse

Search video dialogue and create reaction GIFs with timed subtitles.

gift-genius

Location-aware Valentine's Day gift finder.

giveagent

Agent-to-agent free item gifting.

google-gemini-media

Use the Gemini API.

google-imagen-3-portrait-photography

Generate professional portrait photography using Google Imagen 3.

grok-image-cli

Generate and edit images via Grok API from the command line.

grok-imagine-image-pro

Generates high-quality images with the xAI Grok/Flux API.

heygen-avatar-lite

Create AI digital human videos with HeyGen API.

hinge-liker

Automated Hinge dating profile liker using Android emulator + Gemini vision AI.

hinge-profile-optimizer

Comprehensive, research-backed Hinge dating profile optimization.

hotdog

Hot dog or not? Classify food photos and battle Nemotron.

idx-cma-report

Generate comparative market analysis (CMA) and home valuation reports from IDX listing data and selected comparable.

image-detection

Skills for analyzing and detecting AI-generated images.

image-gen

Generate images using multiple AI models — Midjourney (via Legnext.ai), Flux, SDXL, Nano Banana (Gemini)

image-hosting

Upload images to img402.dev and get a public URL.

image-magik-resize

Resize images using ImageMagick (CLI)

immich-api

Immich Photo Management API Bridge.

immortal

Empowers AI agents with crypto resource vitality assessment.

instagram-photo-text-overlay

Overlay text on photos for Instagram posts.

instagram-reels

Download Instagram Reels, transcribe audio, and extract captions.

install-then-update-trap-detector

Helps detect the install-then-update attack pattern — where a skill passes initial security review cleanly.

kai-tw-figma

Interact with the Figma REST API to read files, export layers/components as images, and retrieve comments.

kling-video-generator

Generate high-quality videos from text, images, or other videos using the Kling 3.0 Omni model. Covers text-to-video, image-to-video, video editing, video reference, multi-shot generation, and audio-synced video.

kie-ai-skill

Unified API access to multiple AI models via kie.ai - image generation (Nano Banana Pro, Flux, 4o-image) at 30-80%.

kraken-pro

Manage Kraken exchange accounts — portfolio, market data, trading, earn/staking, ledger export.

macos-local-voice

Local STT and TTS on macOS using native Apple capabilities.

media-writing

You are a professional media writing expert with extensive experience in creating engaging and impactful content.

medical-specialty-briefs

Generate daily or on-demand medical research briefs for any medical specialty.

memelink

Generate memes, image macros, and meme URLs from the terminal using the Memegen.link API.

minara

Crypto trading: swap, perps, transfer, pay, deposit (credit card / crypto), withdraw, AI chat, market discovery.

mindmap-generator

Generates visual mindmap images from conversations, goals, decisions, and daily priorities — delivered as PNG.

mixtiles-it

Send a photo to Mixtiles for ordering wall tiles.

moonfunsdk

Professional Python SDK for creating and trading Meme tokens on Binance Smart Chain with AI-powered image generation.

nanobanana-pro-fallback

Nano Banana Pro with auto model fallback — generate/edit images via Gemini Image API.

nk-images-search

Search 1+ million free high-quality AI stock photos.

nyne-deep-research

Research any person using the Nyne Deep Research API.

ocr-python

Optical Character Recognition (OCR) tool, supports Chinese and English text extraction from PDFs and images.

ollama-x-z-image-turbo

Generates images via Ollama (model `x/z-image-turbo`) and sends them over WhatsApp.

openai-image-cli

Generate, edit, and manage images via OpenAI's GPT Image and DALL-E models.

opencr-skill

Extract text from images, documents and scanned PDFs using OpenOCR - supports text detection, recognition.

opengfx

AI brand design system — logo systems, brand mascots, social assets, and on-brand marketing graphics via ACP or x402.

openindex

End-to-end encrypted messaging for AI agents.

openocr-skill

Extract text from images, documents and scanned PDFs using OpenOCR.

options-spread-conviction-engine

Multi-regime options spread analysis engine with quantitative rigor.

paddleocr-doc-parsing-v2

Parse documents using PaddleOCR's API.

paythefly

Create crypto payment & withdrawal links for your app.

photo-captions

Generate platform-tuned social media captions for photography.

photoshop-automator

Professional Adobe Photoshop automation via COM/ExtendScript bridge.

picsee-short-link

Shorten URLs using PicSee (pse.is)

pls-office-docs

Generate and manipulate office documents (PDF, DOCX, XLSX, PPTX) for professional reports, presentations, and data.

poidh

Post bounties and evaluate/accept winning submissions on poidh (pics or it didn't happen) on Base.

pokecenter

Launch your own Solana token for free.

popup-organizer

Search and hire mobile vendors for events on PopUp.

pr-generator

Generate QR codes from text, URLs, or images.

preisrunter

Search and compare grocery prices and promotions in Austria and Germany via the Preisrunter API.

publora-instagram

Post or schedule content to Instagram using the Publora API.

qr-gen

Generate QR codes from text, URLs, WiFi credentials, vCards, or any data.

quest-board

You are equipped with the Quest Board skill, a visual project dashboard.

quote0

Control MindReset Dot Quote/0 through the local quote0.js script and Dot Developer Platform APIs.

reepl

Manage your LinkedIn presence with Reepl -- create drafts, publish and schedule posts, manage contacts.

rent-a-human

Hire humans for physical-world tasks via RentAHuman.ai.

rent-a-person-ai

Hire humans for real-world tasks that AI can't do: deliveries, meetings, errands, photography, pet care.

rentahuman

Hire humans for physical-world tasks via RentAHuman.ai.

research-library

Local-first multimedia research library for hardware projects.

rollhub-affiliate

Earn crypto promoting provably fair AI casino.

rollhub-analyst

Research and backtest gambling strategies on provably fair crypto casino.

rug-checker

Solana token rug-pull risk analysis. 10-point on-chain check with visual report.

saa-agent

Enables AI agents to generate images using the Character Select Stand Alone App (SAA) image generation backend.

shopify-bulk-upload

Bulk upload products to Shopify stores.

skill-1

Generate QR codes from text, URLs, WiFi credentials, vCards, or any data.

snapog

Generate social images and OG cards from professional templates via the SnapOG API.

solo-humanize

Strip AI writing patterns from text — em dashes, stock phrases, promotional inflation, performed authenticity.

sprite-animator

Generate animated pixel art sprites from any image using AI.

subtitle-translate-skill

Translate SRT subtitle files using LLM APIs with OpenAI-compatible format.

superpower

When to use: User has a task they want to do or want you to do, or they feel frustrated, upset, stressed.

svg-to-image

Convert SVG to PNG or JPG for quick sharing.

tarot

A reflective tarot draw for emotional support (presence-first, non-clinical, non-predictive).

telegram-media

You MUST actually execute every command using your shell/exec tool. Never pretend you sent a photo, voice note.

telegram-voice-to-voice-macos

Telegram voice-to-voice for macOS Apple Silicon: transcribe inbound .ogg voice notes with yap (Speech.framework)

tesseract-ocr

Extract text from images using the Tesseract OCR engine directly via command line.

titleclash

Compete in TitleClash - write creative titles for images and win votes.

tuebingen-weather-graphics

Generate and send a 5-day Tübingen weather graphic (PNG) from open-meteo.com.

tv-strategy-settings

Open and modify TradingView strategy settings on the current chart page.

twinfold

Control Twinfold — AI-powered social media content platform — from your agent.

ub2-csv-data-analyzer

A skill that enables Claw to load, explore, analyze, and visualize CSV datasets, providing statistical insights.

unsplash

Search, browse, and download high-quality free photos from Unsplash's library of millions of images.

visualization

AI-driven professional data visualization for financial analysis.

vtl-image-analysis

Measure compositional structure in AI-generated images using the Visual Thinking Lens (VTL) framework.

x-founder-operations

Systematic X (Twitter) operations skill for founders, indie developers, and tech professionals.

xiaohongshu-title

Maximize CTR (Click-Through Rate) by leveraging emotional hooks and platform algorithms.

xpr-creative

Creative deliverable tools for AI agents.

youtube-thumbnail-generation

Generate click-worthy YouTube thumbnails with high CTR designs using each::sense API.

zenmux-image-generation

Generate images via ZenMux API (Pro/Elite)

zerox

Convert documents (PDF, DOCX, PPTX, images, etc.) to Markdown using the zerox library.

zhipu-cogview-image

Generate images using Zhipu AI's CogView model.

creaa-ai

Generate and edit images + generate videos via Creaa API (Nano Banana 2, Sora 2, Seedance 2.0, Veo 3.1).