ai-multimodal

vikit-ai

Analyze images/audio/video with Gemini API. Generate images (Imagen 4, Nano Banana 2), videos (Veo 3, Hailuo), speech (MiniMax TTS), music (MiniMax).

Usage

/vk:ai:ai-multimodal
  • Analyze images/audio/video with Gemini API. Generate images (Imagen 4, Nano Banana 2), videos (Veo 3, Hailuo), speech (MiniMax TTS), music (MiniMax).

Examples

  • Vision analysis, OCR, transcription, AI media generation

Related