ai-multimodal
vikit-aiAnalyze images/audio/video with Gemini API. Generate images (Imagen 4, Nano Banana 2), videos (Veo 3, Hailuo), speech (MiniMax TTS), music (MiniMax).
Usage
/vk:ai:ai-multimodal- Analyze images/audio/video with Gemini API. Generate images (Imagen 4, Nano Banana 2), videos (Veo 3, Hailuo), speech (MiniMax TTS), music (MiniMax).
Examples
- Vision analysis, OCR, transcription, AI media generation