#Multimodal AI

19 tools tagged “Multimodal AI


Showing 19 tools

#1

GenMix

All-in-one AI platform with 20+ models for video, image, and voice creation

4.7 (6)
Freemium
#2

wan2-6.org

Multimodal AI video generator producing 1080p clips from text, images, and reference inputs.

4.8 (4)
Free
#3

Reka AI

Multimodal foundation models that understand text, images, video, and audio.

5.0 (5)
Freemium
#4

AMIE

A multimodal AI diagnostic agent that conducts clinical conversations and interprets medical images for accurate diagnoses.

4.7 (6)
Free
#5

chris li

Multimodal AI video generator that turns text, images, or audio into short videos.

4.5 (6)
Free
#6

Zenor AI

Multimodal AI shopping assistant for Shopify stores using text, voice, and image.

4.8 (6)
Free
#7

OpenAdapt

An open-source framework automating desktop workflows using large multimodal models.

5.0 (5)
Freemium
#8

Gemini Omni

Multimodal AI for generating, editing, and rendering production-ready video.

4.0 (4)
Free
#9

evolink

Unified API for multimodal AI across chat, image, and video models.

4.8 (5)
Freemium
#10

Seedance

Multimodal AI platform for text‑to‑video, image‑to‑video, text‑to‑image, and voiceover creation.

4.6 (5)
Free
#11

HappyHorse-model

Multimodal AI platform for generating videos, images, and audio from text or media prompts.

4.8 (6)
Free
#12

Wan 2.7 AI Video Generator

Multimodal AI platform for generating consistent, controllable video from text, images, and references.

4.2 (6)
Free
#13

Unitree R1

Compact 26-joint humanoid robot with multimodal AI for research and education

4.5 (4)
Paid
#14

C Dance ai

Multimodal AI video generator that turns text, images, audio, and clips into dance videos.

4.5 (4)
Free
#15

Google Gemini 2.0

Google's multimodal AI model built for agentic tasks, reasoning, and native tool use.

4.8 (4)
Freemium
S
#16

Self-Operating Computer

Open-source AI agent that operates your computer through screen vision and mouse/keyboard control.

4.7 (6)
Freemium
#17

Gemini

Google's multimodal AI model family with long-context understanding and MoE architecture.

4.2 (5)
Freemium
#18

AssiPilot

All-in-one AI assistant for creating images, videos, voiceovers, and music

4.6 (5)
Freemium
#19

Seedance 2 AI Video Generator

Multimodal AI video generator that turns text, images, and audio into short cinematic clips.

4.5 (6)
Free