Models (627)

Active filters: multimodal

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • Updated 8 days ago • 3.39M • 667

Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • Updated 27 days ago • 1.02M • 264

Qwen/Qwen2.5-VL-72B-Instruct

Image-Text-to-Text • Updated 7 days ago • 302k • 370

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • Updated Feb 6 • 1.32M • 1.15k

Minthy/ToriiGate-v0.4-7B

Image-Text-to-Text • Updated Jan 22 • 244 • 23

bytedance-research/UI-TARS-7B-DPO

Image-Text-to-Text • Updated Jan 25 • 18.7k • 156

Qwen/Qwen2.5-VL-7B-Instruct-AWQ

Image-Text-to-Text • Updated 26 days ago • 169k • 38

lmms-lab/LLaVA-Video-7B-Qwen2

Video-Text-to-Text • Updated Oct 25, 2024 • 61.8k • 82

Qwen/Qwen2.5-VL-3B-Instruct-AWQ

Image-Text-to-Text • Updated 26 days ago • 24k • 24

Qwen/Qwen2.5-VL-72B-Instruct-AWQ

Image-Text-to-Text • Updated 7 days ago • 168k • 40

jinaai/jina-clip-v2

Zero-Shot Image Classification • Updated 7 days ago • 95.6k • 203

NCSOFT/VARCO-VISION-14B

Image-Text-to-Text • Updated Dec 31, 2024 • 889 • 26

Minthy/ToriiGate-v0.4-2B

Image-Text-to-Text • Updated Jan 19 • 198 • 9

huihui-ai/Qwen2.5-VL-3B-Instruct-abliterated

Image-Text-to-Text • Updated 5 days ago • 1.31k • 8

huihui-ai/Qwen2.5-VL-7B-Instruct-abliterated

Image-Text-to-Text • Updated 5 days ago • 22.7k • 6

openvla/openvla-7b

Image-Text-to-Text • Updated Sep 16, 2024 • 108k • 100

lmms-lab/llava-onevision-qwen2-7b-ov

Text Generation • Updated Sep 2, 2024 • 111k • 48

robotics-diffusion-transformer/rdt-1b

Robotics • Updated Oct 17, 2024 • 3.07k • 76

Qwen/Qwen2-VL-7B

Image-Text-to-Text • Updated Jan 12 • 77.5k • 48

erax-ai/EraX-VL-7B-V1.0

Image-Text-to-Text • Updated Jan 15 • 1.05k • 37

rhymes-ai/Aria

Image-Text-to-Text • Updated Jan 27 • 21.9k • 619

NexaAIDev/OmniVLM-968M

Updated Dec 17, 2024 • 1.37k • 513

CogACT/CogACT-Base

Robotics • Updated Dec 4, 2024 • 6.48k • 11

lmstudio-community/Qwen2-VL-7B-Instruct-GGUF

Image-Text-to-Text • Updated Jan 6 • 7.54k • 5

GoodiesHere/Apollo-LMMs-Apollo-7B-t32

Video-Text-to-Text • Updated Dec 18, 2024 • 527 • 54

bytedance-research/UI-TARS-72B-DPO

Image-Text-to-Text • Updated Jan 25 • 25.5k • 97

unsloth/Qwen2.5-VL-7B-Instruct-unsloth-bnb-4bit

Image-Text-to-Text • Updated 5 days ago • 32.9k • 21

imageomics/bioclip

Zero-Shot Image Classification • Updated May 17, 2024 • 52.7k • 46

HuggingFaceM4/idefics-80b

Text Generation • Updated Oct 12, 2023 • 153 • 70

HuggingFaceM4/idefics-80b-instruct

Text Generation • Updated Oct 12, 2023 • 3.59k • 184