Padding of labels bug?
1
#44 opened about 21 hours ago
by
haukurpj
Detected version 0.0.0. Error: FlashAttention2
3
#43 opened 1 day ago
by
the1Domo
Experience with Phi-4-Multimodal vs. Whisper-1 for Speech-to-Text
1
#39 opened 3 days ago
by
hdevio
Questions about fine-tuning strategy and hyperparameters for Korean ASR/AST tasks
2
#37 opened 5 days ago
by
junnei
Does the model support beam search for ASR?
6
#31 opened 8 days ago
by
h9LtLSb
`tokenizer.model` is missing
1
#30 opened 8 days ago
by
happyme531

Getting Bounding Boxes for Vision
1
#29 opened 8 days ago
by
sujan2023
How to use this model as `Real Time API` may be using `WebSockets`?
#24 opened 11 days ago
by
mahimairaja

how to input a video?
9
#16 opened 11 days ago
by
maltoseflower
Error during inference with image and text.
5
#12 opened 14 days ago
by
aarbelle

llama.cpp support when?
10
#7 opened 15 days ago
by
alanzhuly

How to use it with LM Studio?
6
#3 opened 15 days ago
by
neokiller62
thanks , how to fine tune?
15
#1 opened 16 days ago
by
NickyNicky
