microsoft/Phi-4-multimodal-instruct · Discussions

Padding of labels bug?

#44 opened about 21 hours ago by

Detected version 0.0.0. Error: FlashAttention2

#43 opened 1 day ago by

Experience with Phi-4-Multimodal vs. Whisper-1 for Speech-to-Text

#39 opened 3 days ago by

Questions about fine-tuning strategy and hyperparameters for Korean ASR/AST tasks

#37 opened 5 days ago by

Does the model support beam search for ASR?

#31 opened 8 days ago by

`tokenizer.model` is missing

#30 opened 8 days ago by

Getting Bounding Boxes for Vision

#29 opened 8 days ago by

How to use this model as a `Real Time API`, maybe using `WebSockets`?

#24 opened 11 days ago by

How to input a video?

#16 opened 11 days ago by

Error during inference with image and text.

#12 opened 14 days ago by

llama.cpp support when?

#7 opened 15 days ago by

How to use it with LM Studio?

#3 opened 15 days ago by

Thanks, how to fine-tune?

#1 opened 16 days ago by