Beta
Home/Models/Phi 4 Multimodal Instruct
Microsoft logo

Phi 4 Multimodal Instruct

by Microsoft · Released Feb 2025

Open Source
Compare
Context
N/A
Input $/1M
TBD
Output $/1M
TBD
Type
automatic-speech-recognition
License
Open Source
Benchmarks
1 tested
Data updated today
About

Microsoft automatic speech recognition model. 308K downloads on HuggingFace.

Tested on 1 benchmarks with 0.0% average. Top scores: Artificial Analysis — Quality Index (10.0%).

Capabilities
speed
16.7
#57 globally
Benchmark Scores
Compare All
Tested on 1 benchmarks · Ranked across 1 categories
Score Distribution (all 231 models)
0255075100
Artificial Analysis — Quality Index

Artificial Analysis Quality Index. Composite quality score combining multiple benchmark results into a single metric.

10.0
Excellent (85+) Good (70-85) Average (50-70) Below (<50)
Links
Documentation
Community
BenchGecko API
microsoft-phi-4-multimodal-instruct
Specifications
  • Typeautomatic-speech-recognition
  • ContextN/A
  • ReleasedFeb 2025
  • LicenseOpen Source
  • StatusActive
Available On
Microsoft logoMicrosoftTBD
Categories
Share & Export
Tweet
Phi 4 Multimodal Instruct is an open-source automatic-speech-recognition AI model by Microsoft, released in February 2025.