Context: N/A
Input $/1M: TBD
Output $/1M: TBD
Type: automatic-speech-recognition
License: Open Source
Benchmarks: 1 tested
About
Microsoft automatic speech recognition model. 308K downloads on Hugging Face.
Tested on 1 benchmark with a 0.0% average. Top score: Artificial Analysis Quality Index (10.0%).
Capabilities
speed: 16.7 (#57 globally)
Benchmark Scores
Tested on 1 benchmark · Ranked across 1 category
[Chart: Score Distribution (all 231 models), axis 0 to 100]
speed
Artificial Analysis Quality Index: 10.0
Composite quality score combining multiple benchmark results into a single metric.
Score bands: Excellent (85+), Good (70-85), Average (50-70), Below (<50)
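The score bands in the legend above can be read as a simple threshold lookup. The following is a minimal sketch, not BenchGecko's own code: the function name `score_band` is hypothetical, and because the legend lists 70 and 85 in two bands each, the sketch assumes a boundary value belongs to the higher band.

```python
def score_band(score: float) -> str:
    """Map a 0-100 benchmark score to the band name used in the legend."""
    if not 0.0 <= score <= 100.0:
        raise ValueError(f"score must be within 0-100, got {score}")
    # Assumption: a boundary value (50, 70, 85) falls into the higher band.
    if score >= 85:
        return "Excellent"
    if score >= 70:
        return "Good"
    if score >= 50:
        return "Average"
    return "Below"


# The Quality Index score listed on this page is 10.0.
print(score_band(10.0))  # prints "Below"
```

Under these assumptions, the listed Quality Index score of 10.0 falls in the "Below" band.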
BenchGecko API: microsoft-phi-4-multimodal-instruct
Specifications
- Type: automatic-speech-recognition
- Context: N/A
- Released: Feb 2025
- License: Open Source
- Status: Active
Frequently Asked Questions
Phi 4 Multimodal Instruct is an open-source automatic speech recognition (ASR) model by Microsoft, released in February 2025.