Context
128K tokens (~64 books)
Input $/1M
$0.10
Output $/1M
$0.20
Type
multimodal
License
Open Source
Benchmarks
0 tested
Data updated today
About
UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement...
No benchmark data available yet.
Links
Research
Documentation
Community
Source Code
BenchGecko API
ui-tars-1-5-7b
Specifications
- Typemultimodal
- Context128K tokens (~64 books)
- ReleasedJul 2025
- LicenseOpen Source
- StatusActive
- Cost / Message~$0.000
Available On
Share & Export
Frequently Asked Questions
UI-TARS 7B is an open-source multimodal AI model by bytedance, released in July 2025. Context window: 128K tokens.