Home/Models/UI-TARS 7B
bytedance logo

UI-TARS 7B

by bytedance · Released Jul 2025

Open SourceMultimodal
Compare
Context
128K tokens (~64 books)
Input $/1M
$0.10
Output $/1M
$0.20
Type
multimodal
License
Open Source
Benchmarks
0 tested
Data updated today
About

UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement...

No benchmark data available yet.

Links
Documentation
Community
BenchGecko API
ui-tars-1-5-7b
Specifications
  • Typemultimodal
  • Context128K tokens (~64 books)
  • ReleasedJul 2025
  • LicenseOpen Source
  • StatusActive
  • Cost / Message~$0.000
Available On
bytedance logobytedance$0.10
Share & Export
Tweet
UI-TARS 7B is an open-source multimodal AI model by bytedance, released in July 2025. Context window: 128K tokens.