Earlier this month, Microsoft unveiled a new benchmark called Windows Agent Arena, designed to provide a platform for testing AI agents in realistic Windows operating system environments. Early ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results